Hao Tan

Research Software Engineer

M.S., Computer and Information Technology, University of Pennsylvania

M.A., East Asian Languages and Civilizations, University of Pennsylvania

B.A., Interdisciplinary Studies, Duke University

Machine Learning
Natural Language Processing
Large Language Models (LLMs)
Cultural Heritage
Multilingual DH
Hao Tan

As a Research Software Engineer at Princeton’s Center for Digital Humanities (CDH), Hao Tan works with the team and partners with faculty to create custom research software in support of CDH Research Partnerships. With training in both Computer Science and East Asian Studies, she brings a multilingual, cross-cultural perspective to digital humanities work and consults with the Princeton community on a wide range of computational projects—particularly those applying machine learning to non-alphabetic texts and cultural heritage materials.

Before joining CDH, Hao earned dual Master’s degrees in Computer and Information Technology and East Asian Languages and Civilizations from the University of Pennsylvania. She previously collaborated with the Research Center for Digital Humanities at Peking University and published research in Digital Humanities (Shuzi Renwen).

At CDH, Hao contributes to projects involving large language models, text analytics, and generative AI. She is particularly interested in expanding the digital humanities toolkit through emerging technologies such as multimodal models and computer vision, and in exploring new applications of LLMs for cultural memory, storytelling, and computational reading.

Related projects

Citing Marx

Identifying Marx citations within Die Neue Zeit

Built by CDH
marx