Infrastructure for African Languages
Increasing representation of African languages in NLP, LLMs, and AI
African languages continue to lag behind amid rapid advances in language technology and LLM-driven applications. Princeton’s Infrastructure for African Languages initiative aims to increase representation of African languages while also foregrounding their values and culture.
This group runs a series of projects and courses and hosts speakers to generate campus-wide discussion about this important topic.
Contact CDH postdoc Happy Buzaaba to learn more and get involved in this research group.
Related projects
African_UD: Universal Dependencies Treebank for African Languages
Increasing the representation of African languages in NLP by creating quality datasets for eleven African languages
African Languages in the Age of AI (AAA) Speaker Series
Bringing leading scholars to Princeton to discuss the opportunities and challenges of developing technologies that empower African languages
Computational Approaches to Nigerian Literature
Experiments in NLP for texts in Yoruba and Efik