Projet ROSETTA

Projet Projet international

ResOurceS for Andagered languages Through TranslAted texts

Titre : ResOurceS for Andagered languages Through TranslAted texts

Durée : 12 mois 

Date de début et fin : Du 01/09/2018 - 03/08/2019

Résumé : Out of the world’s 6000+ languages only a small fraction currently enjoys the benefits of modern information technologies. Languages left behind are called endangered, indigenous or technologically low-resourced (even though they may have millions of speakers). This collaborative and interdisciplinary digital humanities research project aims to help salvage those languages by combining computational linguistics, Library and Information Science, American Literature, and Translation Studies.

Very much as the Rosetta stone helped decipher the demotic and hieroglyphic scripts thanks to the presence of the Greek translation, our project intends to preserve contemporary endangered languages and assist with their survival through translation.

Our project puts to use the extant translated versions of a single fictional text into a number of low-resourced languages spanning a period of nearly a century and a half. The project relies on the involvement of humans for data collection while natural language processing tools generate language resources (corpora, dictionaries, thesauri, lexicons) for those endangered languages.

Contact (s) : Amel Fraisse 

Partenariat (s) : Stanford University