27 septembre 2021
info:eu-repo/semantics/OpenAccess
Thomas Gaillat et al., « A data repository for the management of dynamic linguistic datasets », HAL-SHS : sciences de l'éducation, ID : 10670/1.9fb788
This paper addresses the issue of using Nakala, a dynamic database technology, for the management of language corpora. We present our ongoing attempt at storing and classifying multimedia documents of a corpus of language learner oral and written productions with universal resource identifiers. The architecture supports query APIs compatible with R packages and other tools which will facilitate the generation of linguistically enriched datasets for a more effective corpus-based study of language acquisition.