7 May 2018
info:eu-repo/semantics/OpenAccess
Kyungtae Lim et al., "Multilingual Dependency Parsing for Low-Resource Languages: Case Studies on North Saami and Komi-Zyrian", HAL-SHS : sciences de l'information, de la communication et des bibliothèques, ID: 10670/1.io5wrc
The paper presents a method for parsing low-resource languages with very small training corpora, using multilingual word embeddings and annotated corpora of better-resourced languages. The study demonstrates that specific language combinations improve dependency parsing compared to previous work, allowing wider reuse of pre-existing resources when parsing low-resource languages. The study also explores whether contemporary contact languages or genetically related languages are the more fruitful starting point for multilingual parsing scenarios.