Coreference Resolution for French Oral Data: Machine Learning Experiments with ANCOR

Adèle Désoyer et al., « Coreference Resolution for French Oral Data: Machine Learning Experiments with ANCOR », HAL-SHS : linguistique, ID : 10670/1.7vbdbo

Partage / Export

Résumé En

We present CROC (Coreference Resolution for Oral Corpus), the first machine learning system for coreference resolution in French. One specific aspect of the system is that it has been trained on data that come exclusively from transcribed speech, namely ANCOR (ANaphora and Coreference in ORal corpus), the first large-scale French corpus with anaphorical relation annotations. In its current state, the CROC system requires pre-annotated mentions. We detail the features used for the learning algorithms, and we present a set of experiments with these features. The scores we obtain are close to those of state-of-the-art systems for written English.

Coreference Resolution for French Oral Data: Machine Learning Experiments with ANCOR

Fiche du document

Mots-clés En

Sujets proches En

Citer ce document

Métriques

Partage / Export

Résumé En

Par les mêmes auteurs

Sur les mêmes sujets

Sur les mêmes disciplines

Exporter en