Coreference Resolution for French Oral Data: Machine Learning Experiments with ANCOR

Fiche du document

Date

3 avril 2016

Discipline
Type de document
Périmètre
Langue
Identifiants
Collection

Archives ouvertes

Licence

info:eu-repo/semantics/OpenAccess




Citer ce document

Adèle Désoyer et al., « Coreference Resolution for French Oral Data: Machine Learning Experiments with ANCOR », HAL-SHS : linguistique, ID : 10670/1.7vbdbo


Métriques


Partage / Export

Résumé En

We present CROC (Coreference Resolution for Oral Corpus), the first machine learning system for coreference resolution in French. One specific aspect of the system is that it has been trained on data that come exclusively from transcribed speech, namely ANCOR (ANaphora and Coreference in ORal corpus), the first large-scale French corpus with anaphorical relation annotations. In its current state, the CROC system requires pre-annotated mentions. We detail the features used for the learning algorithms, and we present a set of experiments with these features. The scores we obtain are close to those of state-of-the-art systems for written English.

document thumbnail

Par les mêmes auteurs

Sur les mêmes sujets

Sur les mêmes disciplines

Exporter en