Ambiguity rates: Automatic analysis of French text corpora and computation of ambiguity rates for different tagsets

Fiche du document

Date

1996

Discipline
Type de document
Périmètre
Langue
Identifiants
Collection

Archives ouvertes

Licence

info:eu-repo/semantics/OpenAccess



Sujets proches En

Frenchmen (French people)

Citer ce document

Eric Laporte et al., « Ambiguity rates: Automatic analysis of French text corpora and computation of ambiguity rates for different tagsets », HAL SHS (Sciences de l’Homme et de la Société), ID : 10670/1.43e463...


Métriques


Partage / Export

Résumé En

We analysed a French textual corpus in order to evaluate its rate of lexical ambiguity (number of lexical tags per word). Since this rate theoretically depends on the tagset and on whether compounds are delimited by tagging, the experiment was repeated with eight different tagsets. The results show that, although the information content of the tags is very different depending on the tagsets, the variation of the rate of lexical ambiguity is limited: when one shifts from the least to the most informative of the tagsets, the rate increases only from 1.6 to 2.0 tags per word.

document thumbnail

Par les mêmes auteurs

Sur les mêmes sujets

Sur les mêmes disciplines