Discourse Data in DiET

Fiche du document

Date

1999

Discipline
Type de document
Périmètre
Langue
Identifiants
Collection

Archives ouvertes

Licence

info:eu-repo/semantics/OpenAccess



Sujets proches En

Assessment diet

Citer ce document

Ian Lewin et al., « Discourse Data in DiET », HAL-SHS : linguistique, ID : 10670/1.3cnbul


Métriques


Partage / Export

Résumé En

The DiET project provides systematically constructed and annotated test items and associated tools, enabling fast system debugging and evaluation, and automatic linkage from test items to real corpora instances. This paper concentrates on the discourse test suite and its use. The discourse test suite covers discourse phenomena such as pronouns, def-inites and ellipsis. These can be used to evaluate the coverage and accuracy of implementations of anaphora resolution algorithms. We also examine the text prooling support within the Diet tools. Text Prooling identiies typical and salient corpus characteristics, e.g. the frequency and distribution of part of speech tags and vocabulary richness. Prooling also provides candidate sentences instantiating predeened syntactic phenomena. Prooling enables users to select test-items appropriate to their domain speciic corpus. The paper shows how the corpus search engine can be used to identify discourse phenomena in a corpus and presents concrete results of this evaluation scenario.

document thumbnail

Par les mêmes auteurs

Sur les mêmes sujets

Sur les mêmes disciplines

Exporter en