The notion of sentence and other discourse units in corpus annotation

Fiche du document

Date

14 novembre 2014

Discipline
Périmètre
Langue
Identifiants
Relations

Ce document est lié à :
info:eu-repo/semantics/altIdentifier/doi/10.1075/scl.61.12pie

Collection

Archives ouvertes

Licence

info:eu-repo/semantics/OpenAccess


Sujets proches En

Units Measurement, Units of

Citer ce document

Paola Pietrandrea et al., « The notion of sentence and other discourse units in corpus annotation », HAL SHS (Sciences de l’Homme et de la Société), ID : 10.1075/scl.61.12pie


Métriques


Partage / Export

Résumé En

The notion of sentence-as it is defined in syntactic, semantic, graphic and prosodic terms-is not a suitable maximal unit for the prosodic and syntactic annotation of spoken corpora. Still, this notion is taken as a reference in many syntactic and prosodic annotation systems. We present here the modular approach we adopted for the annotation of the Rhapsodie corpus of spoken French, which led us to distinguish three types of elementary units operating in discourse (government units, illocutionary units, and intonational periods) and to annotate them separately. We describe the types of interactions identified among these various levels of cohesion. On this basis we propose a reappraisal of the traditional notion of sentence and we define two additional types of discourse units that we consider as the minimal and the maximal span for the notion of sentence.

document thumbnail

Par les mêmes auteurs

Sur les mêmes sujets

Sur les mêmes disciplines