Building, Encoding, and Annotating a Corpus of Parliamentary Debates in TEI XML: A Cross-Linguistic Account

Document record

Date

22 June 2022

Relations

This document is related to:
info:eu-repo/semantics/reference/issn/2162-5603

Organization

OpenEdition

Licenses

https://creativecommons.org/licenses/by/4.0/ , info:eu-repo/semantics/openAccess



Related subjects (English)

Papers

Cite this document

Naomi Truan et al., "Building, Encoding, and Annotating a Corpus of Parliamentary Debates in TEI XML: A Cross-Linguistic Account", Journal of the Text Encoding Initiative, ID: 10.4000/jtei.4164



Abstract

This paper introduces an integrative and comprehensive method for the linguistic annotation of parliamentary discourse. Initially conceived as documentation for a specific, small-scale research project, the annotation scheme takes national specificities into account while aiming to be both highly standardized and adaptable to other research contexts. We present an application of the Text Encoding Initiative (TEI) framework to a subset of official transcripts of plenary proceedings in three parliamentary cultures. The TEI annotation scheme proposed here has two main applications: first, it serves as a basis for encoding parliamentary corpora by providing a systematic way of annotating both elements within the text (e.g., turns, incidents, and interruptions) and the associated metadata (e.g., variables pertaining to the speaker or the speech event); second, it provides a cross-linguistic empirical basis for further annotation projects.
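
To make the two layers named in the abstract concrete (in-text elements such as turns and incidents, and speaker metadata), the following is a minimal sketch in Python. It is not the authors' scheme: the sample fragment uses generic TEI elements (`u`, `incident`, `person`), the speaker names, IDs, and the `ana="#interruption"` pointer are hypothetical, and the fragment is not schema-valid because the mandatory `fileDesc` is omitted for brevity. The helper names `speakers` and `events` are likewise assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
XML_NS = "http://www.w3.org/XML/1998/namespace"
NS = {"tei": TEI_NS}

# Hypothetical fragment: all names, IDs, and text are invented for illustration.
SAMPLE = """<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <teiHeader>
    <profileDesc>
      <particDesc>
        <listPerson>
          <person xml:id="spk1" sex="F"><persName>Speaker One</persName></person>
          <person xml:id="spk2" sex="M"><persName>Speaker Two</persName></person>
        </listPerson>
      </particDesc>
    </profileDesc>
  </teiHeader>
  <text>
    <body>
      <div type="debate">
        <u who="#spk1">Hypothetical opening statement.</u>
        <incident type="applause"><desc>Applause</desc></incident>
        <u who="#spk2" ana="#interruption">Hypothetical interjection.</u>
      </div>
    </body>
  </text>
</TEI>"""


def speakers(root):
    """Map each person's xml:id to the metadata recorded in the header."""
    people = {}
    for person in root.iterfind(".//tei:person", NS):
        pid = person.get(f"{{{XML_NS}}}id")
        name = person.findtext("tei:persName", default="", namespaces=NS)
        people[pid] = {"name": name, "sex": person.get("sex")}
    return people


def events(root):
    """List turns and incidents in document order, resolving speaker pointers."""
    people = speakers(root)
    result = []
    for el in root.iterfind(".//tei:div/*", NS):
        tag = el.tag.split("}", 1)[1]  # strip the namespace part of the tag
        if tag == "u":
            pid = el.get("who", "").lstrip("#")
            name = people.get(pid, {}).get("name")
            result.append(("turn", name, (el.text or "").strip()))
        elif tag == "incident":
            desc = el.findtext("tei:desc", default="", namespaces=NS)
            result.append(("incident", el.get("type"), desc))
    return result


if __name__ == "__main__":
    for event in events(ET.fromstring(SAMPLE)):
        print(event)
```

Keeping speaker variables in the header and pointing to them from each turn via `who` is one common way to avoid repeating metadata on every utterance; whether the paper's scheme makes the same choice is not stated in this record.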
