E:Calm Resource: a Resource for Studying Texts Produced by French Pupils and Students

Document record

Date

11 May 2020

Collection

Open archives

License

info:eu-repo/semantics/OpenAccess




Cite this document

Lydia-Mai Ho-Dac et al., « E:Calm Resource: a Resource for Studying Texts Produced by French Pupils and Students », HAL-SHS : linguistique, ID : 10670/1.c9me05



Abstract

The É:CALM resource is built from texts written by French pupils and students in a variety of ordinary teaching contexts. Its distinctive feature is that it provides an ecological data set giving a broad overview of texts written at elementary school, high school, and university. This paper describes the full data-processing pipeline: encoding the main graphical features of the handwritten primary sources according to the TEI P5 standard, standardizing spelling, and evaluating POS tagging and syntactic parsing.
