11 mai 2020
info:eu-repo/semantics/OpenAccess
Lydia-Mai Ho-Dac et al., « E:Calm Resource: a Resource for Studying Texts Produced by French Pupils and Students », HAL-SHS : linguistique, ID : 10670/1.c9me05
TheÉ:CALM resource is constructed from French student texts produced in a variety of usual contexts of teaching. The distinction of theÉ:CALM resource is to provide an ecological data set that gives a broad overview of texts written at elementary school, high school and university. This paper describes the whole data processing: encoding of the main graphical aspects of the handwritten primary sources according to the TEI-P5 norm; spelling standardizing; POS tagging and syntactic parsing evaluation.