GOOFRE version 2

Etienne Brunet; Laurent Vanni

Document record

Authors

Date

3 June 2014

Document type

Conferences and symposia

Scope

Publications

Language

French

Identifiers

handle: 10670/1.liia3b
hal: hal-01196595

Source

HAL-SHS : linguistique

Collection

Open archives

Organization

Centre pour la communication scientifique directe

License

info:eu-repo/semantics/OpenAccess

Keywords (English)

statistics, logometry, computer science

Related subjects (English)

Tools, Hand tools, Handtools

Cite this document

Etienne Brunet et al., "GOOFRE version 2", HAL-SHS : linguistique, ID: 10670/1.liia3b

Abstract (English)

The amount of data in Google Books has doubled over the last two years and now exceeds 500 billion words. The data have been reprocessed, including a re-examination of the scanned images that yields more accurate text recognition. In addition, for the first time, the included texts have been disambiguated and lemmatised. Finally, the Culturomics website has made tools available that make the data easier to access. It therefore seemed worthwhile to carry out a new assessment and to create a new database, complete with all the necessary statistical tools, available online or locally, for exploiting such large corpora.
