Identifying Strategic Information from Scientific Articles through Sentence Classification.

Fiche du document

Date

26 mai 2008

Type de document
Périmètre
Langue
Identifiants
Collection

Archives ouvertes

Licence

info:eu-repo/semantics/OpenAccess




Citer ce document

Fidelia Ibekwe-Sanjuan et al., « Identifying Strategic Information from Scientific Articles through Sentence Classification. », HAL-SHS : sciences de l'information, de la communication et des bibliothèques, ID : 10670/1.gmhmyv


Métriques


Partage / Export

Résumé En

We address here the need to assist users in rapidly accessing the most important or strategic information in the text corpus by identifying sentences carrying specific information. More precisely, we want to identify contribution of authors of scientific papers through a categorization of sentences using rhetorical and lexical cues. We built local grammars to annotate sentences in the corpus according to their rhetorical status: objective, new things, results, findings, hypotheses, conclusion, related_word, future work. The annotation is automatically projected automatically onto two other corpora to test their portability across several domains. The local grammars are implemented in the Unitex system. After sentence categorization, the annotated sentences are clustered and users can navigate the result by accessing specific information types. The results can be used for advanced information retrieval purposes.

document thumbnail

Par les mêmes auteurs

Sur les mêmes sujets

Sur les mêmes disciplines

Exporter en