A Fuzzy Decision Strategy for Topic Identification and Dynamic Selection of Language Models

Fiche du document

Date

2000

Type de document
Périmètre
Langue
Identifiants
Collection

Archives ouvertes




Citer ce document

Brigitte Bigi et al., « A Fuzzy Decision Strategy for Topic Identification and Dynamic Selection of Language Models », HAL-SHS : sciences de l'information, de la communication et des bibliothèques, ID : 10670/1.4hrnwb


Métriques


Partage / Export

Résumé En

The paper introduces a new effective model for topic recognition. The model follows a multi-expert decision paradigm based on fuzzy relations in which fuzzy variables express degrees of reliability of expert decision. Heterogeneous measures are integrated by the fuzzy relations whose structure and components may evolve in time. Experiments resulted in more than 80% topic classification accuracy on articlesof the French newspaper Le Monde which describe a very large variety of facts with a very large vocabulary (of the order of 500,000 words). Experiments show a significant improvement when the above mentioned integration of multi-expert decision is used. A robust strategy for dynamic Language Model (LM) selection, based on topic recognition and switching between topic models, is proposed. It is effective because it relies on a small set of well trained topic-dependent LMs and on reliable topic recognition. By using perplexity as a performance measure of the LM switching model, a tangible reduction is observed with respect to the use of a single, general, static LM.

document thumbnail

Par les mêmes auteurs

Sur les mêmes sujets

Sur les mêmes disciplines

Exporter en