Combined models for topic spotting and topic-dependent language modeling

Brigitte Bigi; Renato de Mori; Marc El-Bèze; Thierry Spriet

Combined models for topic spotting and topic-dependent language modeling

Fiche du document

Auteurs

Date

1997

Discipline

Sciences de l'information et de la communication

Type de document

Colloques et conférences

Périmètre

Publications

Langue

Anglais

Identifiants

Source

HAL-SHS : sciences de l'information, de la communication et des bibliothèques

Relations

Ce document est lié à :
info:eu-repo/semantics/altIdentifier/doi/10.1109/ASRU.1997.659133

Collection

Archives ouvertes

Organisation

Centre pour la communication scientifique directe

Licence

info:eu-repo/semantics/OpenAccess

Mots-clés En

Topic identification Speech & Image processing system language modelling

Sujets proches En

Language (New words, slang, etc.) Probability Statistical inference

Citer ce document

Brigitte Bigi et al., « Combined models for topic spotting and topic-dependent language modeling », HAL-SHS : sciences de l'information, de la communication et des bibliothèques, ID : 10.1109/ASRU.1997.659133

Partage / Export

Résumé En

A new statistical method for Language Modeling and spoken document classification is proposed. It is based on a mixture of topic dependent probabilities. Each topic dependent probability is in turn a mixture of n-gram probabilities and the probability of Kullback-Lieber (KL) distances between keyword unigrams and distribution obtained from the content of a cache memory. Experimental result on topic classification using a corpus of 60 Mword from the French newspaper Le Monde show the excellent performance of the cache memory and its complementary role in providing different statistics for the decision process.

Combined models for topic spotting and topic-dependent language modeling

Fiche du document

Mots-clés En

Sujets proches En

Citer ce document

Métriques

Partage / Export

Résumé En

Par les mêmes auteurs

Sur les mêmes sujets

Sur les mêmes disciplines

Exporter en