Reference-free and Confidence-independent Binary Quality Estimation for Automatic Speech Recognition

Fiche du document

Date

11 novembre 2016

Discipline
Périmètre
Langue
Identifiants
Collection

OpenEdition Books

Organisation

OpenEdition

Licences

https://creativecommons.org/licenses/by-nc-nd/4.0/ , info:eu-repo/semantics/openAccess




Citer ce document

Hamed Zamani et al., « Reference-free and Confidence-independent Binary Quality Estimation for Automatic Speech Recognition », Accademia University Press, ID : 10.4000/books.aaccademia.1562


Métriques


Partage / Export

Résumé En It

We address the problem of assigning binary quality labels to automatically transcribed utterances when neither reference transcripts nor information about the decoding process are accessible. Our quality estimation models are evaluated in a large vocabulary continuous speech recognition setting (the transcription of English TED talks). In this setting, we apply different learning algorithms and strategies and measure performance in two testing conditions characterized by different distributions of “good” and “bad” instances. The positive results of our experiments pave the way towards the use of binary estimators of ASR output quality in a number of application scenarios.

Questo lavoro descrive un approccio che consente di assegnare un valore di qualità “binario” a trascrizioni generate da un sistema di riconoscimento automatico della voce. Il classificatore da noi sviluppato è stato valutato in un’applicazione di riconoscimento di parlato continuo per grandi vocabolari (la trascrizione di “TED talks” in Inglese), confrontando tra di loro diverse strategie.

document thumbnail

Par les mêmes auteurs

Sur les mêmes sujets

Sur les mêmes disciplines

Exporter en