Exploiting Alternatives for Text-To-Speech Synthesis: From Machine to Human

Nicolas Obin; Christophe Veaux; Pierre Lanchantin

Document record

Authors

Date

26 February 2015

Discipline

Linguistics

Document type

Books and book chapters

Scope

Publications

Language

English

Identifiers

Source

HAL-SHS : linguistique

Relations

This document is related to:
info:eu-repo/semantics/altIdentifier/doi/10.1007/978-3-662-45258-5_13

Collection

Open archives

Organisation

Centre pour la communication scientifique directe

Licence

info:eu-repo/semantics/OpenAccess

Keywords (English)

Prosody; Text-to-Speech

Related subjects (English)

Speech--Synthesis Synthetic speech Talking

Cite this document

Nicolas Obin et al., "Exploiting Alternatives for Text-To-Speech Synthesis: From Machine to Human", HAL-SHS : linguistique, ID: 10.1007/978-3-662-45258-5_13

Abstract (English)

The absence of alternatives/variants is a dramatic limitation of text-to-speech synthesis compared to the variety of human speech. This paper introduces the use of speech alternatives/variants in order to improve text-to-speech synthesis systems. Speech alternatives denote the variety of ways in which a speaker can pronounce a sentence, depending on linguistic constraints, speaker-specific strategies, speaking style, and pragmatic constraints. During training, the symbolic and acoustic characteristics of a unit-selection speech synthesis system are statistically modelled with context-dependent parametric models (GMMs/HMMs). During synthesis, symbolic and acoustic alternatives are exploited using a generalized Viterbi algorithm (GVA) to determine the sequence of speech units used for synthesis. Objective and subjective evaluations provide evidence that the use of speech alternatives significantly improves speech synthesis over conventional speech synthesis systems. Beyond this, speech alternatives can also be used to vary the synthesized speech for a given text. The proposed method can easily be extended to HMM-based speech synthesis.
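
To make the idea of searching over alternatives more concrete, here is a minimal sketch of a list-style Viterbi search that keeps the k cheapest partial paths per candidate unit instead of a single survivor. It is only an illustration under stated assumptions: the function name `k_best_unit_paths`, the toy unit labels, and both cost functions are hypothetical and are not taken from the paper, whose costs come from trained GMM/HMM models.

```python
import heapq
from typing import Callable, List, Sequence, Tuple

# A path is (accumulated cost, sequence of unit labels).
Path = Tuple[float, Tuple[str, ...]]


def k_best_unit_paths(
    candidates: Sequence[Sequence[str]],
    target_cost: Callable[[int, str], float],
    join_cost: Callable[[str, str], float],
    k: int = 2,
) -> List[Path]:
    """Return the k cheapest unit sequences (hypothetical helper).

    At each position, up to k survivor paths are kept per candidate unit,
    rather than the single survivor of the standard Viterbi algorithm.
    """
    # survivors[u] holds up to k best partial paths ending in unit u.
    survivors = {u: [(target_cost(0, u), (u,))] for u in candidates[0]}
    for t in range(1, len(candidates)):
        new_survivors = {}
        for u in candidates[t]:
            # Extend every surviving path from the previous position with u.
            extended = [
                (cost + join_cost(path[-1], u) + target_cost(t, u), path + (u,))
                for paths in survivors.values()
                for cost, path in paths
            ]
            new_survivors[u] = heapq.nsmallest(k, extended)
        survivors = new_survivors
    finals = [p for paths in survivors.values() for p in paths]
    return heapq.nsmallest(k, finals)


# Toy data (assumed for illustration): three positions, units named by
# position letter and a "recording session" digit.
candidates = [["a1", "a2"], ["b1", "b2"], ["c1"]]
toy_target = {"a1": 1.0, "a2": 2.0, "b1": 1.0, "b2": 1.5, "c1": 0.5}


def target_cost(t: int, unit: str) -> float:
    # Assumed target cost: a fixed score per unit.
    return toy_target[unit]


def join_cost(prev: str, cur: str) -> float:
    # Assumed join cost: free when both units share the same session digit.
    return 0.0 if prev[-1] == cur[-1] else 1.0


best = k_best_unit_paths(candidates, target_cost, join_cost, k=2)
for cost, units in best:
    print(cost, " ".join(units))
```

With k = 1 this reduces to an ordinary Viterbi search for the single cheapest sequence; larger k returns ranked alternatives, which is one way a synthesizer could vary its output for the same text.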
