Conversational Feedback in Scripted versus Spontaneous Dialogues: A Comparative Analysis

Ildikó Pilán; Laurent Prévot; Hendrik Buschmeier; Pierre Lison


Document record

Authors

Date

6 April 2024

Discipline

Linguistics

Document type

Preprint

Scope

Publications

Language

English

Identifiers

Source

HAL-SHS : linguistique

Relations

This document is related to:
info:eu-repo/semantics/altIdentifier/doi/10.48550/arXiv.2309.15656

Collection

Open archives

Organization

Centre pour la communication scientifique directe

Licenses

http://creativecommons.org/licenses/by/, info:eu-repo/semantics/OpenAccess

Keywords (English)

Computation and Language (cs.CL); FOS: Computer and information sciences

Related subjects (English)

Dialogs

Cite this document

Ildikó Pilán et al., "Conversational Feedback in Scripted versus Spontaneous Dialogues: A Comparative Analysis", HAL-SHS : linguistique, ID: 10.48550/arXiv.2309.15656


Abstract

Scripted dialogues such as movie and TV subtitles constitute a widespread source of training data for conversational NLP models. However, the linguistic characteristics of those dialogues are notably different from those observed in corpora of spontaneous interactions. This difference is particularly marked for communicative feedback and grounding phenomena such as backchannels, acknowledgments, or clarification requests. Such signals are known to constitute a key part of the conversation flow and are used by the dialogue participants to provide feedback to one another on their perception of the ongoing interaction. This paper presents a quantitative analysis of such communicative feedback phenomena in both subtitles and spontaneous conversations. Based on dialogue data in English, French, German, Hungarian, Italian, Japanese, Norwegian and Chinese, we extract both lexical statistics and classification outputs obtained with a neural dialogue act tagger. Two main findings of this empirical study are that (1) conversational feedback is markedly less frequent in subtitles than in spontaneous dialogues and (2) subtitles contain a higher proportion of negative feedback. Furthermore, we show that dialogue responses generated by large language models also follow the same underlying trends and include comparatively few occurrences of communicative feedback, except when those models are explicitly fine-tuned on spontaneous dialogues.
