Towards Qualitative Word Embeddings Evaluation: Measuring Neighbors Variation

Fiche du document

Date

1 juin 2018

Discipline
Type de document
Périmètre
Langue
Identifiants
Collection

Archives ouvertes

Licence

info:eu-repo/semantics/OpenAccess



Citer ce document

Bénédicte Pierrejean et al., « Towards Qualitative Word Embeddings Evaluation: Measuring Neighbors Variation », HAL-SHS : linguistique, ID : 10670/1.knzjer


Métriques


Partage / Export

Résumé En

We propose a method to study the variation lying between different word embeddings models trained with different parameters. We explore the variation between models trained with only one varying parameter by observing the distributional neighbors variation and show how changing only one parameter can have a massive impact on a given semantic space. We show that the variation is not affecting all words of the semantic space equally. Variation is influenced by parameters such as setting a parameter to its minimum or maximum value but it also depends on the corpus intrinsic features such as the frequency of a word. We identify semantic classes of words remaining stable across the models trained and specific words having high variation.

document thumbnail

Par les mêmes auteurs

Sur les mêmes sujets

Sur les mêmes disciplines

Exporter en