3 September 2021
https://www.openedition.org/12554, info:eu-repo/semantics/openAccess
Francesca Masini et al., "Multiword expressions we live by: a validated usage-based dataset from corpora of written Italian", Accademia University Press, ID: 10.4000/books.aaccademia.8710
The paper describes the creation of a manually validated dataset of Italian multiword expressions, building on candidates automatically extracted from corpora of written Italian. The main features of the resource, such as POS-pattern and lemma distribution, are also discussed, together with possible applications.