The Need for a Novel Approach to Design Derivation Lexicon for Semitic Languages

Fiche du document

Date

27 août 2021

Discipline
Type de document
Périmètre
Langue
Identifiants
Relations

Ce document est lié à :
info:eu-repo/semantics/altIdentifier/doi/10.1007/978-3-030-93709-6_35

Collection

Archives ouvertes

Licence

info:eu-repo/semantics/OpenAccess




Citer ce document

Enchalew y Ayalew et al., « The Need for a Novel Approach to Design Derivation Lexicon for Semitic Languages », HAL-SHS : linguistique, ID : 10.1007/978-3-030-93709-6_35


Métriques


Partage / Export

Résumé En

Morphology knowledge is relevant in language learning, information retrieval and natural language processing. Derivation lexicons are organized and comprehensive collections of the morphological variants of a language's vocabulary. These lexicons can be developed either through analysis-based synthesis of large text corpora or through synthesis of surface forms from roots, stems, lemmas and morphological rules. Much of the research attempted in developing derivation lexicon for Indo-European languages, which are concatenative, focus on analysis-based synthesis, as they do have well-developed preprocessing tools and organized text corpora. However, the methods for these languages are not appropriate for non-concatenative languages such as Semitic languages. Moreover, most of the Semitic languages, except Arabic and Hebrew, do not have well-developed text corpora and language processing tools. Hence, a novel approach that can cater for the root-pattern and rich morphology of these languages is necessary. This paper is therefore a comprehensive survey of the literature, an analysis motivating an innovative and generic morphological synthesis approach with illustrated architecture. It is part of a larger project tailored for designing an innovative, generic, approach to derivation lexicon development for Semitic languages.

document thumbnail

Par les mêmes auteurs

Sur les mêmes sujets

Sur les mêmes disciplines

Exporter en