21 April 2023
http://creativecommons.org/licenses/by-nc/ , info:eu-repo/semantics/OpenAccess
Moumita Pakrashi et al., "Resources Creation of Bengali for SPPAS", HAL-SHS : linguistique, ID: 10670/1.csb23a
The development of HLT tools inevitably requires language resources. However, only a handful of languages have such resources freely available. This paper presents the development of speech tools for the Bengali language. In particular, it focuses on creating language resources for a tokenizer and for an automatic speech system that predicts the pronunciation of words and their segmentation in this low-resourced language. The newly created resources have been integrated into the SPPAS software tool and distributed under the terms of public licenses.