Back to Search
Start Over
Developing an Embosi (Bantu C25) Speech Variant Dictionary to Model Vowel Elision and Morpheme Deletion
- Source :
- Annual Conference of the International Speech Communication Association, Annual Conference of the International Speech Communication Association, ISCA, Aug 2017, Stockholm, Sweden, INTERSPEECH
- Publication Year :
- 2017
- Publisher :
- HAL CCSD, 2017.
-
Abstract
- International audience; This paper investigates vowel elision and morpheme deletion inEmbosi (Bantu C25), an under-resourced language spoken inthe Republic of Congo. We propose that the observed mor-pheme deletion is morphological, and that vowel elision isphonological. The study focuses on vowel elision that occursacross word boundaries between the contact of long/short vow-els (i.e. CV[long] # V[short].CV), and between the contact ofshort/short vowels (CV[short] # V[short].CV). Several differ-ent categories of morphemes are explored: (i) prepositions (ya,mo), (ii) class-noun nominal prefixes (ba, etc.), (iii) singularsubject pronouns (ngá, nO, wa). For example, the preposition,ya, regularly deletes allowing for vowel elision if vowel contactoccurs between the head of the noun phrase and the previousword. Phonetically motivated speech variants are proposed inthe lexicon used for forced alignment (segmentation) enablingthese phenomena to be quantified in the corpus so as to developa dictionary containing relevant phonetic variants.
- Subjects :
- language modeling
Head (linguistics)
Computer science
Speech recognition
phonetics
Bantu languages
02 engineering and technology
[INFO] Computer Science [cs]
Lexicon
under-resourced languages
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
Morpheme
Vowel
0202 electrical engineering, electronic engineering, information engineering
[INFO]Computer Science [cs]
060201 languages & linguistics
Subject pronoun
Phonetics
Phonology
06 humanities and the arts
Noun phrase
Linguistics
Prefix
phonology
[INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL]
0602 languages and literature
020201 artificial intelligence & image processing
Language model
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Journal :
- Annual Conference of the International Speech Communication Association, Annual Conference of the International Speech Communication Association, ISCA, Aug 2017, Stockholm, Sweden, INTERSPEECH
- Accession number :
- edsair.doi.dedup.....e6fdd0cb507e3a8137fe60a0419a6d9c