22 results on '"Barbosa, Plinio A."'
Search Results
2. Evaluation of the decolorization potential of azo dyes by aerobic granular sludge
- Author
-
Xavier, Renan da Silva, Barbosa, Plínio Tavares, dos Santos, André Bezerra, da Silva, Marcos Erick Rodrigues, and Firmino, Paulo Igor Milen
- Published
- 2023
- Full Text
- View/download PDF
3. Multiparametric Analysis of Speaking Fundamental Frequency in Genetically Related Speakers Using Different Speech Materials: Some Forensic Implications
- Author
-
Cavalcanti, Julio Cesar, Eriksson, Anders, and Barbosa, Plinio A.
- Published
- 2021
- Full Text
- View/download PDF
4. The Interplay between Syllabic Duration and Melody to Indicate Prosodic Functions in Brazilian Portuguese Story Retelling.
- Author
-
Barbosa, Plinio A. and Alvarenga, Luís H. G.
- Subjects
TIME series analysis, PORTUGUESE language, MELODY, INTONATION (Musical pitch), RHYTHM
- Abstract
This paper investigates the relationship between syllabic duration and F0 contours for implementing three prosodic functions. Work on rhythm usually describes the evolution of syllable-sized durations throughout utterances, rarely making reference to melodic events. On the other hand, work on intonation usually describes linear sequences of melodic events with indirect references to duration. Although some scholars have explored the relationship between these two parameters for particular functions, to our knowledge, there has been no investigation on the systematic correlation between syllabic duration and F0 values throughout narrative sequences. Based on a corpus of story retelling with nine speakers of Brazilian Portuguese from two regions, our work investigated the interplay between syllabic duration and melody to signal three prosodic functions: terminal and non-terminal boundary marking and prominence. The examination of local syllabic duration maxima and four F0 descriptors revealed that these maxima act as landmarks for particular F0 shapes: for non-terminal boundaries, the great majority of shapes were increasing and increasing–decreasing patterns; for terminal boundaries, almost all shapes were decreasing F0 patterns; and for prominence marking, the great majority of shapes were high tones across the stressed syllable. Time series analyses revealed significant correlations between duration and specific F0 descriptors, pointing to a ruled interplay between F0 and syllabic duration patterns in Brazilian Portuguese story retelling. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
5. Exploring the performance of automatic speaker recognition using twin speech and deep learning-based artificial neural networks.
- Author
-
Cesar Cavalcanti, Julio, Rodrigues da Silva, Ronaldo, Eriksson, Anders, and Barbosa, Plinio A.
- Published
- 2024
- Full Text
- View/download PDF
6. NEOS: An odour‐induced affect scale for use in the cosmetic industry.
- Author
-
Barbosa, Plinio A., Semenzim, Thaís Bellintani, Marques, Lucas Murrins, Serpa, Alexandre Luiz de Oliveira, Yoshimine, Elise, and Tobo, Patricia
- Subjects
*ODORS, *COSMETICS industry, *NEW product development, *AFFECT (Psychology)
- Abstract
This work proposes an odour‐induced affect scale for use in the cosmetic industry that relies on the approach that produced the UniGEOS, a universal odour‐related emotional scale from the Swiss Center for Affective Sciences. The Natura Emotion and Odor Scale (NEOS) was built on experiments conducted with a larger set of participants (491) and a set of 35 scents that combine seven commercial perfumes from Natura & Co cosmetic company with 28 odours from different olfactory classes important for the cosmetic industry. The results showed the stability of 60 Emotion‐Related terms in Brazilian Portuguese split into five emotion‐related dimensions: Romance, Attention, Energy, Well‐being and Negative feelings. The association of the scents evoking these five dimensions has direct implications in the design of new products. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
7. On the speaker discriminatory power asymmetry regarding acoustic-phonetic parameters and the impact of speaking style.
- Author
-
Cavalcanti, Julio Cesar, Eriksson, Anders, and Barbosa, Plinio A.
- Subjects
PROSODIC analysis (Linguistics), POWER (Social sciences), SPEECH, ERROR rates, STATISTICAL models
- Abstract
This study aimed to assess what we refer to as the speaker discriminatory power asymmetry and its forensic implications in comparisons performed in different speaking styles: spontaneous dialogues vs. interviews. We also addressed the impact of data sampling on the speaker's discriminatory performance concerning different acoustic-phonetic estimates. The participants were 20 male speakers, Brazilian Portuguese speakers from the same dialectal area. The speech material consisted of spontaneous telephone conversations between familiar individuals, and interviews conducted between each individual participant and the researcher. Nine acoustic-phonetic parameters were chosen for the comparisons, spanning from temporal and melodic to spectral acoustic-phonetic estimates. Ultimately, an analysis based on the combination of different parameters was also conducted. Two speaker discriminatory metrics were examined: Cost Log-likelihood-ratio (Cllr) and Equal Error Rate (EER) values. A general speaker discriminatory trend was suggested when assessing the parameters individually. Parameters pertaining to the temporal acoustic-phonetic class depicted the weakest performance in terms of speaker contrasting power as evidenced by the relatively higher Cllr and EER values. Moreover, from the set of acoustic parameters assessed, spectral parameters, mainly high formant frequencies, i.e., F3 and F4, were the best performing in terms of speaker discrimination, depicting the lowest EER and Cllr scores. The results appear to suggest a speaker discriminatory power asymmetry concerning parameters from different acoustic-phonetic classes, in which temporal parameters tended to present a lower discriminatory power. The speaking style mismatch also seemed to considerably impact the speaker comparison task, by undermining the overall discriminatory performance. A statistical model based on the combination of different acoustic-phonetic estimates was found to perform best in this case. 
Finally, data sampling has proven to be of crucial relevance for the reliability of discriminatory power assessment. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
8. The Dance of Pauses in Poetry Declamation.
- Author
-
Barbosa, Plinio A.
- Subjects
PROSODIC analysis (Linguistics), PORTUGUESE language, CORPORA, PHONETICS, PLEASANTNESS & unpleasantness (Psychology)
- Abstract
In poetry declamation, the appropriate use of prosody to cause pleasure is essential. Among the prosodic parameters, pause is one of the most effective to engage the listeners and provide them with a pleasant experience. The declamation of three poems in two varieties of Portuguese by ten Brazilian Portuguese (BP) speakers and ten European Portuguese (EP) speakers, balanced for gender, was used as a corpus for evaluating the degree of pleasantness by listeners from the same language variety. The distributions of pause duration and inter-pause interval (IPI) both varied greatly across the subjects, being the main source of variability and strongly right-tailed. The evaluation of the degree of pleasantness revealed that pause duration predicts degree of pleasantness in EP, whereas IPI predicts degree of pleasantness in BP. Reciters perform a kind of complex "dance", where sonority between pauses is favored in BP and pause duration in EP. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
9. Measuring the impact of data size on the speaker discriminatory performance: a spontaneous speech-based study
- Author
-
Cavalcanti, Julio Cesar, Eriksson, Anders, and Barbosa, Plinio A.
- Subjects
General Language Studies and Linguistics, Jämförande språkvetenskap och allmän lingvistik, Phonetics, Data size, Acoustic Phonetics
- Abstract
This study aimed to analyze the impact of the amount of data on the discriminatory performance of acoustic-phonetic parameters, some of which are frequently assessed in forensic speaker comparisons. Parameters from three distinct phonetic domains were considered, namely, spectral, melodic, and temporal, which were assessed separately within the same phonetic domain and in combination. The speech material consisted of spontaneous telephone conversations between two subjects. During the recording sessions, the participants were placed in different rooms, not directly seeing, hearing, or interacting with each other. The speakers were encouraged to start a conversation using a mobile phone while being simultaneously recorded. All recordings were carried out with a high resolution (44.1 kHz and 16-bit). Data segmentation and transcription were performed in the Praat software [1]. The participants were 20 male subjects, Brazilian Portuguese speakers from the same dialectal area. Their age ranged from 19 to 35 years, with a mean of 26.4 years. Although the subjects (10 identical twin pairs) were recruited from a twin research project, cf. [2, 3, 4], the focus here was comparisons among all speakers (i.e., 190 inter-speaker comparisons) rather than on individual twin pairs
- Published
- 2022
10. Brazilian Portuguese
- Author
-
Barbosa, Plínio A. and Albano, Eleonora C.
- Published
- 2004
11. Multi-parametric analysis of speech timing in inter-talker identical twin pairs and cross-pair comparisons: Some forensic implications.
- Author
-
Cavalcanti, Julio Cesar, Eriksson, Anders, and Barbosa, Plinio A.
- Subjects
TWINS, SPEECH processing systems, AUTOMATIC speech recognition, CELL phones, ARTICULATION (Speech)
- Abstract
The purpose of this study was to assess the speaker-discriminatory potential of a set of speech timing parameters while probing their suitability for forensic speaker comparison applications. The recordings comprised of spontaneous dialogues between twin pairs through mobile phones while being directly recorded with professional headset microphones. Speaker comparisons were performed with twins speakers engaged in a dialogue (i.e., intra-twin pairs) and among all subjects (i.e., cross-twin pairs). The participants were 20 Brazilian Portuguese speakers, ten male identical twin pairs from the same dialectal area. A set of 11 speech timing parameters was extracted and analyzed, including speech rate, articulation rate, syllable duration (V-V unit), vowel duration, and pause duration. Three system performance estimates were considered for assessing the suitability of the parameters for speaker comparison purposes, namely global Cllr, EER, and AUC values. These were interpreted while also taking into consideration the analysis of effect sizes. Overall, speech rate and articulation rate were found the most reliable parameters, displaying the largest effect sizes for the factor "speaker" and the best system performance outcomes, namely lowest Cllr, EER, and highest AUC values. Conversely, smaller effect sizes were found for the other parameters, which is compatible with a lower explanatory potential of the speaker identity on the duration of such units and a possibly higher linguistic control regarding their temporal variation. In addition, there was a tendency for speech timing estimates based on larger temporal intervals to present larger effect sizes and better speaker-discriminatory performance. Finally, identical twin pairs were found remarkably similar in their speech temporal patterns at the macro and micro levels while engaging in a dialogue, resulting in poor system discriminatory performance. 
Possible underlying factors for such a striking convergence in identical twins' speech timing patterns are presented and discussed. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
12. Acoustic analysis of vowel formant frequencies in genetically-related and non-genetically related speakers with implications for forensic speaker comparison.
- Author
-
Cavalcanti, Julio Cesar, Eriksson, Anders, and Barbosa, Plinio A.
- Subjects
VOWELS, TWINS, AUTOMATIC speech recognition
- Abstract
The purpose of this study was to explore the speaker-discriminatory potential of vowel formant mean frequencies in comparisons of identical twin pairs and non-genetically related speakers. The influences of lexical stress and the vowels' acoustic distances on the discriminatory patterns of formant frequencies were also assessed. Acoustic extraction and analysis of the first four speech formants F1-F4 were carried out using spontaneous speech materials. The recordings comprise telephone conversations between identical twin pairs while being directly recorded through high-quality microphones. The subjects were 20 male adult speakers of Brazilian Portuguese (BP), aged between 19 and 35. As for comparisons, stressed and unstressed oral vowels of BP were segmented and transcribed manually in the Praat software. F1-F4 formant estimates were automatically extracted from the middle points of each labeled vowel. Formant values were represented in both Hertz and Bark. Comparisons within identical twin pairs using the Bark scale were performed to verify whether the measured differences would be potentially significant when following a psychoacoustic criterion. The results revealed consistent patterns regarding the comparison of low-frequency and high-frequency formants in twin pairs and non-genetically related speakers, with high-frequency formants displaying a greater speaker-discriminatory power compared to low-frequency formants. Among all formants, F4 seemed to display the highest discriminatory potential within identical twin pairs, followed by F3. As for non-genetically related speakers, both F3 and F4 displayed a similar high discriminatory potential. Regarding vowel quality, the central vowel /a/ was found to be the most speaker-discriminatory segment, followed by front vowels. 
Moreover, stressed vowels displayed a higher inter-speaker discrimination than unstressed vowels in both groups; however, the combination of stressed and unstressed vowels was found even more explanatory in terms of the observed differences. Although identical twins displayed a higher phonetic similarity, they were not found phonetically identical. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
13. Multiparametric Analysis of Speaking Fundamental Frequency in Genetically Related Speakers Using Different Speech Materials: Some Forensic Implications.
- Author
-
Cavalcanti, Julio Cesar, Eriksson, Anders, and Barbosa, Plinio A.
- Abstract
To assess the speaker-discriminatory potential of a set of fundamental frequency estimates in intra-identical twin pair comparisons and cross-pair comparisons (i.e., among all speakers). A total of 20 Brazilian Portuguese speakers of the same dialect, namely 10 male identical twin pairs aged between 19 and 35, were recruited. The participants were recorded directly through professional microphones while taking part in a spontaneous dialogue over mobile phones. Acoustic measurements were performed in connected speech samples, and in lengthened vowels, at least 160 ms long produced during spontaneous speech. f0 baseline, central tendency, and extreme values were found mostly discriminatory in intra-twin pair and cross-pair comparisons. These were also the estimates displaying the largest effect sizes. Overall, only three identical twins were found statistically different regarding their f0 patterns in connected speech, but not for lengthened vowel-based f0 metrics. Estimates of f0 variation and modulation were found the least discriminatory across speakers, which may signal the control of speaking style and dialect on dynamic patterns of f0. Concerning system performance, the base value of f0 (f0 baseline) was found the most reliable metric, displaying the lowest equal error rate (EER). The outcomes suggest that, although identical twins were very closely related regarding their f0 patterns, some pairs could still be differentiated acoustically, only in connected speech. Such findings reinforce the relevance of analyzing long-term f0 metrics for speaker comparison purposes, with particular consideration to f0 baseline. Furthermore, f0 differences across subjects were suggested as more expressive in connected speech than in lengthened vowels. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
14. Cross-cultural and cross-linguistic perception of authentic emotions through speech: An acoustic-phonetic study with Brazilian and Swedish listeners
- Author
-
SILVA, Wellington da, BARBOSA, Plinio Almeida, and ABELIN, Åsa
- Subjects
expressive speech, fala expressiva, intercultural communication, dimensões emocionais, emoções básicas, comunicação intercultural, basic emotions, emotional dimensions
- Abstract
This study was conducted to investigate whether the listeners' culture and mother language influence the perception of emotions through speech and which acoustic cues listeners use in this process. Swedish and Brazilian listeners were presented with authentic emotional speech samples of Brazilian Portuguese and Swedish. They judged on 5-point Likert scales the expression of basic emotions as described by eight adjectives in the utterances in Brazilian Portuguese and the expression of five emotional dimensions in the utterances in Swedish. The PCA technique revealed that two components explain more than 94% of the variance of the judges' responses in both experiments. These components were predicted through multiple linear regressions from twelve acoustic parameters automatically computed from the utterances. The results point to a similar perception of the emotions between both cultures. RESUMO Este estudo foi conduzido para investigar se a cultura e a língua materna dos ouvintes influenciam a percepção das emoções na fala e quais pistas acústicas os ouvintes utilizam nesse processo. Trechos de fala de emoções autênticas em português brasileiro e em sueco foram apresentados a sujeitos brasileiros e suecos. Os sujeitos avaliaram, em escalas de 5 pontos, o grau de expressão de emoções básicas descritas por oito adjetivos nos enunciados em português brasileiro e o grau de expressão de cinco dimensões emocionais nos enunciados em sueco. A técnica de PCA revelou que dois componentes explicam mais do que 94% da variância das respostas dos juízes nos dois experimentos. Esses componentes foram preditos através de regressões lineares múltiplas por doze parâmetros acústicos automaticamente extraídos dos enunciados. Os resultados mostram uma percepção semelhante das emoções entre ambas as culturas.
- Published
- 2016
15. Perception of emotional prosody: investigating the relation between the discrete and dimensional approaches to emotions.
- Author
-
da Silva, Wellington and Barbosa, Plinio Almeida
- Subjects
*VERSIFICATION, *LANGUAGE & emotions, *LANGUAGE & languages
- Abstract
Emotional phenomena can be described according to various psychological approaches, the most adopted being the discrete (basic) and the dimensional ones. This study aimed at investigating the relation between the perception of some basic emotions and emotional dimensions in speech, as well as determining which acoustic cues are related to their perception. We conducted two perception experiments with utterances selected from a foreign language (Swedish) of which the listeners had no knowledge. In the first one, Brazilian subjects rated on 5-point scales the expressivity of four basic emotions: joy, anger, sadness, and calmness. In the second, a distinct group of Brazilian subjects rated the expressivity of five emotional dimensions: activation, fairness, valence, motivation, and involvement. The perception of the basic emotions and of the emotional dimensions was then compared by means of the Spearman's correlation coefficient. The five emotional dimensions were significantly correlated to some extent with the basic emotions, and these correlations were, in general, consistent with the literature and with the hypotheses that guided this study. We also performed an acoustic analysis of the utterances evaluated by the listeners. The parameters which correlated better with the listeners' judgments were fundamental frequency (median, interquantile semi-amplitude, 99.5% quantile), spectral tilt (mean and standard deviation), and LTAS slope. We concluded that it is possible to describe the perception of basic emotions in speech as a combination of emotional dimensions and that emotional dimensions may be better for describing the expression of emotions in speech. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
16. Special Issue on Iberian Languages
- Author
-
Trancoso, Isabel, Becerra-Yoma, Nestor, Barbosa, Plínio, San-Segundo, Rubén, and Paliwal, Kuldip
- Published
- 2008
- Full Text
- View/download PDF
17. Characterisation of rhythmic patterns for text-to-speech synthesis
- Author
-
Barbosa, Plínio and Bailly, Gérard
- Published
- 1994
- Full Text
- View/download PDF
18. Affective semantic space of scents. Towards a universal scale to measure self-reported odor-related feelings.
- Author
-
Ferdenzi, Camille, Delplanque, Sylvain, Barbosa, Plinio, Court, Kimberly, Guinard, Jean-Xavier, Guo, Taomei, Craig Roberts, S., Schirmer, Annett, Porcherot, Christelle, Cayeux, Isabelle, Sander, David, and Grandjean, Didier
- Subjects
*FOOD aroma, *ECOLOGICAL zones, *CROSS-cultural differences, *FOOD consumption, *DATA analysis
- Abstract
Highlights: [•] This article introduces new scales to measure odor-related feelings in four geographic areas. [•] Cultural specificities as well as cultural differences are highlighted and discussed. [•] A unique scale is proposed (UniGEOS) for future research in other cultures. [ABSTRACT FROM AUTHOR]
- Published
- 2013
- Full Text
- View/download PDF
19. Prosody and the production of structurally ambiguous phrases
- Author
-
Edmonds, Amanda, Fultz, Audrey Liljestrand, Killam, Jason, Etudes montpelliéraines du monde anglophone (EMMA), Université Paul-Valéry - Montpellier 3 (UPVM), Barbosa Plinio, Madureira Sandra, Reis César, and des publications scientifiques, Base
- Subjects
[SHS.LANGUE] Humanities and Social Sciences/Linguistics, GeneralLiterature_REFERENCE (e.g., dictionaries, encyclopedias, glossaries), ComputingMilieux_MISCELLANEOUS
- Abstract
Paper available online at: http://isle.illinois.edu/sprosig/sp2008/papers/id002.pdf; International audience
- Published
- 2008
20. Para um estudo da estruturação rítmica na fala disártrica
- Author
-
Vieira, Jussara Melo, Barbosa, Plinio Almeida, 1966, Coudry, Maria Irma Hadler, Pegoraro-Krook, Mair Ines, Madureira, Sandra, Camargo, Zuleica Antonia de, Universidade Estadual de Campinas. Instituto de Estudos da Linguagem, Programa de Pós-Graduação em Linguística, and UNIVERSIDADE ESTADUAL DE CAMPINAS
- Subjects
Protese de palato, Traumatic brain injury, Dysarthria, Speech rhythm, Palatal lift, Traumatismo craniano, Prosody, Distúrbios da articulação
- Abstract
Orientador: Plinio Almeida Barbosa Tese (doutorado) - Universidade Estadual de Campinas, Instituto de Estudos da Linguagem Resumo: A disartria é uma desordem da fala decorrente de comprometimentos neuromusculares. Tais comprometimentos podem causar alterações na respiração, na fonação, na ressonância e na articulação da fala. Estes comprometimentos podem ser decorrentes de traumatismo craniano (TC). O TC é uma agressão cerebral resultante de uma pancada na cabeça e/ou uma queda com fratura craniana. Na fala disártrica pode haver, também, falha no mecanismo velofaríngeo devido à paralisia/paresia do palato mole. Neste caso, o resultado acústico para a fala é a hipernasalidade e a emissão de ar nasal, que podem ser tratadas através do uso de uma prótese de palato elevadora (PPE). Na disartria, o ritmo da fala também pode estar alterado, implicando alterações nos locais de acento frasal (AF), na inserção e na duração das pausas silenciosas (PS). Sendo assim, interessou-nos investigar a estruturação rítmica da fala disártrica decorrente de TC. Para tanto, obtivemos a colaboração de uma falante disártrica, usuária de PPE, falante nativa do português brasileiro (PB) acometida por TC. O estudo desta fala isártrica teve como objetivos: comparar as condições de uso e não uso da PPE; comparar a estruturação rítmica da fala disártrica brasileira (FDB) com uma fala sem comprometimentos (FSC); comparar a estruturação rítmica da FDB com uma fala isártrica francesa (FDF); estudar a relação sintaxeprosódia na estruturação rítmica da fala tanto na comparação com a FSC quanto na comparação com a FDF; adquirir melhor entendimento da fala disártrica pós TC, considerando o modelo de ritmo da fala para o PB de Barbosa (2006). Para cumprir estes objetivos, acompanhamos o tratamento com PPE da falante disártrica de 1995 a 2005. Neste período coletamos e analisamos oito leituras de um mesmo texto realizadas por ela usando a PPE e duas leituras sem esta prótese: uma em 1995 e outra em 2005. 
A FSC correspondeu à leitura do mesmo texto da FDB. A FDF correspondeu à leitura de outro texto francês. Nestas leituras realizamos análises acústicas e de sua organização rítmica em unidades vogal-vogal (VV), grupos acentuais (GA), AF e PS. Nas análises das oito leituras da FDB houve redução no número de PS, redução da participação das PS na duração do texto lido e aumento da taxa de elocução (TE). A estruturação rítmica da FDB com e sem a prótese foi a mesma. Houve diferença significante entre a FDB e a FSC para a TE e para a duração das PS, mas não para a duração dos GA nem para o número de unidades VV. Na comparação entre a FDB e a FDF não houve diferenças significativas quanto à TE, taxa de articulação e número de unidades VV por GA. Na relação sintaxe-prosódia, as marcas sintáticas IDF (independência forte) e COORD (independência de uma conjunção coordenada) foram significativas tanto para a FDB quanto para a FSC. Mas não para a FDF. Nosso trabalho contribuiu para destacar a pertinência de estudos que considerem a relação dinâmica entre fatores biomecânicos e lingüísticos da fala, notadamente para a fala comprometida e, em especial, para a fala disártrica pós TC Abstract: Dysarthria is a speech disorder caused by neuromotors problems. These neuromotors problems may cause breath, phonation, resonance and speech articulation disturbance. The cause of these neuromotors problems can be the traumatic brain injury (TBI). TBI is a lesion that result of cerebral aggression, a collision in the head/or a fall happening cranial breaking. Dysarthric speech can have too velopharyngeal dysfunction by total/partial paralysis soft palate. In case of soft palate palsy the hypernasality may be present, with emission of nasal air during oral sounds. In order to deal with hypernasality and nasal air emission a palatal lift (PL) is indicated. The use of PL stimulates the correct movement of the soft palate, which produces the appropriate closure of the velopharyngeal port. 
Rhythm speech can be altered in dysarthric speech. We realize then a study of the rhythmic structure of the dysarthric speech by TBI. For doing so, eight dysarthric speaker's readings during prosthesis treatment with the palatal lift, are compared with readings without prosthesis, with a reference speaker and a French dysarthric speaker. These comparisons are done by analyzing vowel-to-vowel units (VV), stress groups, phrase stress and silent pauses under the theoretical framework of Barbosa (2006)'s speech rhythm model applied to Brazilian Portuguese. In the eight readings of the Brazilian dysarthric speaker, there were reductions on the number of silent pauses and pause occurrence. An increase of speech rate was also observed. The results showed that there is no distinction in rhythmic structure with and without prosthesis. The comparative analysis between the Brazilian dysarthric speaker and the reference speaker showed significant differences in speech rate and silent pause duration, but no significant difference in the duration of stress groups or in the number of VV units. The comparative analysis between the Brazilian dysarthric speaker and the French dysarthric speaker showed no significant differences in speech rate, articulation rate and number of VV units inside stress groups. As regards the syntax-prosody interface, the syntactic markers IDF (strong independency) and COORD (coordinated conjunction independency) were significant for both the Brazilian dysarthric and the reference speaker. However, these markers were not significant for the French dysarthric speaker. This work reinforces the importance of considering both linguistics and biomechanics aspects on the analysis of dysarthric speech Doutorado Doutor em Linguística
- Published
- 2007
- Full Text
- View/download PDF
21. Exploring the performance of automatic speaker recognition using twin speech and deep learning-based artificial neural networks.
- Author
-
Cavalcanti JC, da Silva RR, Eriksson A, and Barbosa PA
- Abstract
This study assessed the influence of speaker similarity and sample length on the performance of an automatic speaker recognition (ASR) system utilizing the SpeechBrain toolkit. The dataset comprised recordings from 20 male identical twin speakers engaged in spontaneous dialogues and interviews. Performance evaluations involved comparing identical twins, all speakers in the dataset (including twin pairs), and all speakers excluding twin pairs. Speech samples, ranging from 5 to 30 s, underwent assessment based on equal error rates (EER) and Log cost-likelihood ratios (Cllr). Results highlight the substantial challenge posed by identical twins to the ASR system, leading to a decrease in overall speaker recognition accuracy. Furthermore, analyses based on longer speech samples outperformed those using shorter samples. As sample size increased, standard deviation values for both intra and inter-speaker similarity scores decreased, indicating reduced variability in estimating speaker similarity/dissimilarity levels in longer speech stretches compared to shorter ones. The study also uncovered varying degrees of likeness among identical twins, with certain pairs presenting a greater challenge for ASR systems. These outcomes align with prior research and are discussed within the context of relevant literature. Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. (Copyright © 2024 Cavalcanti, da Silva, Eriksson and Barbosa.)
- Published
- 2024
- Full Text
- View/download PDF
22. R.H. Stetson, Motor Phonetics: A Study of Speech Movements in Action, 2nd ed., Amsterdam, North Holland Publishing Co., 1951.
- Author
-
Barbosa PA
- Published
- 2017
- Full Text
- View/download PDF