Author: "Jesin James" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Jesin James"' showing total 16 results

Start Over Author "Jesin James"

16 results on '"Jesin James"'

1. Exploring Prosodic Features Modelling for Secondary Emotions Needed for Empathetic Speech Synthesis

Author: Jesin James, Balamurali B.T., Catherine Watson, and Hansjörg Mixdorff
Subjects: secondary emotions, emotional speech synthesis, fundamental frequency contour, Fujisaki model, low resource, empathetic speech, Chemical technology, TP1-1185
Abstract: A low-resource emotional speech synthesis system for empathetic speech synthesis based on modelling prosody features is presented here. Secondary emotions, identified to be needed for empathetic speech, are modelled and synthesised in this investigation. As secondary emotions are subtle in nature, they are difficult to model compared to primary emotions. This study is one of the few to model secondary emotions in speech as they have not been extensively studied so far. Current speech synthesis research uses large databases and deep learning techniques to develop emotion models. There are many secondary emotions, and hence, developing large databases for each of the secondary emotions is expensive. Hence, this research presents a proof of concept using handcrafted feature extraction and modelling of these features using a low-resource-intensive machine learning approach, thus creating synthetic speech with secondary emotions. Here, a quantitative-model-based transformation is used to shape the emotional speech’s fundamental frequency contour. Speech rate and mean intensity are modelled via rule-based approaches. Using these models, an emotional text-to-speech synthesis system to synthesise five secondary emotions-anxious, apologetic, confident, enthusiastic and worried-is developed. A perception test to evaluate the synthesised emotional speech is also conducted. The participants could identify the correct emotion in a forced response test with a hit rate greater than 65%.
Published: 2023
Full Text: View/download PDF

2. Development of Community-Oriented Text-to-Speech Models for Māori 'Avaiki Nui (Cook Islands Māori).

Author: Jesin James, Rolando Coto-Solano, Sally Akevai Nicholas, Joshua Zhu, Bovey Yu, Fuki Babasaki, Jenny Tyler Wang, and Nicholas Derby
Published: 2024

3. Improved Multi-Modal Emotion Recognition Using Squeeze-and-Excitation Block in Cross-Modal Attention.

Author: Junchen Liu, Jesin James, and Karan Nathwani
Published: 2023
Full Text: View/download PDF

4. Visualising Model Training via Vowel Space for Text-To-Speech Systems.

Author: Binu Nisal Abeysinghe, Jesin James, Catherine I. Watson, and Felix Marattukalam
Published: 2022
Full Text: View/download PDF

5. Language Models for Code-switch Detection of te reo Māori and English in a Low-resource Setting.

Author: Jesin James, Vithya Yogarajan, Isabella Shields, Catherine I. Watson, Peter Keegan, Keoni Mahelona, and Peter-Lucas Jones
Published: 2022
Full Text: View/download PDF

6. Developing Resources for Te Reo Māori Text To Speech Synthesis System.

Author: Jesin James, Isabella Shields, Rebekah Berriman, Peter Keegan, and Catherine Inez Watson
Published: 2020
Full Text: View/download PDF

7. An Open Source Emotional Speech Corpus for Human Robot Interaction Applications.

Author: Jesin James, Li Tian, and Catherine Inez Watson
Published: 2018
Full Text: View/download PDF

8. Artificial Empathy in Social Robots: An analysis of Emotions in Speech.

Author: Jesin James, Catherine Inez Watson, and Bruce A. MacDonald
Published: 2018
Full Text: View/download PDF

9. Pause duration model for Malayalam TTS.

Author: Jesin James and Deepa P. Gopinath
Published: 2015
Full Text: View/download PDF

10. Modeling pause duration for Malayalam language TTS.

Author: Jesin James and Deepa P. Gopinath
Published: 2014
Full Text: View/download PDF

11. Empathetic Speech Synthesis and Testing for Healthcare Robots

Author: Catherine Watson, Bruce A. MacDonald, B. T. Balamurali, and Jesin James
Subjects: 0209 industrial biotechnology, General Computer Science, Social Psychology, media_common.quotation_subject, Speech synthesis, Empathy, 02 engineering and technology, computer.software_genre, 020901 industrial engineering & automation, 0501 psychology and cognitive sciences, Electrical and Electronic Engineering, Prosody, Sensory cue, 050107 human factors, media_common, Social robot, business.industry, 05 social sciences, Robotics, Human-Computer Interaction, Philosophy, Control and Systems Engineering, Robot, Affect (linguistics), Artificial intelligence, Psychology, business, computer, Cognitive psychology
Abstract: One of the major factors that affect the acceptance of robots in Human-Robot Interaction applications is the type of voice with which they interact with humans. The robot’s voice can be used to express empathy, which is an affective response of the robot to the human user. In this study, the aim is to find out if social robots with empathetic voice are acceptable for users in healthcare applications. A pilot study using an empathetic voice spoken by a voice actor was conducted. Only prosody in speech is used to express empathy here, without any visual cues. Also, the emotions needed for an empathetic voice are identified. It was found that the emotions needed are not only the stronger primary emotions, but also the nuanced secondary emotions. These emotions are then synthesised using prosody modelling. A second study, replicating the pilot test is conducted using the synthesised voices to investigate if empathy is perceived from the synthetic voice as well. This paper reports the modelling and synthesises of an empathetic voice, and experimentally shows that people prefer empathetic voice for healthcare robots. The results can be further used to develop empathetic social robots, that can improve people’s acceptance of social robots.
Published: 2020

12. The Development of a Labelled te reo Māori-English Bilingual Database for Language Technology

Author: Jesin James, Isabella Shields, Vithya Yogarajan, Peter Keegan, Catherine Watson, Peter-Lucas Jones, and Keoni Mahelona
Subjects: FOS: Computer and information sciences, Computation and Language (cs.CL)
Abstract: Te reo Māori (referred to as Māori), New Zealand's indigenous language, is under-resourced in language technology. Māori speakers are bilingual, where Māori is code-switched with English. Unfortunately, there are minimal resources available for Māori language technology, language detection and code-switch detection between Māori-English pair. Both English and Māori use Roman-derived orthography making rule-based systems for detecting language and code-switching restrictive. Most Māori language detection is done manually by language experts. This research builds a Māori-English bilingual database of 66,016,807 words with word-level language annotation. The New Zealand Parliament Hansard debates reports were used to build the database. The language labels are assigned using language-specific rules and expert manual annotations. Words with the same spelling, but different meanings, exist for Māori and English. These words could not be categorised as Māori or English based on word-level language rules. Hence, manual annotations were necessary. An analysis reporting the various aspects of the database such as metadata, year-wise analysis, frequently occurring words, sentence length and N-grams is also reported. The database developed here is a valuable tool for future language and speech technology development for Aotearoa New Zealand. The methodology followed to label the database can also be followed by other low-resourced language pairs., Submitted to Springer Language Resources and Evaluation Journal 2022
Published: 2022
Full Text: View/download PDF

13. Performance evaluation of speech masking design among listeners with varying language backgrounds

Author: Hinako Masuda, Yusuke Hioka, C.T. Justine Hui, Jesin James, and Catherine I. Watson
Subjects: Acoustics and Ultrasonics
Published: 2022

14. Masker design for real-time informational masking with mitigated annoyance

Author: Jesin James, Catherine Watson, and Yusuke Hioka
Subjects: 010302 applied physics, Masking (art), Reverberation, Acoustics and Ultrasonics, Computer science, Speech recognition, Cognitive distraction, Annoyance, 01 natural sciences, Informational masking, Distraction, 0103 physical sciences, Active listening, 010301 acoustics
Abstract: An alternative design of masker for real-time speech masking is proposed. Time-reversed speech, which is also adopted in the proposed design, has been commonly used for speech masking because of its efficient performance in making the target speech unintelligible using informational masking. However, previous studies revealed that a time-reversed speech causes more annoyance and cognitive distraction for the listeners. Meanwhile, although generating time-reversed speech from the target speech would help improve masking effectiveness, it would require real-time processing as the target speech would not be available beforehand. The proposed masker design utilises techniques known for minimising discontinuities in the waveform of synthesised speech in order to minimise annoyance and distraction to the listeners. The design also avoids processes that hinders real-time generation of the masker. Results of subjective listening tests reveal that the proposed design is able to compromise the level of annoyance and the masking effect. Only marginal improvement is observed in the level of distraction. Further attempt to mitigate annoyance of the proposed masker by adding artificial reverberation did not help as it also reduces the masking effect significantly.
Published: 2020

15. Intelligent track cleaning robot

Author: Alna Thomas, Jesse Wilson, V K Dhahabiya, Jesin James, and Jovna Jetto
Subjects: 010302 applied physics, Computer science, business.industry, Control (management), ComputerApplications_COMPUTERSINOTHERSYSTEMS, Mobile robot, Robotics, 02 engineering and technology, 021001 nanoscience & nanotechnology, Track (rail transport), 01 natural sciences, Industrial engineering, Robot control, Unit (housing), 0103 physical sciences, Robot, Artificial intelligence, 0210 nano-technology, business, Garbage
Abstract: From the times of the British raj to this date, scavenging rail tracks has been a puzzling social issue. The fact that humans have to clean human waste and other garbage thrown on rail tracks is a situation that needs immediate remedy. Manual scavenging is done now-a-days, owing to the peculiar nature of the job, and many who are engaged for this job suffer from related health problems. The prevailing condition can be rectified to some extent by the adequate use of robotics and control technology. The proposed idea of automatic railway track cleaning system comprises an automatic vehicle that goes on land and track. This proposed intelligent machine specially designed for the Indian railways can clean the railway tracks in a systematic manner. It consists of a four-wheel running robot with a suction unit, cleaning unit, automatic displacement unit, an intelligent control system, an intelligent train sensing unit, and power unit. This device is the first of its kind proposed to be developed exclusively for the Indian Railways. Also, the railways can save a lot of money on water and labour charges. The application of this project in the current railway cleaning scenario will ensure that there will not be any nauseating scenes at railway stations across the country. We have accomplished some functionality critical in the waste clean-up in railway tracks, and have also tried to find solution for connected problems.
Published: 2016

16. Pause duration model for Malayalam TTS

Author: Deepa P. Gopinath and Jesin James
Subjects: Phrase, Software_GENERAL, Computer science, business.industry, Speech recognition, Speech synthesis, computer.software_genre, language.human_language, InformationSystems_GENERAL, Duration (music), Malayalam, language, Artificial intelligence, Software_PROGRAMMINGLANGUAGES, Syllable, business, computer, Natural language processing, Sentence
Abstract: In this paper a CART-based pause duration prediction model has been developed for Malayalam language. Prosodic features like pause durations, syllable prolongations etc. play an important role in making the speech output from a Text To Speech (TTS) system more intelligible. An analysis on the various factors that affect pause duration for Malayalam language has not been conducted till date. Here, inferential and descriptive statistical analysis tools are used to analyze the effect of various factors on pause duration for Malayalam. These identified factors are then used to model pause duration for pause after a word, pause after a phrase, pause after a comma and pause after a sentence separately. The RMSE and correlation values are calculated to evaluate the correctness of the model.
Published: 2015

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

16 results on '"Jesin James"'

1. Exploring Prosodic Features Modelling for Secondary Emotions Needed for Empathetic Speech Synthesis

2. Development of Community-Oriented Text-to-Speech Models for Māori 'Avaiki Nui (Cook Islands Māori).

3. Improved Multi-Modal Emotion Recognition Using Squeeze-and-Excitation Block in Cross-Modal Attention.

4. Visualising Model Training via Vowel Space for Text-To-Speech Systems.

5. Language Models for Code-switch Detection of te reo Māori and English in a Low-resource Setting.

6. Developing Resources for Te Reo Māori Text To Speech Synthesis System.

7. An Open Source Emotional Speech Corpus for Human Robot Interaction Applications.

8. Artificial Empathy in Social Robots: An analysis of Emotions in Speech.

9. Pause duration model for Malayalam TTS.

10. Modeling pause duration for Malayalam language TTS.

11. Empathetic Speech Synthesis and Testing for Healthcare Robots

12. The Development of a Labelled te reo Māori-English Bilingual Database for Language Technology

13. Performance evaluation of speech masking design among listeners with varying language backgrounds

14. Masker design for real-time informational masking with mitigated annoyance

15. Intelligent track cleaning robot

16. Pause duration model for Malayalam TTS

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

16 results on '"Jesin James"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources