Author: "Yuksel, Kamer" / Publication Type: Electronic Resources - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Yuksel, Kamer"' showing total 12 results

Start Over Author "Yuksel, Kamer" Publication Type Electronic Resources

12 results on '"Yuksel, Kamer"'

1. An Automated End-to-End Open-Source Software for High-Quality Text-to-Speech Dataset Generation

Author: Gunduz, Ahmet, Yuksel, Kamer Ali, Darwish, Kareem, Javadi, Golara, Minazzi, Fabio, Sobieski, Nicola, Bratieres, Sebastien, Gunduz, Ahmet, Yuksel, Kamer Ali, Darwish, Kareem, Javadi, Golara, Minazzi, Fabio, Sobieski, Nicola, and Bratieres, Sebastien
Abstract: Data availability is crucial for advancing artificial intelligence applications, including voice-based technologies. As content creation, particularly in social media, experiences increasing demand, translation and text-to-speech (TTS) technologies have become essential tools. Notably, the performance of these TTS technologies is highly dependent on the quality of the training data, emphasizing the mutual dependence of data availability and technological progress. This paper introduces an end-to-end tool to generate high-quality datasets for text-to-speech (TTS) models to address this critical need for high-quality data. The contributions of this work are manifold and include: the integration of language-specific phoneme distribution into sample selection, automation of the recording process, automated and human-in-the-loop quality assurance of recordings, and processing of recordings to meet specified formats. The proposed application aims to streamline the dataset creation process for TTS models through these features, thereby facilitating advancements in voice-based technologies., Comment: 9 Pages, 6 Figures, 4 Tables, LREC-COLING 2024
Published: 2024

2. Word-Level ASR Quality Estimation for Efficient Corpus Sampling and Post-Editing through Analyzing Attentions of a Reference-Free Metric

Author: Javadi, Golara, Yuksel, Kamer Ali, Kim, Yunsu, Ferreira, Thiago Castro, Al-Badrashiny, Mohamed, Javadi, Golara, Yuksel, Kamer Ali, Kim, Yunsu, Ferreira, Thiago Castro, and Al-Badrashiny, Mohamed
Abstract: In the realm of automatic speech recognition (ASR), the quest for models that not only perform with high accuracy but also offer transparency in their decision-making processes is crucial. The potential of quality estimation (QE) metrics is introduced and evaluated as a novel tool to enhance explainable artificial intelligence (XAI) in ASR systems. Through experiments and analyses, the capabilities of the NoRefER (No Reference Error Rate) metric are explored in identifying word-level errors to aid post-editors in refining ASR hypotheses. The investigation also extends to the utility of NoRefER in the corpus-building process, demonstrating its effectiveness in augmenting datasets with insightful annotations. The diagnostic aspects of NoRefER are examined, revealing its ability to provide valuable insights into model behaviors and decision patterns. This has proven beneficial for prioritizing hypotheses in post-editing workflows and fine-tuning ASR models. The findings suggest that NoRefER is not merely a tool for error detection but also a comprehensive framework for enhancing ASR systems' transparency, efficiency, and effectiveness. To ensure the reproducibility of the results, all source codes of this study are made publicly available.
Published: 2024

3. Generative Meta-Learning Robust Quality-Diversity Portfolio

Author: Yuksel, Kamer Ali and Yuksel, Kamer Ali
Abstract: This paper proposes a novel meta-learning approach to optimize a robust portfolio ensemble. The method uses a deep generative model to generate diverse and high-quality sub-portfolios combined to form the ensemble portfolio. The generative model consists of a convolutional layer, a stateful LSTM module, and a dense network. During training, the model takes a randomly sampled batch of Gaussian noise and outputs a population of solutions, which are then evaluated using the objective function of the problem. The weights of the model are updated using a gradient-based optimizer. The convolutional layer transforms the noise into a desired distribution in latent space, while the LSTM module adds dependence between generations. The dense network decodes the population of solutions. The proposed method balances maximizing the performance of the sub-portfolios with minimizing their maximum correlation, resulting in a robust ensemble portfolio against systematic shocks. The approach was effective in experiments where stochastic rewards were present. Moreover, the results (Fig. 1) demonstrated that the ensemble portfolio obtained by taking the average of the generated sub-portfolio weights was robust and generalized well. The proposed method can be applied to problems where diversity is desired among co-optimized solutions for a robust ensemble. The source-codes and the dataset are in the supplementary material.
Published: 2023

4. NoRefER: a Referenceless Quality Metric for Automatic Speech Recognition via Semi-Supervised Language Model Fine-Tuning with Contrastive Learning

Author: Yuksel, Kamer Ali, Ferreira, Thiago, Javadi, Golara, El-Badrashiny, Mohamed, Gunduz, Ahmet, Yuksel, Kamer Ali, Ferreira, Thiago, Javadi, Golara, El-Badrashiny, Mohamed, and Gunduz, Ahmet
Abstract: This paper introduces NoRefER, a novel referenceless quality metric for automatic speech recognition (ASR) systems. Traditional reference-based metrics for evaluating ASR systems require costly ground-truth transcripts. NoRefER overcomes this limitation by fine-tuning a multilingual language model for pair-wise ranking ASR hypotheses using contrastive learning with Siamese network architecture. The self-supervised NoRefER exploits the known quality relationships between hypotheses from multiple compression levels of an ASR for learning to rank intra-sample hypotheses by quality, which is essential for model comparisons. The semi-supervised version also uses a referenced dataset to improve its inter-sample quality ranking, which is crucial for selecting potentially erroneous samples. The results indicate that NoRefER correlates highly with reference-based metrics and their intra-sample ranks, indicating a high potential for referenceless ASR evaluation or a/b testing.
Published: 2023

5. A Reference-less Quality Metric for Automatic Speech Recognition via Contrastive-Learning of a Multi-Language Model with Self-Supervision

Author: Yuksel, Kamer Ali, Ferreira, Thiago, Gunduz, Ahmet, Al-Badrashiny, Mohamed, Javadi, Golara, Yuksel, Kamer Ali, Ferreira, Thiago, Gunduz, Ahmet, Al-Badrashiny, Mohamed, and Javadi, Golara
Abstract: The common standard for quality evaluation of automatic speech recognition (ASR) systems is reference-based metrics such as the Word Error Rate (WER), computed using manual ground-truth transcriptions that are time-consuming and expensive to obtain. This work proposes a multi-language referenceless quality metric, which allows comparing the performance of different ASR models on a speech dataset without ground truth transcriptions. To estimate the quality of ASR hypotheses, a pre-trained language model (LM) is fine-tuned with contrastive learning in a self-supervised learning manner. In experiments conducted on several unseen test datasets consisting of outputs from top commercial ASR engines in various languages, the proposed referenceless metric obtains a much higher correlation with WER scores and their ranks than the perplexity metric from the state-of-art multi-lingual LM in all experiments, and also reduces WER by more than $7\%$ when used for ensembling hypotheses. The fine-tuned model and experiments are made available for the reproducibility: https://github.com/aixplain/NoRefER, Comment: arXiv admin note: substantial text overlap with arXiv:2306.12577
Published: 2023

6. Efficient Machine Translation Corpus Generation

Author: Yuksel, Kamer Ali, Gunduz, Ahmet, Sharma, Shreyas, Sawaf, Hassan, Yuksel, Kamer Ali, Gunduz, Ahmet, Sharma, Shreyas, and Sawaf, Hassan
Abstract: This paper proposes an efficient and semi-automated method for human-in-the-loop post-editing for machine translation (MT) corpus generation. The method is based on online training of a custom MT quality estimation metric on-the-fly as linguists perform post-edits. The online estimator is used to prioritize worse hypotheses for post-editing, and auto-close best hypotheses without post-editing. This way, significant improvements can be achieved in the resulting quality of post-edits at a lower cost due to reduced human involvement. The trained estimator can also provide an online sanity check mechanism for post-edits and remove the need for additional linguists to review them or work on the same hypotheses. In this paper, the effect of prioritizing with the proposed method on the resulting MT corpus quality is presented versus scheduling hypotheses randomly. As demonstrated by experiments, the proposed method improves the lifecycle of MT models by focusing the linguist effort on production samples and hypotheses, which matter most for expanding MT corpora to be used for re-training them.
Published: 2023

7. EvolveMT: an Ensemble MT Engine Improving Itself with Usage Only

Author: Yuksel, Kamer Ali, Gunduz, Ahmet, Al-Badrashiny, Mohamed, Sharma, Shreyas, Sawaf, Hassan, Yuksel, Kamer Ali, Gunduz, Ahmet, Al-Badrashiny, Mohamed, Sharma, Shreyas, and Sawaf, Hassan
Abstract: This paper presents EvolveMT for efficiently combining multiple machine translation (MT) engines. The proposed system selects the output from a single engine for each segment by utilizing online learning techniques to predict the most suitable system for every translation request. A neural quality estimation metric supervises the method without requiring reference translations. The online learning capability of this system allows for dynamic adaptation to alterations in the domain or machine translation engines, thereby obviating the necessity for additional training. EvolveMT selects a subset of translation engines to be called based on the source sentence features. The degree of exploration is configurable according to the desired quality-cost trade-off. Results from custom datasets demonstrate that EvolveMT achieves similar translation accuracy at a lower cost than selecting the best translation of each segment from all translations using an MT quality estimator. To our knowledge, EvolveMT is the first meta MT system that adapts itself after deployment to incoming translation requests from the production environment without needing costly retraining on human feedback.
Published: 2023

8. Granular Motor State Monitoring of Free Living Parkinson's Disease Patients via Deep Learning

Author: Yuksel, Kamer A., Goschenhofer, Jann, Varma, Hridya V., Fietzek, Urban, Pfister, Franz M. J., Yuksel, Kamer A., Goschenhofer, Jann, Varma, Hridya V., Fietzek, Urban, and Pfister, Franz M. J.
Abstract: Parkinson's disease (PD) is the second most common neurodegenerative disease worldwide and affects around 1% of the (60+ years old) elderly population in industrial nations. More than 80% of PD patients suffer from motor symptoms, which could be well addressed if a personalized medication schedule and dosage could be administered to them. However, such personalized medication schedule requires a continuous, objective and precise measurement of motor symptoms experienced by the patients during their regular daily activities. In this work, we propose the use of a wrist-worn smart-watch, which is equipped with 3D motion sensors, for estimating the motor fluctuation severity of PD patients in a free-living environment. We introduce a novel network architecture, a post-training scheme and a custom loss function that accounts for label noise to improve the results of our previous work in this domain and to establish a novel benchmark for nine-level PD motor state estimation., Comment: Machine Learning for Health (ML4H) at NeurIPS 2019 -- Extended Abstract
Published: 2019

9. Wearable-based Parkinson's Disease Severity Monitoring using Deep Learning

Author: Goschenhofer, Jann, Pfister, Franz MJ, Yuksel, Kamer Ali, Bischl, Bernd, Fietzek, Urban, Thomas, Janek, Goschenhofer, Jann, Pfister, Franz MJ, Yuksel, Kamer Ali, Bischl, Bernd, Fietzek, Urban, and Thomas, Janek
Abstract: One major challenge in the medication of Parkinson's disease is that the severity of the disease, reflected in the patients' motor state, cannot be measured using accessible biomarkers. Therefore, we develop and examine a variety of statistical models to detect the motor state of such patients based on sensor data from a wearable device. We find that deep learning models consistently outperform a classical machine learning model applied on hand-crafted features in this time series classification task. Furthermore, our results suggest that treating this problem as a regression instead of an ordinal regression or a classification task is most appropriate. For consistent model evaluation and training, we adopt the leave-one-subject-out validation scheme to the training of deep learning models. We also employ a class-weighting scheme to successfully mitigate the problem of high multi-class imbalances in this domain. In addition, we propose a customized performance measure that reflects the requirements of the involved medical staff on the model. To solve the problem of limited availability of high quality training data, we propose a transfer learning technique which helps to improve model performance substantially. Our results suggest that deep learning techniques offer a high potential to autonomously detect motor states of patients with Parkinson's disease.
Published: 2019

10. Static analysis of executables for collaborative malware detection on Android

Author: Fettweis, G, Stuttgen, H, Kabacinski, W, Schmidt, Aubrey-Derrick, Bye, Rainer, Schmidt, Hans-Gunterh, Clausen, Jan, K, Osman, Yuksel, Kamer Ail, Camtepe, Seyit, Albayrak, Sahin, Fettweis, G, Stuttgen, H, Kabacinski, W, Schmidt, Aubrey-Derrick, Bye, Rainer, Schmidt, Hans-Gunterh, Clausen, Jan, K, Osman, Yuksel, Kamer Ail, Camtepe, Seyit, and Albayrak, Sahin
Published: 2009

11. Enhancing security of linux-based android devices

Author: Schmidt, Aubrey-Derrick, Schmidt, Hans-Gunterh, Clausen, Jan, Yuksel, Kamer Ail, Osman, Kiraz, Camtepe, Seyit, Albayrak, Sahin, Schmidt, Aubrey-Derrick, Schmidt, Hans-Gunterh, Clausen, Jan, Yuksel, Kamer Ail, Osman, Kiraz, Camtepe, Seyit, and Albayrak, Sahin
Published: 2008

12. Monitoring android for collaborative anomaly detection: a first architectural draft [Technical Report: TUB-DAI 08/08-02]

Author: Schmidt, Aubrey-Derrick, Bye, Rainer, Schmidt, Hans-Gunterh, Yuksel, Kamer Ail, K, Osman, Clausen, Jan, Raddatz, Karsten, Camtepe, Seyit, Albayrak, Sahin, Schmidt, Aubrey-Derrick, Bye, Rainer, Schmidt, Hans-Gunterh, Yuksel, Kamer Ail, K, Osman, Clausen, Jan, Raddatz, Karsten, Camtepe, Seyit, and Albayrak, Sahin
Published: 2008

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

12 results on '"Yuksel, Kamer"'

1. An Automated End-to-End Open-Source Software for High-Quality Text-to-Speech Dataset Generation

2. Word-Level ASR Quality Estimation for Efficient Corpus Sampling and Post-Editing through Analyzing Attentions of a Reference-Free Metric

3. Generative Meta-Learning Robust Quality-Diversity Portfolio

4. NoRefER: a Referenceless Quality Metric for Automatic Speech Recognition via Semi-Supervised Language Model Fine-Tuning with Contrastive Learning

5. A Reference-less Quality Metric for Automatic Speech Recognition via Contrastive-Learning of a Multi-Language Model with Self-Supervision

6. Efficient Machine Translation Corpus Generation

7. EvolveMT: an Ensemble MT Engine Improving Itself with Usage Only

8. Granular Motor State Monitoring of Free Living Parkinson's Disease Patients via Deep Learning

9. Wearable-based Parkinson's Disease Severity Monitoring using Deep Learning

10. Static analysis of executables for collaborative malware detection on Android

11. Enhancing security of linux-based android devices

12. Monitoring android for collaborative anomaly detection: a first architectural draft [Technical Report: TUB-DAI 08/08-02]

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Publication Year Range

Publication Type

Database

Publisher

12 results on '"Yuksel, Kamer"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources