Author: "Vogel, Manfred" / Language: undetermined - Searchworks@Jio Institute Digital Library Search Results

1. Text-to-Speech Pipeline for Swiss German -- A comparison

Author: Bollinger, Tobias, Deriu, Jan, and Vogel, Manfred
Subjects: FOS: Computer and information sciences, Sound (cs.SD), Computer Science - Computation and Language, Audio and Speech Processing (eess.AS), FOS: Electrical engineering, electronic engineering, information engineering, Computation and Language (cs.CL), Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: In this work, we studied the synthesis of Swiss German speech using different Text-to-Speech (TTS) models. We evaluated the TTS models on three corpora, and we found, that VITS models performed best, hence, using them for further testing. We also introduce a new method to evaluate TTS models by letting the discriminator of a trained vocoder GAN model predict whether a given waveform is human or synthesized. In summary, our best model delivers speech synthesis for different Swiss German dialects with previously unachieved quality.
Published: 2023
Full Text: View/download PDF

2. 2nd Swiss German Speech to Standard German Text Shared Task at SwissText 2022

Author: Plüss, Michel, Schraner, Yanick, Scheller, Christian, and Vogel, Manfred
Subjects: FOS: Computer and information sciences, Computer Science - Computation and Language, Computation and Language (cs.CL)
Abstract: We present the results and findings of the 2nd Swiss German speech to Standard German text shared task at SwissText 2022. Participants were asked to build a sentence-level Swiss German speech to Standard German text system specialized on the Grisons dialect. The objective was to maximize the BLEU score on a test set of Grisons speech. 3 teams participated, with the best-performing system achieving a BLEU score of 70.1., Comment: 3 pages, 0 figures, to appear in proceedings of SwissText 2022
Published: 2023
Full Text: View/download PDF

3. Improving Metrics for Speech Translation

Author: Paonessa, Claudio, Frefel, Dominik, and Vogel, Manfred
Subjects: FOS: Computer and information sciences, Computer Science - Computation and Language, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Computation and Language (cs.CL)
Abstract: We introduce Parallel Paraphrasing ($\text{Para}_\text{both}$), an augmentation method for translation metrics making use of automatic paraphrasing of both the reference and hypothesis. This method counteracts the typically misleading results of speech translation metrics such as WER, CER, and BLEU if only a single reference is available. We introduce two new datasets explicitly created to measure the quality of metrics intended to be applied to Swiss German speech-to-text systems. Based on these datasets, we show that we are able to significantly improve the correlation with human quality perception if our method is applied to commonly used metrics., Comment: Preprint SwissText 2023
Published: 2023
Full Text: View/download PDF

4. A Learning Rate Method for Full-Batch Gradient Descent

Author: Vogel Manfred and Asadi Soodabeh
Subjects: Control theory, Computer science, General Medicine, Gradient descent
Abstract: In this paper, we present a learning rate method for gradient descent using only first order information. This method requires no manual tuning of the learning rate. We applied this method on a linear neural network built from scratch, along with the full-batch gradient descent, where we calculated the gradients for the whole dataset to perform one parameter update. We tested the method on a moderate sized dataset of housing information and compared the result with that of the Adam optimizer used with a sequential neural network model from Keras. The comparison shows that our method finds the minimum in a much fewer number of epochs than does Adam.
Published: 2020

5. Swiss Parliaments Corpus, an Automatically Aligned Swiss German Speech to Standard German Text Corpus

Author: Plüss, Michel, Neukom, Lukas, Scheller, Christian, and Vogel, Manfred
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Computation and Language, Computation and Language (cs.CL), Machine Learning (cs.LG)
Abstract: We present the Swiss Parliaments Corpus (SPC), an automatically aligned Swiss German speech to Standard German text corpus. This first version of the corpus is based on publicly available data of the Bernese cantonal parliament and consists of 293 hours of data. It was created using a novel forced sentence alignment procedure and an alignment quality estimator, which can be used to trade off corpus size and quality. We trained Automatic Speech Recognition (ASR) models as baselines on different subsets of the data and achieved a Word Error Rate (WER) of 0.278 and a BLEU score of 0.586 on the SPC test set. The corpus is freely available for download., Comment: 8 pages, 0 figures
Published: 2020
Full Text: View/download PDF

6. EUREC4A: A Field Campaign to Elucidate the Couplings Between Clouds, Convection and Circulation

Author: Sandrine Bony, Bjorn Stevens, Felix Ament, Sebastien Bigorre, Patrick Chazette, Susanne Crewell, Julien Delanoë, Kerry Emanuel, David Farrell, Cyrille Flamant, Silke Gross, Lutz Hirsch, Johannes Karstensen, Bernhard Mayer, Louise Nuijens, James H. Ruppert, Irina Sandu, Pier Siebesma, Sabrina Speich, Frédéric Szczap, Julien Totems, Raphaela Vogel, Manfred Wendisch, Martin Wirth
Published: 2017
Full Text: View/download PDF

7. Optimization of 4D Process Planning using Genetic Algorithms

Author: Vogel, Manfred, Breit, Manfred, and Märki, Fabian
Abstract: The presented work focuses on the presentation of a discrete event simulator which can be used for automated sequencing and optimization of building processes. The sequencing is based on the commonly used component–activity–resource relations taking structural and process constraints into account. For the optimization a genetic algorithm approach was developed, implemented and successfully applied to several real life steel constructions. In this contribution we discuss the application of the discrete event simulator including its optimization capabilities on a 4D process model of a steel structure of an automobile recycling facility.
Published: 2004
Full Text: View/download PDF

8. COMPOSITION TO REDUCE THE LEVEL OF SUGAR IN THE BLOOD

Author: SUEDZUCKER AG, HEINZ FRITZ, HERTEL SABINE, and VOGEL MANFRED
Subjects: A61K31/70
Abstract: The invention relates to the use of L-xylulose as an inhibitor for saccharase and maltase enzyme activity.

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

8 results on '"Vogel, Manfred"'

1. Text-to-Speech Pipeline for Swiss German -- A comparison

2. 2nd Swiss German Speech to Standard German Text Shared Task at SwissText 2022

3. Improving Metrics for Speech Translation

4. A Learning Rate Method for Full-Batch Gradient Descent

5. Swiss Parliaments Corpus, an Automatically Aligned Swiss German Speech to Standard German Text Corpus

6. EUREC4A: A Field Campaign to Elucidate the Couplings Between Clouds, Convection and Circulation

7. Optimization of 4D Process Planning using Genetic Algorithms

8. COMPOSITION TO REDUCE THE LEVEL OF SUGAR IN THE BLOOD

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Journal

Database

Publisher

8 results on '"Vogel, Manfred"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources