Descriptor: "Linguistic Data Consortium" / Journal: the journal of the acoustical society of america - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Linguistic Data Consortium"' showing total 2 results

Start Over Descriptor "Linguistic Data Consortium" Journal the journal of the acoustical society of america

2 results on '"Linguistic Data Consortium"'

1. Some musings on systematic variability and speech recognition

Author: Jordan Cohen
Subjects: Linguistic Data Consortium, Audio mining, Training set, Acoustics and Ultrasonics, Arts and Humanities (miscellaneous), Computer science, Speech recognition, Acoustic model, Observer (special relativity), Speaker recognition, Speech processing
Abstract: One of the major problems in speech recognition is the inability of trained models to generalize appropriately to channel variations, new speakers, or modified acoustics. The naive observer would believe that a multimillion‐parameter system should be sufficient! The difficulty appears to be too many parameters rather than too few. For moderate‐sized training corpora, systems learn all of the conditions in the training data rather than generalizing from the exemplars. (For instance, speech recognition algorithms will generally score the speech from a training speaker higher than that from a speaker who was excluded from the set.) One can force the issue by explicitly modeling systematic variation, and then ‘‘normalizing’’ at the front end or in the acoustic model. Two exemplars of this philosophy are Cepstral mean subtraction and vocal tract normalization [Frontiers in Speech Processing 94, LDC96s40, Linguistic Data Consortium (1995)]. In each case a single parameter from a very restrictive model is estimated, and accounting for the variability explicitly improves performance. Concrete examples of these situations are offered, and the impact of this work on future work in automatic speech recognition is discussed.
Published: 1999

2. Pronunciation variability in the Switchboard corpus

Author: Sean A. Fulop and Patricia A. Keating
Subjects: Acoustics and Ultrasonics, Computer science, business.industry, Pronunciation, computer.software_genre, Lexical item, Linguistics, Linguistic Data Consortium, ComputingMethodologies_PATTERNRECOGNITION, Arts and Humanities (miscellaneous), Transcription (linguistics), ComputingMethodologies_DOCUMENTANDTEXTPROCESSING, Artificial intelligence, business, computer, Natural language processing
Abstract: This paper first describes a project on manual phonetic labeling of some 2075 words from the Switchboard corpus (a 3 million word corpus of unscripted telephone conversations, recorded and orthographically transcribed by Texas Instruments, and available from the Linguistic Data Consortium). Multiple (from 10 to 40) tokens of 72 lexical items were transcribed; the transcription system used was an extension of the TIMITBET, designed to allow a narrower transcription, particularly of consonants. Intertranscriber agreement was assessed using the Oregon Graduate Institute’s metric for transcription accuracy, and comparison with their results for their English telephone corpus will be provided. A number of phonological facts will then be elucidated from the transcriptions. To facilitate this, a database of contextual information has been created for each phoneme in the dictionary forms of the words; the database includes both lexical contextual factors and the actual context in which a given token appears. The ...
Published: 1996

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

2 results on '"Linguistic Data Consortium"'

1. Some musings on systematic variability and speech recognition

2. Pronunciation variability in the Switchboard corpus

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Database

2 results on '"Linguistic Data Consortium"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources