367 results on '"Daniel Povey"'
Search Results
152. Quick fmllr for speaker adaptation in speech recognition.
153. Universal background model based speech recognition.
154. Evaluation of Proposed Modifications to MPE for Large Scale Discriminative Training.
155. The Impact of ASR on Speech-to-Speech Translation Performance.
156. The IBM 2006 Gale Arabic ASR System.
157. The IBM Rich Transcription Spring 2006 Speech-to-Text System for Lecture Meetings.
158. Morpheme-Based Language Modeling for Arabic Lvcsr.
159. Secondary Classification for GMM Based Speaker Recognition.
160. Automated Quality Monitoring in the Call Center with ASR and Maximum Entropy.
161. Improvements to fMPE for discriminative training of features.
162. Discriminatively trained features using fMPE for multi-stream audio-visual speech recognition.
163. Anatomy of an extremely fast LVCSR decoder.
164. fMPE: Discriminatively Trained Features for Speech Recognition.
165. The IBM 2004 Conversational Telephony System for Rich Transcription.
166. Phone duration modeling for LVCSR.
167. Feature space Gaussianization.
168. MMI-MAP and MPE-MAP for acoustic model adaptation.
169. Discriminative Training for HMM-Based Offline Handwritten Character Recognition.
170. Discriminative map for acoustic model adaptation.
171. Porting: SwitchBoard to the VoiceMail task.
172. Minimum Phone Error and I-smoothing for improved discriminative training.
173. LET-Decoder: A WFST-Based Lazy-Evaluation Token-Group Decoder With Exact Lattice Generation
174. Translations of the Callhome Egyptian Arabic corpus for conversational speech translation.
175. New features in the CU-HTK system for transcription of conversational telephone speech.
176. Improved discriminative training techniques for large vocabulary continuous speech recognition.
177. Krylov Subspace Descent for Deep Learning.
178. A basis representation of constrained MLLR transforms for robust adaptation.
179. Minimum Bayes Risk decoding and system combination based on a recursion for edit distance.
180. The subspace Gaussian mixture model - A structured model for speech recognition.
181. Frame discrimination training for HMMs for large vocabulary speech recognition.
182. Advances in Arabic Speech Transcription at IBM Under the DARPA GALE Program.
183. Parallel training of Deep Neural Networks with Natural Gradient and Parameter Averaging.
184. MUSAN: A Music, Speech, and Noise Corpus.
185. Advances in speech transcription at IBM under the DARPA EARS program.
186. Automatic transcription of conversational telephone speech.
187. Large scale discriminative training of hidden Markov models for speech recognition.
188. SPAM and full covariance for speech recognition.
189. Feature and model space speaker adaptation with full covariance Gaussians.
190. Automated Quality Monitoring for Call Centers using Speech and NLP Technologies.
191. A Parallelizable Lattice Rescoring Strategy with Neural Language Models
192. speechocean762: An Open-Source Non-native English Speech Corpus For Pronunciation Assessment
193. DOVER-Lap: A Method for Combining Overlap-Aware Diarization Outputs
194. An Asynchronous WFST-Based Decoder For Automatic Speech Recognition
195. Wake Word Detection with Streaming Transformers
196. GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio
197. Lattice-Free Maximum Mutual Information Training of Multilingual Speech Recognition Systems
198. An Alternative to MFCCs for ASR
199. Neural Language Modeling With Implicit Cache Pointers
200. Efficient MDI Adaptation for n-gram Language Models
Catalog
Books, media, physical & digital resources
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.