Author: "Daniel Povey" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Daniel Povey"' showing total 367 results

Start Over Author "Daniel Povey"

367 results on '"Daniel Povey"'

1. SUBLLM: A Novel Efficient Architecture with Token Sequence Subsampling for LLM.

Author: Quandong Wang, Yuxuan Yuan, Xiaoyu Yang, Ruike Zhang, Kang Zhao, Wei Liu 0005, Jian Luan 0001, Daniel Povey, and Bin Wang 0004
Published: 2024
Full Text: View/download PDF

2. Towards Universal Speech Discrete Tokens: A Case Study for ASR and TTS.

Author: Yifan Yang, Feiyu Shen, Chenpeng Du, Ziyang Ma, Kai Yu 0004, Daniel Povey, and Xie Chen 0001
Published: 2024
Full Text: View/download PDF

3. Less Peaky and More Accurate CTC Forced Alignment by Label Priors.

Author: Ruizhe Huang, Xiaohui Zhang 0007, Zhaoheng Ni, Li Sun, Moto Hira, Jeff Hwang, Vimal Manohar, Vineel Pratap, Matthew Wiesner, Shinji Watanabe 0001, Daniel Povey, and Sanjeev Khudanpur
Published: 2024
Full Text: View/download PDF

4. PromptASR for Contextualized ASR with Controllable Style.

Author: Xiaoyu Yang, Wei Kang 0006, Zengwei Yao, Yifan Yang, Liyong Guo, Fangjun Kuang, Long Lin, and Daniel Povey
Published: 2024
Full Text: View/download PDF

5. Libriheavy: A 50, 000 Hours ASR Corpus with Punctuation Casing and Context.

Author: Wei Kang 0006, Xiaoyu Yang, Zengwei Yao, Fangjun Kuang, Yifan Yang, Liyong Guo, Long Lin, and Daniel Povey
Published: 2024
Full Text: View/download PDF

6. On Speaker Attribution with SURT.

Author: Desh Raj, Matthew Wiesner, Matthew Maciejewski, Paola García 0001, Daniel Povey, and Sanjeev Khudanpur
Published: 2024
Full Text: View/download PDF

7. Zipformer: A faster and better encoder for automatic speech recognition.

Author: Zengwei Yao, Liyong Guo, Xiaoyu Yang, Wei Kang 0006, Fangjun Kuang, Yifan Yang, Zengrui Jin, Long Lin, and Daniel Povey
Published: 2024

8. LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization.

Author: Zengrui Jin, Yifan Yang, Mohan Shi, Wei Kang 0006, Xiaoyu Yang, Zengwei Yao, Fangjun Kuang, Liyong Guo, Lingwei Meng, Long Lin, Yong Xu 0004, Shi-Xiong Zhang 0001, and Daniel Povey
Published: 2024
Full Text: View/download PDF

9. Improving Neural Biasing for Contextual Speech Recognition by Early Context Injection and Text Perturbation.

Author: Ruizhe Huang, Mahsa Yarmohammadi, Sanjeev Khudanpur, and Daniel Povey
Published: 2024
Full Text: View/download PDF

10. GPU-accelerated Guided Source Separation for Meeting Transcription.

Author: Desh Raj, Daniel Povey, and Sanjeev Khudanpur
Published: 2023
Full Text: View/download PDF

11. Blank-regularized CTC for Frame Skipping in Neural Transducer.

Author: Yifan Yang, Xiaoyu Yang, Liyong Guo, Zengwei Yao, Wei Kang 0006, Fangjun Kuang, Long Lin, Xie Chen 0001, and Daniel Povey
Published: 2023
Full Text: View/download PDF

12. Delay-penalized CTC Implemented Based on Finite State Transducer.

Author: Zengwei Yao, Wei Kang 0006, Fangjun Kuang, Liyong Guo, Xiaoyu Yang, Yifan Yang, Long Lin, and Daniel Povey
Published: 2023
Full Text: View/download PDF

13. Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect Transcripts.

Author: Dongji Gao, Matthew Wiesner, Hainan Xu, Leibny Paola García, Daniel Povey, and Sanjeev Khudanpur
Published: 2023
Full Text: View/download PDF

14. Learning From Flawed Data: Weakly Supervised Automatic Speech Recognition.

Author: Dongji Gao, Hainan Xu, Desh Raj, Leibny Paola García-Perera, Daniel Povey, and Sanjeev Khudanpur
Published: 2023
Full Text: View/download PDF

15. Fast and Parallel Decoding for Transducer.

Author: Wei Kang 0006, Liyong Guo, Fangjun Kuang, Long Lin, Mingshuang Luo, Zengwei Yao, Xiaoyu Yang, Piotr Zelasko, and Daniel Povey
Published: 2023
Full Text: View/download PDF

16. Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation.

Author: Liyong Guo, Xiaoyu Yang, Quandong Wang, Yuxiang Kong, Zengwei Yao, Fan Cui, Fangjun Kuang, Wei Kang 0006, Long Lin, Mingshuang Luo, Piotr Zelasko, and Daniel Povey
Published: 2023
Full Text: View/download PDF

17. Delay-Penalized Transducer for Low-Latency Streaming ASR.

Author: Wei Kang 0006, Zengwei Yao, Fangjun Kuang, Liyong Guo, Xiaoyu Yang, Long Lin, Piotr Zelasko, and Daniel Povey
Published: 2023
Full Text: View/download PDF

18. Building Keyword Search System from End-To-End Asr Systems.

Author: Ruizhe Huang, Matthew Wiesner, Leibny Paola García-Perera, Daniel Povey, Jan Trmal, and Sanjeev Khudanpur
Published: 2023
Full Text: View/download PDF

19. Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition.

Author: Han Zhu 0004, Dongji Gao, Gaofeng Cheng, Daniel Povey, Pengyuan Zhang, and Yonghong Yan 0002
Published: 2023
Full Text: View/download PDF

20. SURT 2.0: Advances in Transducer-Based Multi-Talker Speech Recognition.

Author: Desh Raj, Daniel Povey, and Sanjeev Khudanpur
Published: 2023
Full Text: View/download PDF

21. Pruned RNN-T for fast, memory-efficient ASR training.

Author: Fangjun Kuang, Liyong Guo, Wei Kang 0006, Long Lin, Mingshuang Luo, Zengwei Yao, and Daniel Povey
Published: 2022
Full Text: View/download PDF

22. Zipformer: A faster and better encoder for automatic speech recognition.

Author: Zengwei Yao, Liyong Guo, Xiaoyu Yang, Wei Kang 0006, Fangjun Kuang, Yifan Yang, Zengrui Jin, Long Lin, and Daniel Povey
Published: 2023
Full Text: View/download PDF

23. PromptASR for contextualized ASR with controllable style.

Author: Xiaoyu Yang, Wei Kang 0006, Zengwei Yao, Yifan Yang, Liyong Guo, Fangjun Kuang, Long Lin, and Daniel Povey
Published: 2023
Full Text: View/download PDF

24. Libriheavy: a 50, 000 hours ASR corpus with punctuation casing and context.

Author: Wei Kang 0006, Xiaoyu Yang, Zengwei Yao, Fangjun Kuang, Yifan Yang, Liyong Guo, Long Lin, and Daniel Povey
Published: 2023
Full Text: View/download PDF

25. Towards Universal Speech Discrete Tokens: A Case Study for ASR and TTS.

Author: Yifan Yang, Feiyu Shen, Chenpeng Du, Ziyang Ma, Kai Yu 0004, Daniel Povey, and Xie Chen 0001
Published: 2023
Full Text: View/download PDF

26. speechocean762: An Open-Source Non-Native English Speech Corpus for Pronunciation Assessment.

Author: Junbo Zhang, Zhiwen Zhang, Yongqing Wang, Zhiyong Yan, Qiong Song, Yukai Huang, Ke Li 0018, Daniel Povey, and Yujun Wang
Published: 2021
Full Text: View/download PDF

27. GigaSpeech: An Evolving, Multi-Domain ASR Corpus with 10, 000 Hours of Transcribed Audio.

Author: Guoguo Chen, Shuzhou Chai, Guan-Bo Wang, Jiayu Du, Wei-Qiang Zhang, Chao Weng, Dan Su 0002, Daniel Povey, Jan Trmal, Junbo Zhang, Mingjie Jin, Sanjeev Khudanpur, Shinji Watanabe 0001, Shuaijiang Zhao, Wei Zou, Xiangang Li, Xuchen Yao, Yongqing Wang, Zhao You, and Zhiyong Yan
Published: 2021
Full Text: View/download PDF

28. Wake Word Detection with Streaming Transformers.

Author: Yiming Wang 0006, Hang Lv 0001, Daniel Povey, Lei Xie 0001, and Sanjeev Khudanpur
Published: 2021
Full Text: View/download PDF

29. An Asynchronous WFST-Based Decoder for Automatic Speech Recognition.

Author: Hang Lv 0001, Zhehuai Chen, Hainan Xu, Daniel Povey, Lei Xie 0001, and Sanjeev Khudanpur
Published: 2021
Full Text: View/download PDF

30. A Parallelizable Lattice Rescoring Strategy with Neural Language Models.

Author: Ke Li 0018, Daniel Povey, and Sanjeev Khudanpur
Published: 2021
Full Text: View/download PDF

31. DOVER-Lap: A Method for Combining Overlap-Aware Diarization Outputs.

Author: Desh Raj, Leibny Paola García-Perera, Zili Huang, Shinji Watanabe 0001, Daniel Povey, Andreas Stolcke, and Sanjeev Khudanpur
Published: 2021
Full Text: View/download PDF

32. Neural Language Modeling with Implicit Cache Pointers.

Author: Ke Li 0018, Daniel Povey, and Sanjeev Khudanpur
Published: 2020
Full Text: View/download PDF

33. PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASR.

Author: Yiwen Shao, Yiming Wang 0006, Daniel Povey, and Sanjeev Khudanpur
Published: 2020
Full Text: View/download PDF

34. An Alternative to MFCCs for ASR.

Author: Pegah Ghahramani, Hossein Hadian, Daniel Povey, Hynek Hermansky, and Sanjeev Khudanpur
Published: 2020
Full Text: View/download PDF

35. Lattice-Free Maximum Mutual Information Training of Multilingual Speech Recognition Systems.

Author: Srikanth R. Madikeri, Banriskhem K. Khonglah, Sibo Tong, Petr Motlícek, Hervé Bourlard, and Daniel Povey
Published: 2020
Full Text: View/download PDF

36. Wake Word Detection with Alignment-Free Lattice-Free MMI.

Author: Yiming Wang 0006, Hang Lv 0001, Daniel Povey, Lei Xie 0001, and Sanjeev Khudanpur
Published: 2020
Full Text: View/download PDF

37. Efficient MDI Adaptation for n-Gram Language Models.

Author: Ruizhe Huang, Ke Li 0018, Ashish Arora, Daniel Povey, and Sanjeev Khudanpur
Published: 2020
Full Text: View/download PDF

38. OOV Recovery with Efficient 2nd Pass Decoding and Open-vocabulary Word-level RNNLM Rescoring for Hybrid ASR.

Author: Xiaohui Zhang 0007, Daniel Povey, and Sanjeev Khudanpur
Published: 2020
Full Text: View/download PDF

39. Speaker Diarization with Region Proposal Network.

Author: Zili Huang, Shinji Watanabe 0001, Yusuke Fujita, Paola García 0001, Yiwen Shao, Daniel Povey, and Sanjeev Khudanpur
Published: 2020
Full Text: View/download PDF

40. Gpu-Accelerated Viterbi Exact Lattice Decoder for Batched Online and Offline Speech Recognition.

Author: Hugo Braun, Justin Luitjens, Ryan Leary, Tim Kaldewey, and Daniel Povey
Published: 2020
Full Text: View/download PDF

41. An Empirical Study of Transformer-Based Neural Language Model Adaptation.

Author: Ke Li 0018, Zhe Liu 0011, Tianxing He, Hongzhao Huang, Fuchun Peng, Daniel Povey, and Sanjeev Khudanpur
Published: 2020
Full Text: View/download PDF

42. LET-Decoder: A WFST-Based Lazy-Evaluation Token-Group Decoder With Exact Lattice Generation.

Author: Hang Lv 0001, Daniel Povey, Mahsa Yarmohammadi, Ke Li 0018, Yiming Wang 0006, Lei Xie 0001, and Sanjeev Khudanpur
Published: 2021
Full Text: View/download PDF

43. Improving Emotion Identification Using Phone Posteriors in Raw Speech Waveform Based DNN.

Author: Mousmita Sarma, Pegah Ghahremani, Daniel Povey, Nagendra Kumar Goel, Kandarpa Kumar Sarma, and Najim Dehak
Published: 2019
Full Text: View/download PDF

44. State-of-the-Art Speaker Recognition for Telephone and Video Speech: The JHU-MIT Submission for NIST SRE18.

Author: Jesús Villalba 0001, Nanxin Chen, David Snyder, Daniel Garcia-Romero, Alan McCree, Gregory Sell, Jonas Borgstrom, Fred Richardson, Suwon Shon, François Grondin, Réda Dehak, Leibny Paola García-Perera, Daniel Povey, Pedro A. Torres-Carrasquillo, Sanjeev Khudanpur, and Najim Dehak
Published: 2019
Full Text: View/download PDF

45. The JHU ASR System for VOiCES from a Distance Challenge 2019.

Author: Yiming Wang 0006, David Snyder, Hainan Xu, Vimal Manohar, Phani Sankar Nidadavolu, Daniel Povey, and Sanjeev Khudanpur
Published: 2019
Full Text: View/download PDF

46. The JHU Speaker Recognition System for the VOiCES 2019 Challenge.

Author: David Snyder, Jesús Villalba 0001, Nanxin Chen, Daniel Povey, Gregory Sell, Najim Dehak, and Sanjeev Khudanpur
Published: 2019
Full Text: View/download PDF

47. Advances in Automatic Speech Recognition for Child Speech Using Factored Time Delay Neural Network.

Author: Fei Wu, Leibny Paola García-Perera, Daniel Povey, and Sanjeev Khudanpur
Published: 2019
Full Text: View/download PDF

48. Multi-PLDA Diarization on Children's Speech.

Author: Jiamin Xie, Leibny Paola García-Perera, Daniel Povey, and Sanjeev Khudanpur
Published: 2019
Full Text: View/download PDF

49. x-Vector DNN Refinement with Full-Length Recordings for Speaker Recognition.

Author: Daniel Garcia-Romero, David Snyder, Gregory Sell, Alan McCree, Daniel Povey, and Sanjeev Khudanpur
Published: 2019
Full Text: View/download PDF

50. Speaker Recognition Benchmark Using the CHiME-5 Corpus.

Author: Daniel Garcia-Romero, David Snyder, Shinji Watanabe 0001, Gregory Sell, Alan McCree, Daniel Povey, and Sanjeev Khudanpur
Published: 2019
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Category

Publication Type

Journal

Database

Publisher

367 results on '"Daniel Povey"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources