Search

Your search keyword '"Daniel Povey"' showing total 151 results

Search Constraints

Start Over You searched for: Author "Daniel Povey" Remove constraint Author: "Daniel Povey" Database OpenAIRE Remove constraint Database: OpenAIRE
151 results on '"Daniel Povey"'

Search Results

1. Fast and parallel decoding for transducer

3. LET-Decoder: A WFST-Based Lazy-Evaluation Token-Group Decoder With Exact Lattice Generation

4. Delay-penalized transducer for low-latency streaming ASR

5. A Parallelizable Lattice Rescoring Strategy with Neural Language Models

6. speechocean762: An Open-Source Non-native English Speech Corpus For Pronunciation Assessment

7. DOVER-Lap: A Method for Combining Overlap-Aware Diarization Outputs

8. An Asynchronous WFST-Based Decoder For Automatic Speech Recognition

9. Wake Word Detection with Streaming Transformers

10. GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio

13. Neural Language Modeling With Implicit Cache Pointers

14. Efficient MDI Adaptation for n-gram Language Models

15. PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASR

16. CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings

17. An Empirical Study of Transformer-Based Neural Language Model Adaptation

18. OOV Recovery with Efficient 2nd Pass Decoding and Open-vocabulary Word-level RNNLM Rescoring for Hybrid ASR

19. Speaker Diarization with Region Proposal Network

20. Wake Word Detection with Alignment-Free Lattice-Free MMI

21. Multistream CNN for Robust Acoustic Modeling

22. Flat-Start Single-Stage Discriminatively Trained HMM-Based Models for ASR

23. Low Latency Acoustic Modeling Using Temporal Convolution and LSTMs

24. Probing the Information Encoded in X-Vectors

25. Incremental Lattice Determinization for WFST Decoders

28. State-of-the-Art Speaker Recognition for Telephone and Video Speech: The JHU-MIT Submission for NIST SRE18

34. Optical Character Recognition with Chinese and Korean Character Decomposition

35. Using ASR Methods for OCR

36. Speaker Recognition for Multi-speaker Conversations Using X-vectors

37. GPU-Accelerated Viterbi Exact Lattice Decoder for Batched Online and Offline Speech Recognition

38. Improving LF-MMI Using Unconstrained Supervisions for ASR

39. A Teacher-Student Learning Approach for Unsupervised Domain Adaptation of Sequence-Trained ASR Models

41. Acoustic Modeling from Frequency Domain Representations of Speech

42. End-to-end Speech Recognition Using Lattice-free MMI

43. Output-Gate Projected Gated Recurrent Unit for Speech Recognition

44. Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks

45. Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge

46. End-to-end Deep Neural Network Age Estimation

47. Emotion Identification from Raw Speech Signals Using DNNs

48. Self-Attentive Speaker Embeddings for Text-Independent Speaker Verification

49. Recurrent Neural Network Language Model Adaptation for Conversational Speech Recognition

50. A GPU-based WFST Decoder with Exact Lattice Generation

Catalog

Books, media, physical & digital resources