Search

Your search keyword '"Kawai, Hisashi"' showing total 27 results

Search Constraints

Start Over You searched for: Author "Kawai, Hisashi" Remove constraint Author: "Kawai, Hisashi" Database arXiv Remove constraint Database: arXiv
27 results on '"Kawai, Hisashi"'

Search Results

1. Temporal Order Preserved Optimal Transport-based Cross-modal Knowledge Transfer Learning for ASR

2. Generative linguistic representation for spoken language identification

3. Speaker Mask Transformer for Multi-talker Overlapped Speech Recognition

4. Neural domain alignment for spoken language recognition based on optimal transport

5. Hierarchical Cross-Modality Knowledge Transfer with Sinkhorn Attention for CTC-based ASR

6. Cross-modal Alignment with Optimal Transport for CTC-based ASR

7. Pronunciation-aware unique character encoding for RNN Transducer-based Mandarin speech recognition

8. Speaking-Rate-Controllable HiFi-GAN Using Feature Interpolation

9. Transducer-based language embedding for spoken language identification

10. Partial Coupling of Optimal Transport for Spoken Language Identification

11. Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification

12. CrossMap Transformer: A Crossmodal Masked Path Transformer Using Double Back-Translation for Vision-and-Language Navigation

13. Predicting and Attending to Damaging Collisions for Placing Everyday Objects in Photo-Realistic Simulations

14. Coupling a generative model with a discriminative learning framework for speaker verification

15. Unsupervised neural adaptation model based on optimal transport for spoken language identification

16. Quasi-Periodic Parallel WaveGAN: A Non-autoregressive Raw Waveform Generative Model with Pitch-dependent Dilated Convolution Neural Network

17. Alleviating the Burden of Labeling: Sentence Generation by Attention Branch Encoder-Decoder Network

18. Quasi-Periodic Parallel WaveGAN Vocoder: A Non-autoregressive Pitch-dependent Dilated Convolution Model for Parametric Speech Generation

19. Cross-scale Attention Model for Acoustic Event Classification

20. A Multimodal Target-Source Classifier with Attention Branches to Understand Ambiguous Instructions for Fetching Daily Objects

21. Multimodal Attention Branch Network for Perspective-Free Sentence Generation

22. Understanding Natural Language Instructions for Fetching Daily Objects Using GAN-Based Multimodal Target-Source Classification

23. Incorporating Symbolic Sequential Modeling for Speech Enhancement

24. A Multimodal Classifier Generative Adversarial Network for Carry and Place Tasks from Ambiguous Language Instructions

25. Grounded Language Understanding for Manipulation Instructions Using GAN-Based Classification

26. End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks

27. Raw Waveform-based Speech Enhancement by Fully Convolutional Networks

Catalog

Books, media, physical & digital resources