Your search for author "Kawai, Hisashi" returned 1,103 results in total

Search Results

1. Temporal Order Preserved Optimal Transport-based Cross-modal Knowledge Transfer Learning for ASR

3. Generative linguistic representation for spoken language identification

4. Speaker Mask Transformer for Multi-talker Overlapped Speech Recognition

5. Neural domain alignment for spoken language recognition based on optimal transport

7. Hierarchical Cross-Modality Knowledge Transfer with Sinkhorn Attention for CTC-based ASR

8. Cross-modal Alignment with Optimal Transport for CTC-based ASR

9. Pronunciation-aware unique character encoding for RNN Transducer-based Mandarin speech recognition

10. Speaking-Rate-Controllable HiFi-GAN Using Feature Interpolation

11. Transducer-based language embedding for spoken language identification

13. Partial Coupling of Optimal Transport for Spoken Language Identification

15. Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification

16. CrossMap Transformer: A Crossmodal Masked Path Transformer Using Double Back-Translation for Vision-and-Language Navigation

17. Predicting and Attending to Damaging Collisions for Placing Everyday Objects in Photo-Realistic Simulations

18. Coupling a generative model with a discriminative learning framework for speaker verification

20. Unsupervised neural adaptation model based on optimal transport for spoken language identification

21. Quasi-Periodic Parallel WaveGAN: A Non-autoregressive Raw Waveform Generative Model with Pitch-dependent Dilated Convolution Neural Network

22. Alleviating the Burden of Labeling: Sentence Generation by Attention Branch Encoder-Decoder Network

23. Quasi-Periodic Parallel WaveGAN Vocoder: A Non-autoregressive Pitch-dependent Dilated Convolution Model for Parametric Speech Generation

25. Cross-scale Attention Model for Acoustic Event Classification

26. A Multimodal Target-Source Classifier with Attention Branches to Understand Ambiguous Instructions for Fetching Daily Objects

27. Multimodal Attention Branch Network for Perspective-Free Sentence Generation

28. Understanding Natural Language Instructions for Fetching Daily Objects Using GAN-Based Multimodal Target-Source Classification

29. Incorporating Symbolic Sequential Modeling for Speech Enhancement

35. A Multimodal Classifier Generative Adversarial Network for Carry and Place Tasks from Ambiguous Language Instructions

36. Grounded Language Understanding for Manipulation Instructions Using GAN-Based Classification

37. Field Experiment System 'VoiceTra'

38. End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks

39. Raw Waveform-based Speech Enhancement by Fully Convolutional Networks

49. Latent-Space Data Augmentation for Visually-Grounded Language Understanding
