Author: "Hu, Ke" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Hu, Ke"' showing total 5,404 results

Start Over Author "Hu, Ke"

5,404 results on '"Hu, Ke"'

1. KD-MSLRT: Lightweight Sign Language Recognition Model Based on Mediapipe and 3D to 1D Knowledge Distillation

Author: Li, Yulong, Ren, Bolin, Hu, Ke, Liu, Changyuan, Jiang, Zhengyong, Dang, Kang, and Su, Jionglong
Subjects: Computer Science - Computers and Society
Abstract: Artificial intelligence has achieved notable results in sign language recognition and translation. However, relatively few efforts have been made to significantly improve the quality of life for the 72 million hearing-impaired people worldwide. Sign language translation models, relying on video inputs, involves with large parameter sizes, making it time-consuming and computationally intensive to be deployed. This directly contributes to the scarcity of human-centered technology in this field. Additionally, the lack of datasets in sign language translation hampers research progress in this area. To address these, we first propose a cross-modal multi-knowledge distillation technique from 3D to 1D and a novel end-to-end pre-training text correction framework. Compared to other pre-trained models, our framework achieves significant advancements in correcting text output errors. Our model achieves a decrease in Word Error Rate (WER) of at least 1.4% on PHOENIX14 and PHOENIX14T datasets compared to the state-of-the-art CorrNet. Additionally, the TensorFlow Lite (TFLite) quantized model size is reduced to 12.93 MB, making it the smallest, fastest, and most accurate model to date. We have also collected and released extensive Chinese sign language datasets, and developed a specialized training vocabulary. To address the lack of research on data augmentation for landmark data, we have designed comparative experiments on various augmentation methods. Moreover, we performed a simulated deployment and prediction of our model on Intel platform CPUs and assessed the feasibility of deploying the model on other platforms., Comment: AAAI 2025
Published: 2025

2. NeKo: Toward Post Recognition Generative Correction Large Language Models with Task-Oriented Experts

Author: Lin, Yen-Ting, Yang, Chao-Han Huck, Chen, Zhehuai, Zelasko, Piotr, Yang, Xuesong, Chen, Zih-Ching, Puvvada, Krishna C, Fu, Szu-Wei, Hu, Ke, Chiu, Jun Wei, Balam, Jagadeesh, Ginsburg, Boris, and Wang, Yu-Chiang Frank
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Computer Science - Multiagent Systems, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Construction of a general-purpose post-recognition error corrector poses a crucial question: how can we most effectively train a model on a large mixture of domain datasets? The answer would lie in learning dataset-specific features and digesting their knowledge in a single model. Previous methods achieve this by having separate correction language models, resulting in a significant increase in parameters. In this work, we present Mixture-of-Experts as a solution, highlighting that MoEs are much more than a scalability tool. We propose a Multi-Task Correction MoE, where we train the experts to become an ``expert'' of speech-to-text, language-to-text and vision-to-text datasets by learning to route each dataset's tokens to its mapped expert. Experiments on the Open ASR Leaderboard show that we explore a new state-of-the-art performance by achieving an average relative $5.0$% WER reduction and substantial improvements in BLEU scores for speech and translation tasks. On zero-shot evaluation, NeKo outperforms GPT-3.5 and Claude-Opus with $15.5$% to $27.6$% relative WER reduction in the Hyporadise benchmark. NeKo performs competitively on grammar and post-OCR correction as a multi-task model., Comment: NeKo work has been done in June 2024. NeKo LMs will be open source on https://huggingface.co/nvidia under the MIT license
Published: 2024

3. VoiceTextBlender: Augmenting Large Language Models with Speech Capabilities via Single-Stage Joint Speech-Text Supervised Fine-Tuning

Author: Peng, Yifan, Puvvada, Krishna C., Chen, Zhehuai, Zelasko, Piotr, Huang, He, Dhawan, Kunal, Hu, Ke, Watanabe, Shinji, Balam, Jagadeesh, and Ginsburg, Boris
Subjects: Computer Science - Computation and Language, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Recent studies have augmented large language models (LLMs) with speech capabilities, leading to the development of speech language models (SpeechLMs). Earlier SpeechLMs focused on single-turn speech-based question answering (QA), where user input comprised a speech context and a text question. More recent studies have extended this to multi-turn conversations, though they often require complex, multi-stage supervised fine-tuning (SFT) with diverse data. Another critical challenge with SpeechLMs is catastrophic forgetting-where models optimized for speech tasks suffer significant degradation in text-only performance. To mitigate these issues, we propose a novel single-stage joint speech-text SFT approach on the low-rank adaptation (LoRA) of the LLM backbone. Our joint SFT combines text-only SFT data with three types of speech-related data: speech recognition and translation, speech-based QA, and mixed-modal SFT. Compared to previous SpeechLMs with 7B or 13B parameters, our 3B model demonstrates superior performance across various speech benchmarks while preserving the original capabilities on text-only tasks. Furthermore, our model shows emergent abilities of effectively handling previously unseen prompts and tasks, including multi-turn, mixed-modal inputs.
Published: 2024

4. EMMeTT: Efficient Multimodal Machine Translation Training

Author: Żelasko, Piotr, Chen, Zhehuai, Wang, Mengru, Galvez, Daniel, Hrinchuk, Oleksii, Ding, Shuoyang, Hu, Ke, Balam, Jagadeesh, Lavrukhin, Vitaly, and Ginsburg, Boris
Subjects: Computer Science - Computation and Language, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: A rising interest in the modality extension of foundation language models warrants discussion on the most effective, and efficient, multimodal training approach. This work focuses on neural machine translation (NMT) and proposes a joint multimodal training regime of Speech-LLM to include automatic speech translation (AST). We investigate two different foundation model architectures, decoder-only GPT and encoder-decoder T5, extended with Canary-1B's speech encoder. To handle joint multimodal training, we propose a novel training framework called EMMeTT. EMMeTT improves training efficiency with the following: balanced sampling across languages, datasets, and modalities; efficient sequential data iteration; and a novel 2D bucketing scheme for multimodal data, complemented by a batch size optimizer (OOMptimizer). We show that a multimodal training consistently helps with both architectures. Moreover, SALM-T5 trained with EMMeTT retains the original NMT capability while outperforming AST baselines on four-language subsets of FLORES and FLEURS. The resultant Multimodal Translation Model produces strong text and speech translation results at the same time., Comment: 4 pages, submitted to ICASSP 2025
Published: 2024

5. Chain-of-Thought Prompting for Speech Translation

Author: Hu, Ke, Chen, Zhehuai, Yang, Chao-Han Huck, Żelasko, Piotr, Hrinchuk, Oleksii, Lavrukhin, Vitaly, Balam, Jagadeesh, and Ginsburg, Boris
Subjects: Computer Science - Computation and Language
Abstract: Large language models (LLMs) have demonstrated remarkable advancements in language understanding and generation. Building on the success of text-based LLMs, recent research has adapted these models to use speech embeddings for prompting, resulting in Speech-LLM models that exhibit strong performance in automatic speech recognition (ASR) and automatic speech translation (AST). In this work, we propose a novel approach to leverage ASR transcripts as prompts for AST in a Speech-LLM built on an encoder-decoder text LLM. The Speech-LLM model consists of a speech encoder and an encoder-decoder structure Megatron-T5. By first decoding speech to generate ASR transcripts and subsequently using these transcripts along with encoded speech for prompting, we guide the speech translation in a two-step process like chain-of-thought (CoT) prompting. Low-rank adaptation (LoRA) is used for the T5 LLM for model adaptation and shows superior performance to full model fine-tuning. Experimental results show that the proposed CoT prompting significantly improves AST performance, achieving an average increase of 2.4 BLEU points across 6 En->X or X->En AST tasks compared to speech prompting alone. Additionally, compared to a related CoT prediction method that predicts a concatenated sequence of ASR and AST transcripts, our method performs better by an average of 2 BLEU points.
Published: 2024

6. Robust Principal Component Analysis via Discriminant Sample Weight Learning

Author: Deng, Yingzhuo, Hu, Ke, Li, Bo, and Zhang, Yao
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: Principal component analysis (PCA) is a classical feature extraction method, but it may be adversely affected by outliers, resulting in inaccurate learning of the projection matrix. This paper proposes a robust method to estimate both the data mean and the PCA projection matrix by learning discriminant sample weights from data containing outliers. Each sample in the dataset is assigned a weight, and the proposed algorithm iteratively learns the weights, the mean, and the projection matrix, respectively. Specifically, when the mean and the projection matrix are available, via fine-grained analysis of outliers, a weight for each sample is learned hierarchically so that outliers have small weights while normal samples have large weights. With the learned weights available, a weighted optimization problem is solved to estimate both the data mean and the projection matrix. Because the learned weights discriminate outliers from normal samples, the adverse influence of outliers is mitigated due to the corresponding small weights. Experiments on toy data, UCI dataset, and face dataset demonstrate the effectiveness of the proposed method in estimating the mean and the projection matrix from the data containing outliers.
Published: 2024

7. High-resolution Simulation Dataset of Hourly PM2.5 Chemical Composition in China (CAQRA-aerosol) from 2013 to 2020

Author: Kong, Lei, Tang, Xiao, Zhu, Jiang, Wang, Zifa, Liu, Bing, Zhu, Yuanyuan, Zhu, Lili, Chen, Duohong, Hu, Ke, Wu, Huangjian, Wu, Qian, Shen, Jin, Sun, Yele, Liu, Zirui, Xin, Jinyuan, Ji, Dongsheng, and Zheng, Mei
Published: 2025
Full Text: View/download PDF

8. Effect of tumor microenvironment in pancreatic cancer on the loss of β-cell mass: implications for type 3c diabetes

Author: Hu, Ke, Zhao, Xuelian, Zhang, Na, Ma, Jing, Zhang, Ruonan, Lu, Zhiqiang, Wu, Wenchuan, Ji, Yuan, and Li, Xiaomu
Published: 2025
Full Text: View/download PDF

9. Bifunctional bridging capping layer enables 24.5% efficiency of perovskite solar cells with polymer-based hole transport materials

Author: Zhu, Can, Wang, Yiyang, Meng, Lei, Qiu, Beibei, Li, Jing, Qin, Shucheng, Hu, Ke, Jiang, Xin, Lai, Wenbin, Liu, Minchao, Liu, Zhe, Lu, Chenxing, Zhang, Jinyuan, and Li, Yongfang
Published: 2025
Full Text: View/download PDF

10. Direct Carbon Emission Accounting and Alarm Method for SF6

Author: Zhao, Xiaofeng, Dai, Yao, Zhang, Rui, Luo, Lijian, Hu, Ke, Li, Qingfeng, Wang, Jiahao, Wu, Yingyu, Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Li, Yong, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Oneto, Luca, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zamboni, Walter, Series Editor, Tan, Kay Chen, Series Editor, Yang, Qingxin, editor, and Li, Jian, editor
Published: 2025
Full Text: View/download PDF

11. Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization

Author: Ding, Shutong, Hu, Ke, Zhang, Zhenhao, Ren, Kan, Zhang, Weinan, Yu, Jingyi, Wang, Jingya, and Shi, Ye
Subjects: Computer Science - Machine Learning
Abstract: Diffusion models have garnered widespread attention in Reinforcement Learning (RL) for their powerful expressiveness and multimodality. It has been verified that utilizing diffusion policies can significantly improve the performance of RL algorithms in continuous control tasks by overcoming the limitations of unimodal policies, such as Gaussian policies, and providing the agent with enhanced exploration capabilities. However, existing works mainly focus on the application of diffusion policies in offline RL, while their incorporation into online RL is less investigated. The training objective of the diffusion model, known as the variational lower bound, cannot be optimized directly in online RL due to the unavailability of 'good' actions. This leads to difficulties in conducting diffusion policy improvement. To overcome this, we propose a novel model-free diffusion-based online RL algorithm, Q-weighted Variational Policy Optimization (QVPO). Specifically, we introduce the Q-weighted variational loss, which can be proved to be a tight lower bound of the policy objective in online RL under certain conditions. To fulfill these conditions, the Q-weight transformation functions are introduced for general scenarios. Additionally, to further enhance the exploration capability of the diffusion policy, we design a special entropy regularization term. We also develop an efficient behavior policy to enhance sample efficiency by reducing the variance of the diffusion policy during online interactions. Consequently, the QVPO algorithm leverages the exploration capabilities and multimodality of diffusion policies, preventing the RL agent from converging to a sub-optimal policy. To verify the effectiveness of QVPO, we conduct comprehensive experiments on MuJoCo benchmarks. The final results demonstrate that QVPO achieves state-of-the-art performance on both cumulative reward and sample efficiency., Comment: Accepted by NeurIPS2024
Published: 2024

12. Enhancing Visual Continual Learning with Language-Guided Supervision

Author: Ni, Bolin, Zhao, Hongbo, Zhang, Chenghao, Hu, Ke, Meng, Gaofeng, Zhang, Zhaoxiang, and Xiang, Shiming
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Continual learning (CL) aims to empower models to learn new tasks without forgetting previously acquired knowledge. Most prior works concentrate on the techniques of architectures, replay data, regularization, \etc. However, the category name of each class is largely neglected. Existing methods commonly utilize the one-hot labels and randomly initialize the classifier head. We argue that the scarce semantic information conveyed by the one-hot labels hampers the effective knowledge transfer across tasks. In this paper, we revisit the role of the classifier head within the CL paradigm and replace the classifier with semantic knowledge from pretrained language models (PLMs). Specifically, we use PLMs to generate semantic targets for each class, which are frozen and serve as supervision signals during training. Such targets fully consider the semantic correlation between all classes across tasks. Empirical studies show that our approach mitigates forgetting by alleviating representation drifting and facilitating knowledge transfer across tasks. The proposed method is simple to implement and can seamlessly be plugged into existing methods with negligible adjustments. Extensive experiments based on eleven mainstream baselines demonstrate the effectiveness and generalizability of our approach to various protocols. For example, under the class-incremental learning setting on ImageNet-100, our method significantly improves the Top-1 accuracy by 3.2\% to 6.1\% while reducing the forgetting rate by 2.6\% to 13.1\%., Comment: Accepted by CVPR 2024
Published: 2024

13. Generation of isolated attosecond electron bunches by the diffraction of a polarization-tailored intense laser beam

Author: Hu, Ke and Yi, Longqing
Subjects: Physics - Plasma Physics
Abstract: We propose utilizing a polarization-tailored high-power laser pulse to extract and accelerate electrons from the edge of a solid foil target to produce isolated attosecond electron bunches. The laser pulse consists of two orthogonally-polarized components with a time delay comparable to the pulse duration, such that the polarization in the middle of the pulse rapidly rotates over 90$^\circ$ within few optical cycles. Three-dimensional (3D) Particle-in-Cell simulations show that when such a light pulse diffracts at the edge of a plasma foil, a series of isolated relativistic electron bunches are emitted into separated azimuthal angles determined by the varying polarization. In comparison with most other methods that require an ultra-short drive laser, we show the proposed scheme works well with typical multi-cycle ($\sim 30~$fs) pulses from high-power laser facilities. The generated electron bunches have typical durations of a few hundred attoseconds and charges of tens of picocoulombs.
Published: 2024

14. Improvement of hepatic fibrosis after tenofovir disoproxil fumarate switching to tenofovir alafenamide for three years.

Author: Huynh, Tung, Bui, Delana, Zhou, Tina, and Hu, Ke-Qin
Subjects: Aspartate aminotransferase to platelet ratio index, Fibrosis-4, Hepatic fibrosis improvement, Shear wave elastography, Switching, Tenofovir alafenamide, Tenofovir disoproxil fumarate
Abstract: BACKGROUND: Both tenofovir alafenamide (TAF) and tenofovir disoproxil fumarate (TDF) are the first-line treatments for chronic hepatitis B (CHB). We have showed switching from TDF to TAF for 96 weeks resulted in further alanine aminotransferase (ALT) improvement, but data remain lacking on the long-term benefits of TDF switching to TAF on hepatic fibrosis. AIM: To assess the benefits of TDF switching to TAF for 3 years on ALT, aspartate aminotransferase (AST), and hepatic fibrosis improvement in patients with CHB. METHODS: A single center retrospective study on 53 patients with CHB who were initially treated with TDF, then switched to TAF to determine dynamic patterns of ALT, AST, AST to platelet ratio index (APRI), fibrosis-4 (FIB-4) scores, and shear wave elastography (SWE) reading improvement at switching week 144, and the associated factors. RESULTS: The mean age was 55 (28-80); 45.3%, males; 15.1%, clinical cirrhosis; mean baseline ALT, 24.8; AST, 25.7 U/L; APRI, 0.37; and FIB-4, 1.66. After 144 weeks TDF switching to TAF, mean ALT and AST were reduced to 19.7 and 21, respectively. From baseline to switching week 144, the rates of ALT and AST < 35 (male)/25 (female) and < 30 (male)/19 (female) were persistently increased; hepatic fibrosis was also improved by APRI < 0.5, from 79.2% to 96.2%; FIB-4 < 1.45, from 52.8% to 58.5%, respectively; mean APRI was reduced to 0.27; FIB-4, to 1.38; and mean SWE reading, from 7.05 to 6.30 kPa after a mean of 109 weeks switching. The renal function was stable and the frequency of patients with glomerular filtration rate > 60 mL/min was increased from 86.5% at baseline to 88.2% at switching week 144. CONCLUSION: Our data confirmed that switching from TDF to TAF for 3 years results in not only persistent ALT/AST improvement, but also hepatic fibrosis improvement by APRI, FIB-4 scores, as well as SWE reading, the important clinical benefits of long-term hepatitis B virus antiviral treatment with TAF.
Published: 2024

15. Synthesis and Durable Antimicrobial and Anti-fungal Properties of Triclosan and Chitosan Co-grafted Polypropylene Nonwovens

Author: Hu, Ke, Chen, Hongxuan, Lin, Yihui, Han, Shitong, Wang, Qi, Peng, Houqian, Wang, Ying, Zhao, Jiwu, Xi, Hailing, Wen, Na, and Long, Jinlin
Published: 2024
Full Text: View/download PDF

16. Proteomics identifies hypothermia induced adiponectin protects corneal endothelial cells via AMPK mediated autophagy in phacoemulsification

Author: Chen, Yanyi, Li, Kewei, Huang, Rongxi, Xiong, Liang, Li, Ruonan, Jiang, Lu, Xun, Yan, Wan, Wenjuan, and Hu, Ke
Published: 2024
Full Text: View/download PDF

17. Impact of Samarium on Microstructural Evolution and Tribological Behavior of FeCoNiCr High-Entropy Alloys Fabricated by Laser Metal Deposition

Author: Hu, Ke, Guo, Xiaoming, She, Yunfeng, Li, Lingling, She, Lixia, Huo, Xiaomin, Liu, Xiao, Huang, Junjie, Zhang, Ying, and Chen, Jinjian
Published: 2024
Full Text: View/download PDF

18. Multilingual and Fully Non-Autoregressive ASR with Large Language Model Fusion: A Comprehensive Study

Author: Huang, W. Ronny, Allauzen, Cyril, Chen, Tongzhou, Gupta, Kilol, Hu, Ke, Qin, James, Zhang, Yu, Wang, Yongqiang, Chang, Shuo-Yiin, and Sainath, Tara N.
Subjects: Computer Science - Computation and Language, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: In the era of large models, the autoregressive nature of decoding often results in latency serving as a significant bottleneck. We propose a non-autoregressive LM-fused ASR system that effectively leverages the parallelization capabilities of accelerator hardware. Our approach combines the Universal Speech Model (USM) and the PaLM 2 language model in per-segment scoring mode, achieving an average relative WER improvement across all languages of 10.8% on FLEURS and 3.6% on YouTube captioning. Furthermore, our comprehensive ablation study analyzes key parameters such as LLM size, context length, vocabulary size, fusion methodology. For instance, we explore the impact of LLM size ranging from 128M to 340B parameters on ASR performance. This study provides valuable insights into the factors influencing the effectiveness of practical large-scale LM-fused speech recognition systems., Comment: ICASSP 2024
Published: 2024

19. Feature Norm Regularized Federated Learning: Transforming Skewed Distributions into Global Insights

Author: Hu, Ke, Qiu, WeiDong, and Tang, Peng
Subjects: Computer Science - Machine Learning, Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: In the field of federated learning, addressing non-independent and identically distributed (non-i.i.d.) data remains a quintessential challenge for improving global model performance. This work introduces the Feature Norm Regularized Federated Learning (FNR-FL) algorithm, which uniquely incorporates class average feature norms to enhance model accuracy and convergence in non-i.i.d. scenarios. Our comprehensive analysis reveals that FNR-FL not only accelerates convergence but also significantly surpasses other contemporary federated learning algorithms in test accuracy, particularly under feature distribution skew scenarios. The novel modular design of FNR-FL facilitates seamless integration with existing federated learning frameworks, reinforcing its adaptability and potential for widespread application. We substantiate our claims through rigorous empirical evaluations, demonstrating FNR-FL's exceptional performance across various skewed data distributions. Relative to FedAvg, FNR-FL exhibits a substantial 66.24\% improvement in accuracy and a significant 11.40\% reduction in training time, underscoring its enhanced effectiveness and efficiency.
Published: 2023

20. Interobserver variability of clinical target volume delineation in patients undergoing breast-conserving surgery without surgical clips: a pilot study on preoperative magnetic resonance simulation

Author: Jiao, Shuning, Wang, Yiqing, Ma, Jiabin, Shen, Jing, Zhang, Xi-Qian, Zhou, Bing, Sun, Xiansong, Xu, Haoran, Liu, Xia, Hu, Ke, Zhang, Fuquan, Hou, Xiaorong, and Qiu, Jie
Published: 2024
Full Text: View/download PDF

21. Nr1d1 inhibition mitigates intermittent hypoxia-induced pulmonary hypertension via Dusp1-mediated Erk1/2 deactivation and mitochondrial fission attenuation

Author: Pan, Zhou, Yao, Yan, Liu, Xu, Wang, Yixuan, Zhang, Xinyue, Zha, Shiqian, and Hu, Ke
Published: 2024
Full Text: View/download PDF

22. Clinical outcomes analysis of image-guided brachytherapy as definitive treatment for inoperable endometrial cancer

Author: Gong, Xinyue, Sun, Shuai, Yan, Junfang, Wang, Wenhui, Ren, Kang, Hou, Xiaorong, Hu, Ke, and Zhang, Fuquan
Published: 2024
Full Text: View/download PDF

23. Hyper-realistic rendering-assisted laparoscopic adrenalectomy for giant adrenal tumors: a pilot study

Author: Zhang, Jiamo, Hu, Ke, Qing, Jing, Chen, Jiangchuan, Li, Changlong, and Zhou, Yongxia
Published: 2024
Full Text: View/download PDF

24. Long sleep duration is associated with abdominal aortic calcification among male adults with chronic kidney disease: NHANES 2013–2014

Author: Wang, Yuhan, Liu, Xu, Zhang, Jingyi, Zhou, Beini, Yue, Wuriliga, and Hu, Ke
Published: 2024
Full Text: View/download PDF

25. Biodegradable copper-iodide clusters modulate mitochondrial function and suppress tumor growth under ultralow-dose X-ray irradiation

Author: Ma, Xiaoqian, Lin, Nuo, Yang, Qing, Liu, Peifei, Ding, Haizhen, Xu, Mengjiao, Ren, Fangfang, Shen, Zhiyang, Hu, Ke, Meng, Shanshan, and Chen, Hongmin
Published: 2024
Full Text: View/download PDF

26. A dosimetric comparison of brachytherapy sources for endometrial cancer: an electronic brachytherapy and an iridium-192 source with multichannel cylinders and a three-dimensional technique

Author: Wang, Wenhui, Wang, Bei, Yu, Lang, Zhen, Hongnan, Zhang, Yue, Feng, Siqi, Chen, Zhou, Zhang, Yuan, Qiu, Jie, Zhang, Fuquan, and Hu, Ke
Published: 2024
Full Text: View/download PDF

27. Comparison of AirSeal versus conventional insufflation system for robot-assisted partial nephrectomy: a meta-analysis and systematic review

Author: Fan, Gen, Chen, Yushui, Wang, Junji, Wu, Yinyu, Wang, Yu, Hu, Ke, and Tang, Tielong
Published: 2024
Full Text: View/download PDF

28. Comparison of outcomes between early-stage cervical cancer patients without high-risk factors undergoing adjuvant concurrent chemoradiotherapy and radiotherapy alone after radical surgery

Author: Zhou, Yuncan, Wang, Weiping, Tang, Jia, Hu, Ke, and Zhang, Fuquan
Published: 2024
Full Text: View/download PDF

29. Nanocomposite magnetic hydrogel with dual anisotropic properties induces osteogenesis through the NOTCH-dependent pathways

Author: Tang, Shijia, Yan, Yue, Lu, Xiaoli, Wang, Peng, Xu, Xueqin, Hu, Ke, Yan, Sen, Guo, Zhaobin, Han, Xiao, Zhang, Feimin, and Gu, Ning
Published: 2024
Full Text: View/download PDF

30. Clinical characteristics and radiation therapy modality of younger patients with early-stage endometrial cancer, a multicenter study in China’s real world

Author: Zhang, Kun, Wang, Tiejun, Liu, Zi, He, Jianli, Sun, Xiaoge, Zhong, Wei, Zhao, Fengjv, Li, Xiaomei, Li, Sha, Zhu, Hong, Ma, Zhanshu, Hu, Ke, Zhang, Fuquan, Hou, Xiaorong, Wei, Lichun, and Zou, Lijuan
Published: 2024
Full Text: View/download PDF

31. Comparison of the clinical characteristics in parents and their children in a series of family clustered Mycoplasma pneumoniae infections

Author: Liu, Xu, Zhang, Qingfeng, Chen, Hao, Hao, Yueying, Zhang, Jingyi, Zha, Shiqian, Zhou, Beini, Yi, Yaohua, Xiao, Rui, and Hu, Ke
Published: 2024
Full Text: View/download PDF

32. Clinical courses and outcomes of COVID-19 associated pulmonary aspergillosis in 168 patients with the SARS-CoV-2 omicron variant

Author: Wang, Yixuan, Yao, Yan, Zhang, Qingfeng, Chen, Hao, He, Yang, and Hu, Ke
Published: 2024
Full Text: View/download PDF

33. Bisphenol A triggers apoptosis in mouse pre-antral follicle granulosa cells via oxidative stress

Author: Wang, Chen, He, Chaofan, Xu, Shumin, Gao, Yuanyuan, Wang, Kaixian, Liang, Meng, and Hu, Ke
Published: 2024
Full Text: View/download PDF

34. Exposure to residential green and blue space and the natural environment is associated with a lower incidence of psychiatric disorders in middle-aged and older adults: findings from the UK Biobank

Author: Liu, Bao-Peng, Huxley, Rachel R., Schikowski, Tamara, Hu, Ke-Jia, Zhao, Qi, and Jia, Cun-Xian
Published: 2024
Full Text: View/download PDF

35. Prospects for daily online adaptive radiotherapy for cervical cancer: Auto-contouring evaluation and dosimetric outcomes

Author: Zhang, Yu, Wang, Guangyu, Chang, Yankui, Wang, Zhiqun, Sun, Xiansong, Sun, Yuliang, Zeng, Zheng, Chen, Yining, Hu, Ke, Qiu, Jie, Yan, Junfang, and Zhang, Fuquan
Published: 2024
Full Text: View/download PDF

36. Evaluation of PTV margins with daily iterative online adaptive radiotherapy for postoperative treatment of endometrial and cervical cancer: a prospective single-arm phase 2 study

Author: Wang, Guangyu, Wang, Zhiqun, Guo, Yuping, Zhang, Yu, Qiu, Jie, Hu, Ke, Li, Jing, Yan, JunFang, and Zhang, Fuquan
Published: 2024
Full Text: View/download PDF

37. Dapagliflozin attenuates LPS-induced myocardial injury by reducing ferroptosis

Author: Hu, Ke, Jiang, Pin, Hu, Jiaxin, Song, Bing, Hou, Ya, Zhao, Jinxuan, Chen, Haiting, and Xie, Jun
Published: 2024
Full Text: View/download PDF

38. Small-molecule caspase-1 inhibitor CZL80 terminates refractory status epilepticus via inhibition of glutamatergic transmission

Author: Wang, Fei, Wang, Yu, Zhang, Qing-yang, Hu, Ke-yu, Song, Ying-jie, Yang, Lin, Fei, Fan, Xu, Ceng-lin, Cui, Sun-liang, Ruan, Ye-ping, Wang, Yi, and Chen, Zhong
Published: 2024
Full Text: View/download PDF

39. Influence of Human Activity Intensity on Habitat Quality in Hainan Tropical Rainforest National Park, China

Author: Han, Nianlong, Yu, Miao, Jia, Peihong, Zhang, Yucheng, and Hu, Ke
Published: 2024
Full Text: View/download PDF

40. Improving Joint Speech-Text Representations Without Alignment

Author: Peyser, Cal, Meng, Zhong, Hu, Ke, Prabhavalkar, Rohit, Rosenberg, Andrew, Sainath, Tara N., Picheny, Michael, and Cho, Kyunghyun
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: The last year has seen astonishing progress in text-prompted image generation premised on the idea of a cross-modal representation space in which the text and image domains are represented jointly. In ASR, this idea has found application as joint speech-text encoders that can scale to the capacities of very large parameter models by being trained on both unpaired speech and text. While these methods show promise, they have required special treatment of the sequence-length mismatch inherent in speech and text, either by up-sampling heuristics or an explicit alignment model. In this work, we offer evidence that joint speech-text encoders naturally achieve consistent representations across modalities by disregarding sequence length, and argue that consistency losses could forgive length differences and simply assume the best alignment. We show that such a loss improves downstream WER in both a large-parameter monolingual and multilingual system.
Published: 2023

41. Nonexistence of the compressible Euler equations with space-dependent damping in high dimensions

Author: Geng Jinbo, Hu Ke, Lai Ning-An, and Yuen Manwai
Subjects: compressible euler equations, damping, blow-up, lifespan, test function method, 35q31, 35l65, 35l67, 76n15, Analysis, QA299.6-433
Abstract: Compressible Euler equations with space-dependent damping in high dimensions Rn(n=2,3){{\bf{R}}}^{n}\hspace{0.33em}\hspace{0.33em}\left(n=2,3) are considered in this article. Assuming that the small initial velocity and small perturbation of the initial density have compact support, we establish finite-time blow-up results for the Euler system, by combining energy estimate and new test functions constructed by the solutions of the following linear elliptic partial differential equations system: −G1(x)+∇⋅G2→(x)=0,−G2→(x)+∇G1(x)=μG2→(x)(1+∣x∣)λ.\left\{\begin{array}{l}-{G}_{1}\left(x)+\nabla \cdot \overrightarrow{{G}_{2}}\left(x)=0,\\ -\overrightarrow{{G}_{2}}\left(x)+\nabla {G}_{1}\left(x)=\frac{\mu \overrightarrow{{G}_{2}}\left(x)}{{(1+| x| )}^{\lambda }}.\end{array}\right. This result generalizes the one in the literature from 1−D1-D to high dimension Rn(n=2,3){{\bf{R}}}^{n}\hspace{0.33em}\hspace{0.33em}\left(n=2,3).
Published: 2024
Full Text: View/download PDF

42. Research on two-stage configuration optimization of energy storage power station considering idle energy storage in new energy stations

Author: XUN Hanlong, LIU Yang, JIN Xuran, CHEN Jiayi, and HU Ke
Subjects: new energy station, idle energy storage, third-party energy storage, two-stage optimization, configuration optimization, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
Abstract: In order to improve the stability of integrating renewable energy sources into the power grid， it is essential to deploy energy storage devices. However， excessive deployment can lead to higher construction costs and lower utilization efficiency. To tackle this challenge， this paper proposes an optimal deployment approach that leverages idle energy storage in new energy stations to reduce the construction costs of third-party energy storage power stations. This approach comprises two stages： In the first stage， an optimal scheduling model for the energy storage devices in new energy stations is developed， with the goal of maximizing the consumption of wind and solar power. In the second stage， after assessing the remaining storage capacity of new energy stations， the deployment of the energy storage power station capacity is optimized to maximize the operational revenue of third-party energy storage power stations. Case analysis demonstrates that third-party energy storage power stations following this approach exhibit reduced energy storage configuration capacity and significantly lower construction and maintenance costs compared to independently constructed energy storage power stations.
Published: 2024
Full Text: View/download PDF

43. Correction to: Diagnosis and Management of Hepatitis Delta Virus Infection

Author: Pan, Calvin, Gish, Robert, Jacobson, Ira M, Hu, Ke-Qin, Wedemeyer, Heiner, and Martin, Paul
Subjects: Biomedical and Clinical Sciences, Clinical Sciences, Gastroenterology & Hepatology, Clinical sciences
Abstract: The article “Diagnosis and Management of Hepatitis Delta Virus Infection”, written by Calvin Pan, Robert Gish, Ira M. Jacobson, Ke‑Qin Hu, Heiner Wedemeyer, Paul Martin, was originally published electronically on the publisher’s internet portal on 20 June 2023 without open access. With the author(s)’ decision to opt for Open Choice the copyright of the article changed on 1 July 2023 to © The Author(s) 2023 and the article is forthwith distributed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit. The original article has been corrected.
Published: 2023

44. The dynamics of deltamethrin resistance evolution in Aedes albopictus has an impact on fitness and dengue virus type-2 vectorial capacity

Author: Guo, Yijia, Hu, Ke, Zhou, Jingni, Xie, Zhensheng, Zhao, Yijie, Zhao, Siyu, Gu, Jinbao, Zhou, Xiaohong, Yan, Guiyun, James, Anthony A, and Chen, Xiao-Guang
Subjects: Biological Sciences, Emerging Infectious Diseases, Genetics, Rare Diseases, Vaccine Related, Vector-Borne Diseases, Infectious Diseases, Biodefense, Prevention, Prevention of disease and conditions, and promotion of well-being, 3.2 Interventions to alter physical and biological environmental risks, Infection, Good Health and Well Being, Animals, Aedes, Dengue Virus, Insecticides, Mosquito Vectors, Zika Virus, Zika Virus Infection, Metabolic resistance, Pyrethroid, Selection, Target-site resistance, Vector competence, Developmental Biology
Abstract: BackgroundWorldwide invasion and expansion of Aedes albopictus, an important vector of dengue, chikungunya, and Zika viruses, has become a serious concern in global public health. Chemical insecticides are the primary means currently available to control the mosquito populations. However, long-term and large-scale use of insecticides has selected for resistance in the mosquito that is accompanied by a genetic load that impacts fitness.ResultsA number of laboratory strains representing different resistance mechanisms were isolated and identified from laboratory-derived, deltamethrin-resistant Ae. albopictus recovered in previous work. Resistance levels and fitness costs of the strains were evaluated and compared to characterize the evolution of the resistance genotypes and phenotypes. The heterozygous F1534S mutation (1534F/S) in the voltage gated sodium channel (vgsc) gene product (VGSC), first detected in early stages of resistance evolution, not only confers high-level resistance, but also produces no significant fitness costs, leading to the rapid spread of resistance in the population. This is followed by the increase in frequency of homozygous F1534S (1534S/S) mosquitoes that have significant fitness disadvantages, prompting the emergence of an unlinked I1532T mutation with fewer side effects and a mating advantage better adapted to the selection and reproductive pressures imposed in the experiments. Metabolic resistance with no significant fitness cost and mediating a high-tolerance resistance phenotype may play a dominant role in the subsequent evolution of resistance. The different resistant strains had similar vector competence for dengue virus type-2 (DENV-2). Furthermore, a comparative analysis of vectorial capacity revealed that increased survival due to deltamethrin resistance balanced the negative fitness cost effects and contributed to the risk of dengue virus (DENV) transmission by resistant populations. The progressive evolution of resistance results in mosquitoes with both target-site insensitivity and metabolic resistance with lower fitness costs, which further leads to resistant populations with both high resistance levels and vectorial capacity.ConclusionsThis study reveals a possible mechanism for the evolution of deltamethrin resistance in Aedes albopictus. These findings will help guide practical strategies for insecticide use, resistance management and the prevention and control of mosquito-borne disease.
Published: 2023

45. Mixture-of-Expert Conformer for Streaming Multilingual ASR

Author: Hu, Ke, Li, Bo, Sainath, Tara N., Zhang, Yu, and Beaufays, Francoise
Subjects: Computer Science - Computation and Language, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: End-to-end models with large capacity have significantly improved multilingual automatic speech recognition, but their computation cost poses challenges for on-device applications. We propose a streaming truly multilingual Conformer incorporating mixture-of-expert (MoE) layers that learn to only activate a subset of parameters in training and inference. The MoE layer consists of a softmax gate which chooses the best two experts among many in forward propagation. The proposed MoE layer offers efficient inference by activating a fixed number of parameters as the number of experts increases. We evaluate the proposed model on a set of 12 languages, and achieve an average 11.9% relative improvement in WER over the baseline. Compared to an adapter model using ground truth information, our MoE model achieves similar WER and activates similar number of parameters but without any language information. We further show around 3% relative WER improvement by multilingual shallow fusion., Comment: Accepted to Interspeech 2023
Published: 2023

46. A Deliberation-based Joint Acoustic and Text Decoder

Author: Mavandadi, Sepand, Sainath, Tara N., Hu, Ke, and Wu, Zelin
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Computation and Language, Computer Science - Machine Learning, Computer Science - Sound
Abstract: We propose a new two-pass E2E speech recognition model that improves ASR performance by training on a combination of paired data and unpaired text data. Previously, the joint acoustic and text decoder (JATD) has shown promising results through the use of text data during model training and the recently introduced deliberation architecture has reduced recognition errors by leveraging first-pass decoding results. Our method, dubbed Deliberation-JATD, combines the spelling correcting abilities of deliberation with JATD's use of unpaired text data to further improve performance. The proposed model produces substantial gains across multiple test sets, especially those focused on rare words, where it reduces word error rate (WER) by between 12% and 22.5% relative. This is done without increasing model size or requiring multi-stage training, making Deliberation-JATD an efficient candidate for on-device applications., Comment: Interspeech 2021
Published: 2023

47. Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages

Author: Zhang, Yu, Han, Wei, Qin, James, Wang, Yongqiang, Bapna, Ankur, Chen, Zhehuai, Chen, Nanxin, Li, Bo, Axelrod, Vera, Wang, Gary, Meng, Zhong, Hu, Ke, Rosenberg, Andrew, Prabhavalkar, Rohit, Park, Daniel S., Haghani, Parisa, Riesa, Jason, Perng, Ginger, Soltau, Hagen, Strohman, Trevor, Ramabhadran, Bhuvana, Sainath, Tara, Moreno, Pedro, Chiu, Chung-Cheng, Schalkwyk, Johan, Beaufays, Françoise, and Wu, Yonghui
Subjects: Computer Science - Computation and Language, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: We introduce the Universal Speech Model (USM), a single large model that performs automatic speech recognition (ASR) across 100+ languages. This is achieved by pre-training the encoder of the model on a large unlabeled multilingual dataset of 12 million (M) hours spanning over 300 languages, and fine-tuning on a smaller labeled dataset. We use multilingual pre-training with random-projection quantization and speech-text modality matching to achieve state-of-the-art performance on downstream multilingual ASR and speech-to-text translation tasks. We also demonstrate that despite using a labeled training set 1/7-th the size of that used for the Whisper model, our model exhibits comparable or better performance on both in-domain and out-of-domain speech recognition tasks across many languages., Comment: 20 pages, 7 figures, 8 tables
Published: 2023

48. Sigma-1 receptor exerts protective effects on ameliorating nephrolithiasis by modulating endoplasmic reticulum-mitochondrion association and inhibiting endoplasmic reticulum stress-induced apoptosis in renal tubular epithelial cells

Author: Hu Ke, Xiaozhe Su, Caitao Dong, Ziqi He, Qianlin Song, Chao song, Jiawei Zhou, Wenbiao Liao, Chuan Wang, Sixing Yang, and Yunhe Xiong
Subjects: Sigmar-1 receptor, endoplasmic reticulum stress, kidney stone, apoptosis, dimemorfan, mitochondria-associated endoplasmic reticulum membrane, Pathology, RB1-214, Biology (General), QH301-705.5
Abstract: Oxalate-induced damage to renal tubular epithelial cells (RTECs) is an essential factor in the incident kidney stone, but the specific mechanism is unclear. Recent research has pinpointed interacting areas within the endoplasmic reticulum and mitochondria, called mitochondria-associated membranes (MAMs). These studies have linked endoplasmic reticulum stress (ERS) and oxidative imbalance to kidney disease development. The sigma-1 receptor (S1R), a specific protein found in MAMs, is involved in various physiological processes, but its role in oxalate-induced kidney stone formation remains unclear. In this study, we established cellular and rat models of oxalate-induced kidney stone formation to elucidate the S1R's effects against ERS and apoptosis and its mechanism in oxalate-induced RTEC injury. We found that oxalate downregulated S1R expression in RTECs and escalated oxidative stress and ERS, culminating in increased apoptosis. The S1R agonist dimemorfan up-regulated S1R expression and mitigated ERS and oxidative stress, thereby reducing apoptosis. This protective effect was mediated through S1R inhibition of the CHOP pathway. Animal experiments demonstrated that S1R's activation attenuated oxalate-induced kidney injury and alleviated kidney stone formation. This is the first study to establish the connection between S1R and kidney stones, suggesting S1R's protective role in inhibiting ERS-mediated apoptosis to ameliorate kidney stone formation.
Published: 2024
Full Text: View/download PDF

49. Selenium participates in the formation of kidney stones by alleviating endoplasmic reticulum stress and apoptosis of renal tubular epithelial cells

Author: Xiaozhe Su, Hongbo Chen, Heng Xiang, Hu Ke, Caitao Dong, Qianlin Song, Jiawei Zhou, Qinhong Jiang, Yunhan Wang, Liang Chen, and Sixing Yang
Subjects: Kidney stones, selenium, selenoprotein K, endoplasmic reticulum stress, apoptosis, Pathology, RB1-214, Biology (General), QH301-705.5
Abstract: Objectives: To investigate the role of selenium and selenium-containing proteins in the etiology and pathogenesis of kidney stones.Methods: The HK-2 cell line was subjected to supersaturation oxalate treatment to establish an in vitro model of calcium oxalate kidney stones, while SD rats were administered with ethylene glycol to establish an in vivo model of calcium oxalate kidney stones. qPCR analysis was employed to investigate the alterations in selenoproteins within the models, and subsequently, genes exhibiting significant changes were identified. Subsequently, based on the functions of these genes, their regulatory effects on endoplasmic reticulum stress (ERS) and apoptosis during the disease progression were examined both in HK-2 cells and rat kidneys. Finally, Selenomethionine (SeMet) supplementation was introduced to explore its therapeutic potential for kidney stone management.Results: The involvement of Selenoprotein K in the pathogenesis of calcium oxalate kidney stone disease has been confirmed, exhibiting significant alterations. Manipulation of its expression levels through overexpression and knockdown techniques resulted in a corresponding reduction or increase in oxidative stress, ERS, and apoptosis within renal tubular epithelial cells. SelK regulates ERS and apoptosis by controlling the IRE1-ASK1-JNK pathway. In addition, SeMet treatment, which contains selenium, effectively reduced the levels of oxidative stress, ERS, and apoptosis in vivo and in vitro models, thereby alleviating tubular epithelial cell damage and reducing the formation of kidney stones in experimental rats.Discussion: Selenium is involved in the occurrence and development of kidney stones by regulating oxidative damage to renal tubular epithelial cells. The results suggest that dietary selenium supplementation in daily life may be of great significance for the prevention and treatment of kidney stones.
Published: 2024
Full Text: View/download PDF

50. Massively Multilingual Shallow Fusion with Large Language Models

Author: Hu, Ke, Sainath, Tara N., Li, Bo, Du, Nan, Huang, Yanping, Dai, Andrew M., Zhang, Yu, Cabrera, Rodrigo, Chen, Zhifeng, and Strohman, Trevor
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: While large language models (LLM) have made impressive progress in natural language processing, it remains unclear how to utilize them in improving automatic speech recognition (ASR). In this work, we propose to train a single multilingual language model (LM) for shallow fusion in multiple languages. We push the limits of the multilingual LM to cover up to 84 languages by scaling up using a mixture-of-experts LLM, i.e., generalist language model (GLaM). When the number of experts increases, GLaM dynamically selects only two at each decoding step to keep the inference computation roughly constant. We then apply GLaM to a multilingual shallow fusion task based on a state-of-the-art end-to-end model. Compared to a dense LM of similar computation during inference, GLaM reduces the WER of an English long-tail test set by 4.4% relative. In a multilingual shallow fusion task, GLaM improves 41 out of 50 languages with an average relative WER reduction of 3.85%, and a maximum reduction of 10%. Compared to the baseline model, GLaM achieves an average WER reduction of 5.53% over 43 languages., Comment: Accepted to IEEE ICASSP 2023
Published: 2023

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

5,404 results on '"Hu, Ke"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources