Author: "Yan, Zhiyong" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Yan, Zhiyong"' showing total 497 results

Start Over Author "Yan, Zhiyong"

497 results on '"Yan, Zhiyong"'

1. Enhancing Automated Audio Captioning via Large Language Models with Optimized Audio Encoding

Author: Liu, Jizhong, Li, Gang, Zhang, Junbo, Dinkel, Heinrich, Wang, Yongqing, Yan, Zhiyong, Wang, Yujun, and Wang, Bin
Subjects: Computer Science - Sound, Computer Science - Computation and Language, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Automated audio captioning (AAC) is an audio-to-text task to describe audio contents in natural language. Recently, the advancements in large language models (LLMs), with improvements in training approaches for audio encoders, have opened up possibilities for improving AAC. Thus, we explore enhancing AAC from three aspects: 1) a pre-trained audio encoder via consistent ensemble distillation (CED) is used to improve the effectivity of acoustic tokens, with a querying transformer (Q-Former) bridging the modality gap to LLM and compress acoustic tokens; 2) we investigate the advantages of using a Llama 2 with 7B parameters as the decoder; 3) another pre-trained LLM corrects text errors caused by insufficient training data and annotation ambiguities. Both the audio encoder and text decoder are optimized by low-rank adaptation (LoRA). Experiments show that each of these enhancements is effective. Our method obtains a 33.0 SPIDEr-FL score, outperforming the winner of DCASE 2023 Task 6A., Comment: Accepted by Interspeech 2024
Published: 2024

2. Bridging Language Gaps in Audio-Text Retrieval

Author: Yan, Zhiyong, Dinkel, Heinrich, Wang, Yongqing, Liu, Jizhong, Zhang, Junbo, Wang, Yujun, and Wang, Bin
Subjects: Computer Science - Sound, Computer Science - Computation and Language, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Audio-text retrieval is a challenging task, requiring the search for an audio clip or a text caption within a database. The predominant focus of existing research on English descriptions poses a limitation on the applicability of such models, given the abundance of non-English content in real-world data. To address these linguistic disparities, we propose a language enhancement (LE), using a multilingual text encoder (SONAR) to encode the text data with language-specific information. Additionally, we optimize the audio encoder through the application of consistent ensemble distillation (CED), enhancing support for variable-length audio-text retrieval. Our methodology excels in English audio-text retrieval, demonstrating state-of-the-art (SOTA) performance on commonly used datasets such as AudioCaps and Clotho. Simultaneously, the approach exhibits proficiency in retrieving content in seven other languages with only 10% of additional language-enhanced training data, yielding promising results. The source code is publicly available https://github.com/zyyan4/ml-clap., Comment: interspeech2024
Published: 2024

3. Scaling up masked audio encoder learning for general audio classification

Author: Dinkel, Heinrich, Yan, Zhiyong, Wang, Yongqing, Zhang, Junbo, Wang, Yujun, and Wang, Bin
Subjects: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Despite progress in audio classification, a generalization gap remains between speech and other sound domains, such as environmental sounds and music. Models trained for speech tasks often fail to perform well on environmental or musical audio tasks, and vice versa. While self-supervised (SSL) audio representations offer an alternative, there has been limited exploration of scaling both model and dataset sizes for SSL-based general audio classification. We introduce Dasheng, a simple SSL audio encoder, based on the efficient masked autoencoder framework. Trained with 1.2 billion parameters on 272,356 hours of diverse audio, Dasheng obtains significant performance gains on the HEAR benchmark. It outperforms previous works on CREMA-D, LibriCount, Speech Commands, VoxLingua, and competes well in music and environment classification. Dasheng features inherently contain rich speech, music, and environmental information, as shown in nearest-neighbor classification experiments. Code is available https://github.com/richermans/dasheng/., Comment: Interspeech 2024
Published: 2024

4. Stearoyl-CoA desaturase 1 is targeted by EBV-encoded miR-BART20-5p and regulates cell autophagy, proliferation, and migration in EBV-associated gastric cancer

Author: Gong, Zhiyuan, Shi, Duo, Yan, Zhiyong, Sun, Lingling, Liu, Wen, and Luo, Bing
Published: 2024
Full Text: View/download PDF

5. CED: Consistent ensemble distillation for audio tagging

Author: Dinkel, Heinrich, Wang, Yongqing, Yan, Zhiyong, Zhang, Junbo, and Wang, Yujun
Subjects: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Augmentation and knowledge distillation (KD) are well-established techniques employed in audio classification tasks, aimed at enhancing performance and reducing model sizes on the widely recognized Audioset (AS) benchmark. Although both techniques are effective individually, their combined use, called consistent teaching, hasn't been explored before. This paper proposes CED, a simple training framework that distils student models from large teacher ensembles with consistent teaching. To achieve this, CED efficiently stores logits as well as the augmentation methods on disk, making it scalable to large-scale datasets. Central to CED's efficacy is its label-free nature, meaning that only the stored logits are used for the optimization of a student model only requiring 0.3\% additional disk space for AS. The study trains various transformer-based models, including a 10M parameter model achieving a 49.0 mean average precision (mAP) on AS. Pretrained models and code are available at https://github.com/RicherMans/CED.
Published: 2023

6. Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information

Author: Lin, Jiuxin, Wang, Peng, Dinkel, Heinrich, Chen, Jun, Wu, Zhiyong, Yan, Zhiyong, Wang, Yongqing, Zhang, Junbo, and Wang, Yujun
Subjects: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Previously, Target Speaker Extraction (TSE) has yielded outstanding performance in certain application scenarios for speech enhancement and source separation. However, obtaining auxiliary speaker-related information is still challenging in noisy environments with significant reverberation. inspired by the recently proposed distance-based sound separation, we propose the near sound (NS) extractor, which leverages distance information for TSE to reliably extract speaker information without requiring previous speaker enrolment, called speaker embedding self-enrollment (SESE). Full- & sub-band modeling is introduced to enhance our NS-Extractor's adaptability towards environments with significant reverberation. Experimental results on several cross-datasets demonstrate the effectiveness of our improvements and the excellent performance of our proposed NS-Extractor in different application scenarios., Comment: Proc. INTERSPEECH 2023, 2488-2492, doi: 10.21437/Interspeech.2023-218
Published: 2023

7. AV-SepFormer: Cross-Attention SepFormer for Audio-Visual Target Speaker Extraction

Author: Lin, Jiuxin, Cai, Xinyu, Dinkel, Heinrich, Chen, Jun, Yan, Zhiyong, Wang, Yongqing, Zhang, Junbo, Wu, Zhiyong, Wang, Yujun, and Meng, Helen
Subjects: Computer Science - Multimedia, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Visual information can serve as an effective cue for target speaker extraction (TSE) and is vital to improving extraction performance. In this paper, we propose AV-SepFormer, a SepFormer-based attention dual-scale model that utilizes cross- and self-attention to fuse and model features from audio and visual. AV-SepFormer splits the audio feature into a number of chunks, equivalent to the length of the visual feature. Then self- and cross-attention are employed to model and fuse the multi-modal features. Furthermore, we use a novel 2D positional encoding, that introduces the positional information between and within chunks and provides significant gains over the traditional positional encoding. Our model has two key advantages: the time granularity of audio chunked feature is synchronized to the visual feature, which alleviates the harm caused by the inconsistency of audio and video sampling rate; by combining self- and cross-attention, feature fusion and speech extraction processes are unified within an attention paradigm. The experimental results show that AV-SepFormer significantly outperforms other existing methods., Comment: Accepted by ICASSP2023
Published: 2023

8. MgCo2O4 microflower with excellent thermocatalytic properties for ammonium perchlorate decomposition

Author: Zhao, Zhengyi, Yu, Xin, Zhang, Guofei, Qin, Songnan, Li, Sirong, Yan, Zhiyong, and Xiao, Xuechun
Published: 2024
Full Text: View/download PDF

9. Understanding temporally weakly supervised training: A case study for keyword spotting

Author: Dinkel, Heinrich, Zhuang, Weiji, Yan, Zhiyong, Wang, Yongqing, Zhang, Junbo, and Wang, Yujun
Subjects: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: The currently most prominent algorithm to train keyword spotting (KWS) models with deep neural networks (DNNs) requires strong supervision i.e., precise knowledge of the spoken keyword location in time. Thus, most KWS approaches treat the presence of redundant data, such as noise, within their training set as an obstacle. A common training paradigm to deal with data redundancies is to use temporally weakly supervised learning, which only requires providing labels on a coarse scale. This study explores the limits of DNN training using temporally weak labeling with applications in KWS. We train a simple end-to-end classifier on the common Google Speech Commands dataset with increased difficulty by randomly appending and adding noise to the training dataset. Our results indicate that temporally weak labeling can achieve comparable results to strongly supervised baselines while having a less stringent labeling requirement. In the presence of noise, weakly supervised models are capable to localize and extract target keywords without explicit supervision, leading to a performance increase compared to strongly supervised approaches.
Published: 2023

10. Streaming Audio Transformers for Online Audio Tagging

Author: Dinkel, Heinrich, Yan, Zhiyong, Wang, Yongqing, Zhang, Junbo, Wang, Yujun, and Wang, Bin
Subjects: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Transformers have emerged as a prominent model framework for audio tagging (AT), boasting state-of-the-art (SOTA) performance on the widely-used Audioset dataset. However, their impressive performance often comes at the cost of high memory usage, slow inference speed, and considerable model delay, rendering them impractical for real-world AT applications. In this study, we introduce streaming audio transformers (SAT) that combine the vision transformer (ViT) architecture with Transformer-Xl-like chunk processing, enabling efficient processing of long-range audio signals. Our proposed SAT is benchmarked against other transformer-based SOTA methods, achieving significant improvements in terms of mean average precision (mAP) at a delay of 2s and 1s, while also exhibiting significantly lower memory usage and computational overhead. Checkpoints are publicly available https://github.com/RicherMans/SAT., Comment: Interspeech2024
Published: 2023

11. Unified Keyword Spotting and Audio Tagging on Mobile Devices with Transformers

Author: Dinkel, Heinrich, Wang, Yongqing, Yan, Zhiyong, Zhang, Junbo, and Wang, Yujun
Subjects: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Keyword spotting (KWS) is a core human-machine-interaction front-end task for most modern intelligent assistants. Recently, a unified (UniKW-AT) framework has been proposed that adds additional capabilities in the form of audio tagging (AT) to a KWS model. However, previous work did not consider the real-world deployment of a UniKW-AT model, where factors such as model size and inference speed are more important than performance alone. This work introduces three mobile-device deployable models named Unified Transformers (UiT). Our best model achieves an mAP of 34.09 on Audioset, and an accuracy of 97.76 on the public Google Speech Commands V1 dataset. Further, we benchmark our proposed approaches on four mobile platforms, revealing that the proposed UiT models can achieve a speedup of 2 - 6 times against a competitive MobileNetV2., Comment: ICASSP 2023
Published: 2023

12. Application of ultrasound-guided anterior quadratus lumborum block at the lateral supra-arcuate ligament in bariatric surgery

Author: ZHONG Mingjie*, ZHU Wei, CONG Wenbo, YAN Zhiyong, GENG Ying
Subjects: ultrasound-guided， arcuate ligament， quadratus lumborum block， transversus abdominis plane block， laparoscopic sleeve gastrectomy， ropivacaine, Medicine
Abstract: Objective To compare the analgesic effect of ultrasound-guided anterior quadratus lumborum block at the lateral supra-arcuate ligament (QLB-LSAL) and transversus abdominis plane block (TAPB) in laparoscopic sleeve gastrectomy (LSG). Methods From January 2023 to January 2024, 90 patients underwent LSG in Suqian First People's Hospital were randomly divided into two groups: QLB-LSAL group and TAPB group, 45 cases in each group. Bilateral nerve block was performed before induction of general anesthesia, and 0.375% ropivacaine 20 mL was injected into each side of both groups. Both groups of patients received the same general anesthesia and postoperative patient-controlled intravenous analgesia (PCIA) regimen. The number of block dermatomes after block, mean arterial pressure (MAP), heart rate (HR), visual analogue scale (VAS) score were measured in different time. The intraoperative consumption of sufentanil and remifentanil, the interval time from the end of operation to the first pressing of the analgesia pump, the consumption of analgesics within 48 h after operation, the requirement for rescue analgesia, and the incidence of adverse reactions were recorded. Results The MAP and HR at 1 min and 5 min after skin incision, the intraoperative consumption of remifentanil, the VAS score at 2,6,12,24 h after operation, the consumption of analgesics within 48 h after operation, and the incidence of nausea and vomiting in QLB-LSAL group were significantly lower than those in TAPB group (P＜0.05). The number of block dermatomes at 5 min, 10 min, 6 h, 24 h after block, and the interval time from the end of operation to the first pressing of the analgesia pump in QLB-LSAL group were significantly higher than those in TAPB group (P＜0.05). There was no significant difference in the intraoperative consumption of sufentanil, the requirement for rescue analgesia, and the incidence of respiratory depression between the two groups (P＞0.05). Conclusion Ultrasound-guided QLB-LSAL combined with general anesthesia can stabilize hemodynamics, reduce the consumption of intraoperative opioids, and provide effective postoperative analgesia in patients received LSG.
Published: 2024
Full Text: View/download PDF

13. An empirical study of weakly supervised audio tagging embeddings for general audio representations

Author: Dinkel, Heinrich, Yan, Zhiyong, Wang, Yongqing, Zhang, Junbo, and Wang, Yujun
Subjects: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: We study the usability of pre-trained weakly supervised audio tagging (AT) models as feature extractors for general audio representations. We mainly analyze the feasibility of transferring those embeddings to other tasks within the speech and sound domains. Specifically, we benchmark weakly supervised pre-trained models (MobileNetV2 and EfficientNet-B0) against modern self-supervised learning methods (BYOL-A) as feature extractors. Fourteen downstream tasks are used for evaluation ranging from music instrument classification to language classification. Our results indicate that AT pre-trained models are an excellent transfer learning choice for music, event, and emotion recognition tasks. Further, finetuning AT models can also benefit speech-related tasks such as keyword spotting and intent classification., Comment: Odyssey 2022
Published: 2022
Full Text: View/download PDF

14. UniKW-AT: Unified Keyword Spotting and Audio Tagging

Author: Dinkel, Heinrich, Wang, Yongqing, Yan, Zhiyong, Zhang, Junbo, and Wang, Yujun
Subjects: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Within the audio research community and the industry, keyword spotting (KWS) and audio tagging (AT) are seen as two distinct tasks and research fields. However, from a technical point of view, both of these tasks are identical: they predict a label (keyword in KWS, sound event in AT) for some fixed-sized input audio segment. This work proposes UniKW-AT: An initial approach for jointly training both KWS and AT. UniKW-AT enhances the noise-robustness for KWS, while also being able to predict specific sound events and enabling conditional wake-ups on sound events. Our approach extends the AT pipeline with additional labels describing the presence of a keyword. Experiments are conducted on the Google Speech Commands V1 (GSCV1) and the balanced Audioset (AS) datasets. The proposed MobileNetV2 model achieves an accuracy of 97.53% on the GSCV1 dataset and an mAP of 33.4 on the AS evaluation set. Further, we show that significant noise-robustness gains can be observed on a real-world KWS dataset, greatly outperforming standard KWS approaches. Our study shows that KWS and AT can be merged into a single framework without significant performance degradation., Comment: Accepted in Interspeech2022
Published: 2022
Full Text: View/download PDF

15. Pseudo strong labels for large scale weakly supervised audio tagging

Author: Dinkel, Heinrich, Yan, Zhiyong, Wang, Yongqing, Zhang, Junbo, and Wang, Yujun
Subjects: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Large-scale audio tagging datasets inevitably contain imperfect labels, such as clip-wise annotated (temporally weak) tags with no exact on- and offsets, due to a high manual labeling cost. This work proposes pseudo strong labels (PSL), a simple label augmentation framework that enhances the supervision quality for large-scale weakly supervised audio tagging. A machine annotator is first trained on a large weakly supervised dataset, which then provides finer supervision for a student model. Using PSL we achieve an mAP of 35.95 balanced train subset of Audioset using a MobileNetV2 back-end, significantly outperforming approaches without PSL. An analysis is provided which reveals that PSL mitigates missing labels. Lastly, we show that models trained with PSL are also superior at generalizing to the Freesound datasets (FSD) than their weakly trained counterparts., Comment: Accepted by ICASSP 2022
Published: 2022

16. Sustainable rewritable paper based on photoresponsive tungsten oxide quantum dots for anti-counterfeiting and waterproofing

Author: Chen, Tiandi, Mai, Xiaoxue, Li, Yiyun, Wang, Tingting, Gong, Rui, Chen, Feixiong, Huang, Hongjie, Yan, Zhiyong, and Wang, Feng
Published: 2024
Full Text: View/download PDF

17. In-situ construction of N-doped Zn0.6Cd0.4S/oxygen vacancy-rich WO3 Z-scheme heterojunction compound for boosting photocatalytic hydrogen production

Author: Dong, Yuxin, Ma, Yueting, Shu, Aoqiang, Yan, Zhiyong, Wang, Hou, and Wu, Yan
Published: 2025
Full Text: View/download PDF

18. Ternary metal chalcogenides hollow tube derived from MOF sulfurization for photocatalytic hydrogen generation from water and tetracycline wastewater

Author: Wu, Yunchao, Qi, Yige, Zhou, Guoxi, Wang, Hou, Yan, Zhiyong, and Wu, Yan
Published: 2024
Full Text: View/download PDF

19. Unveiling the intratumoral microbiota within cancer landscapes

Author: Che, Shusheng, Yan, Zhiyong, Feng, Yugong, and Zhao, Hai
Published: 2024
Full Text: View/download PDF

20. In-situ construction of In2O3/In2S3-CdIn2S4 Z-scheme heterojunction nanotubes for enhanced photocatalytic hydrogen production

Author: Qi, Yige, Zhou, Guoxi, Wu, Yunchao, Wang, Hou, Yan, Zhiyong, and Wu, Yan
Published: 2024
Full Text: View/download PDF

21. GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio

Author: Chen, Guoguo, Chai, Shuzhou, Wang, Guanbo, Du, Jiayu, Zhang, Wei-Qiang, Weng, Chao, Su, Dan, Povey, Daniel, Trmal, Jan, Zhang, Junbo, Jin, Mingjie, Khudanpur, Sanjeev, Watanabe, Shinji, Zhao, Shuaijiang, Zou, Wei, Li, Xiangang, Yao, Xuchen, Wang, Yongqing, Wang, Yujun, You, Zhao, and Yan, Zhiyong
Subjects: Computer Science - Sound, Computer Science - Computation and Language, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: This paper introduces GigaSpeech, an evolving, multi-domain English speech recognition corpus with 10,000 hours of high quality labeled audio suitable for supervised training, and 40,000 hours of total audio suitable for semi-supervised and unsupervised training. Around 40,000 hours of transcribed audio is first collected from audiobooks, podcasts and YouTube, covering both read and spontaneous speaking styles, and a variety of topics, such as arts, science, sports, etc. A new forced alignment and segmentation pipeline is proposed to create sentence segments suitable for speech recognition training, and to filter out segments with low-quality transcription. For system training, GigaSpeech provides five subsets of different sizes, 10h, 250h, 1000h, 2500h, and 10000h. For our 10,000-hour XL training subset, we cap the word error rate at 4% during the filtering/validation stage, and for all our other smaller training subsets, we cap it at 0%. The DEV and TEST evaluation sets, on the other hand, are re-processed by professional human transcribers to ensure high transcription quality. Baseline systems are provided for popular speech recognition toolkits, namely Athena, ESPnet, Kaldi and Pika.
Published: 2021

22. Mesoporous FeCo2O4 nanosheet-supported Pt for oxygen reduction and oxygen evolution reaction bi-functional catalytic performance

Author: Li, Sirong, Zhao, Mengyao, Wang, Zhenlong, Zhang, Zhanyu, Yan, Zhiyong, and Xiao, Xuechun
Published: 2023
Full Text: View/download PDF

23. A novel self-enhanced ECL-RET aptasensor based on the bimetallic MOFs with homogeneous catalytic sites for kanamycin detection

Author: Wang, Xuemei, Zhang, Feifei, Xia, Jianfei, Yan, Zhiyong, and Wang, Zonghua
Published: 2024
Full Text: View/download PDF

24. AC loss study of high-temperature superconducting stacked conductors based on parameter identification method

Author: Lei, Zhiwen, Wei, Junwen, Yan, Zhiyong, Li, Zi, Kang, Xuyang, Wang, Suxin, Xu, Ying, Long, Feiyang, and Tan, Yunfei
Published: 2024
Full Text: View/download PDF

25. The mechanism of mineral dissolution and its impact on pore evolution of CO2 flooding in tight sandstone: A case study from the Chang 7 member of the Triassic Yanchang formation in the Ordos Basin, China

Author: Wang, Wei, Yan, Zhiyong, Chen, Dayou, He, Yi, Liang, Zhengzhong, Li, Yahui, and Han, Yuyan
Published: 2024
Full Text: View/download PDF

26. speechocean762: An Open-Source Non-native English Speech Corpus For Pronunciation Assessment

Author: Zhang, Junbo, Zhang, Zhiwen, Wang, Yongqing, Yan, Zhiyong, Song, Qiong, Huang, Yukai, Li, Ke, Povey, Daniel, and Wang, Yujun
Subjects: Computer Science - Computation and Language, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: This paper introduces a new open-source speech corpus named "speechocean762" designed for pronunciation assessment use, consisting of 5000 English utterances from 250 non-native speakers, where half of the speakers are children. Five experts annotated each of the utterances at sentence-level, word-level and phoneme-level. A baseline system is released in open source to illustrate the phoneme-level pronunciation assessment workflow on this corpus. This corpus is allowed to be used freely for commercial and non-commercial purposes. It is available for free download from OpenSLR, and the corresponding baseline system is published in the Kaldi speech recognition toolkit., Comment: Accepted in INTERSPEECH 2021
Published: 2021

27. Mechanisms of total phosphorus removal and reduction of β-lactam antibiotic resistance genes by exogenous fungal combination activated sludge

Author: Cai, Yixiang, Liu, Feng, He, Guiyi, Kong, Xiaoliang, Jiang, Yuexi, Liu, Ji, Yan, Binghua, Zhang, Shunan, Zhang, Jiachao, and Yan, Zhiyong
Published: 2024
Full Text: View/download PDF

28. Data Augmentation For Children's Speech Recognition -- The 'Ethiopian' System For The SLT 2021 Children Speech Recognition Challenge

Author: Chen, Guoguo, Na, Xingyu, Wang, Yongqing, Yan, Zhiyong, Zhang, Junbo, Ma, Sifan, and Wang, Yujun
Subjects: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: This paper presents the "Ethiopian" system for the SLT 2021 Children Speech Recognition Challenge. Various data processing and augmentation techniques are proposed to tackle children's speech recognition problem, especially the lack of the children's speech recognition training data issue. Detailed experiments are designed and conducted to show the effectiveness of each technique, across different speech recognition toolkits and model architectures. Step by step, we explain how we come up with our final system, which provides the state-of-the-art results in the SLT 2021 Children Speech Recognition Challenge, with 21.66% CER on the Track 1 evaluation set (4th place overall), and 16.53% CER on the Track 2 evaluation set (1st place overall). Post-challenge analysis shows that our system actually achieves 18.82% CER on the Track 1 evaluation set, but we submitted the wrong version to the challenge organizer for Track 1., Comment: System description of the SLT 2021 Children Speech Recognition Challenge
Published: 2020

29. Double-modified C–MgCo2O4 as a highly active catalytic material for the thermal decomposition of AP and its application for CSPs

Author: Zhang, Guofei, Yu, Xin, Wang, Zhenlong, Li, Sirong, Zhao, Zhengyi, Zhu, Yunjiong, Yan, Zhiyong, Chen, Gang, and Xiao, Xuechun
Published: 2023
Full Text: View/download PDF

30. Enhanced catalytic behavior of La2O2CO3/Co3O4 composite for thermal decomposition of ammonium perchlorate

Author: Zhu, Yunjiong, Yu, Xin, Zhang, Guofei, Yan, Zhiyong, and Xiao, Xuechun
Published: 2023
Full Text: View/download PDF

31. One-step electrodeposition synthesis of amorphous NiCoFe(OH)x/NF as an efficient catalyst for urea-assisted overall water splitting

Author: Li, Sirong, Zhang, Yipeng, Yu, Xin, Wang, Zhenlong, Zhang, Guofei, Zhao, Zhengyi, Yan, Zhiyong, and Xiao, Xuechun
Published: 2023
Full Text: View/download PDF

32. Qualitative and quantitative analysis of triterpenoids in different tissues of Pulsatilla chinensis

Author: Zhang, Tinglan, Zhang, Jun, Chen, Fangfang, Liu, An, Jiang, Jinzhu, Yan, Zhiyong, and Liu, Xianju
Published: 2023
Full Text: View/download PDF

33. Preparation of MgFeMn-LDOs and its thallium(I) adsorption performance in aqueous and mechanism

Author: He, Guiyi, Zeng, Meiqing, Yan, Zhiyong, and Liu, Yaochi
Published: 2023
Full Text: View/download PDF

34. Modified Dingchuan Decoction treats cough-variant asthma by suppressing lung inflammation and regulating the lung microbiota

Author: Wen, Lingmiao, Zhang, Tinglan, Chen, Fangfang, Hu, Lin, Dou, Chongyang, Ding, Xian, Altamirano, Alvin, Wei, Guihua, and Yan, Zhiyong
Published: 2023
Full Text: View/download PDF

35. Treatment of amoxicillin-containing wastewater by Trichoderma strains selected from activated sludge

Author: Cai, Yixiang, Yu, Hong, Ren, Liheng, Ou, Yingjuan, Jiang, Shilin, Chai, Youzheng, Chen, Anwei, Yan, Binghua, Zhang, Jiachao, and Yan, Zhiyong
Published: 2023
Full Text: View/download PDF

36. Carbon Nanomaterials-PEDOT: PSS Based Electrochemical Ionic Soft Actuators: Recent Development in Design and Applications

Author: Li, Yali, Liu, Weiwei, Gao, Xiaolong, Zou, Tao, Deng, Pingye, Zhao, Jin, Zhang, Tao, Chen, Yudi, He, Liying, Shao, Leihou, Yan, Zhiyong, and Zhang, Xiaoguang
Published: 2023
Full Text: View/download PDF

37. Conceptual Design of a HTS CICC Coil for Fusion Devices Based on Bi-2223 Tapes

Author: Yan, Zhiyong, primary, Wang, Suxing, additional, Lei, Zhiwen, additional, Wei, Junwen, additional, Kang, Xuyang, additional, Li, Zi, additional, and Tan, Yunfei, additional
Published: 2024
Full Text: View/download PDF

38. Study of Transverse Compressive Stress Effects on the Critical Current of Bi-2223/Ag HTS Tapes

Author: Wang, Suxin, primary, Yan, Zhiyong, additional, Wei, Junwen, additional, Lei, Zhiwen, additional, Kang, Xuyang, additional, Li, Zi, additional, and Tan, Yunfei, additional
Published: 2024
Full Text: View/download PDF

39. Sensitive photoelectrochemical biosensors based on AuNPs/MXenes electrode coupled with light-harvesting UiO-66-NH2 probes for protein kinase detection

Author: Yan, Zhiyong, Li, Yansen, Wei, XiaoXiao, Li, Pan, Jiang, Jingjing, Chen, Yongjia, Duan, Pengfei, Wang, Xiaoyang, Deng, Pingye, and Liu, Xiangwen
Published: 2022
Full Text: View/download PDF

40. Computational approach to decode the mechanism of curcuminoids against neuropathic pain

Author: Xiang, Chunxiao, Chen, Chunlan, Li, Xi, Wu, Yating, Xu, Qing, Wen, Lingmiao, Xiong, Wei, Liu, Yanjun, Zhang, Tinglan, Dou, Chongyang, Ding, Xian, Hu, Lin, Chen, Fangfang, Yan, Zhiyong, Liang, Lingli, and Wei, Guihua
Published: 2022
Full Text: View/download PDF

41. Porous cobaltate: Structure, active sites, thermocatalytic properties for ammonium perchlorate decomposition

Author: Xiao, Xuechun, Zhang, Guofei, Wang, Zhenlong, Zhu, Yunjiong, Yan, Zhiyong, and Wang, Yude
Published: 2022
Full Text: View/download PDF

42. Integrating dual-defects and the heterojunction in ZnIn2S4−x/g-C3N4−x composites induces breaking-symmetry for photocatalytic hydrogen production.

Author: Zhou, Guoxi, Qi, Yige, Wu, Yunchao, Wang, Hou, Yan, Zhiyong, and Wu, Yan
Abstract: Solar photocatalysis presents a promising solution to address both energy demands and the associated environmental concerns. However, the rapid combination of photogenerated charges internally and on the material's surface seriously affects the photocatalytic efficiency. In this work, N and S double-defected heterojunction ZISSv/CNNv composites were prepared by thermal polymerization and hydrothermal methods, and the defects present in the materials caused the symmetry of the crystal structure of the materials to be broken. Femtosecond transient absorption spectroscopy indicated that material defects induce internal symmetry breaking, generating shallow trapping states that decelerate the photogenerated charge recombination. The configuration attributed to the heterojunction produces a differentiated interfacial component, so that the interfacial symmetry is broken and an interfacial electric field is formed, accelerating the charge migration. This study offers a holistic approach by breaking symmetry to regulate charge dynamics within and across materials. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

43. Seafloor Rayleigh ellipticity, measured from unoriented data, and its significance for passive seismic imaging in the ocean.

Author: Ai, Sanxi, Akuhara, Takeshi, He, Lipeng, Xiong, Cheng, Yan, Zhiyong, and Zheng, Yong
Subjects: IMAGING systems in seismology, FREE surfaces, SEISMOMETERS, OCEAN, TOMOGRAPHY
Abstract: Observations of the seafloor Rayleigh ellipticity contribute to seismic imaging in the ocean. To extract such observables from the arbitrarily oriented ocean–bottom seismometer (OBS) data, we develop an orthogonal-regression-based approach to measure the waveform amplitude ratios of the unoriented horizontal and vertical components. The amplitude ratios are then used to calculate the Rayleigh ellipticity (and the sensor orientation angle). The robustness of our method is verified by applications to both the unoriented OBS data and the well oriented on-land seismic data. As we propose to calculate the Rayleigh ellipticity directly from the unoriented three-component data, the measurement process avoids the complexity arising from the surface wave non-great-circle effects and uncertainties of the OBS sensor orientation angles. Overall the Rayleigh ellipticity measurements from our method are systematically higher than those by conventional analysis and show less uncertainties. Our analyses suggest that the Rayleigh ellipticity curve (14–60 s), which could be retrieved from the raw broad-band OBS data, is effective to constrain the oceanic lithosphere structure, and the accurate measurement of Rayleigh ellipticity curve is important. The potential of seafloor Rayleigh ellipticity for seismic imaging in the ocean is evidenced by a case study of the Japan Basin, the Sea of Japan. Considering the insufficient station coverage in the ocean, the single-station measurement of seafloor Rayleigh ellipticity is of significance for OBS community. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

44. Qisheng Wan formula ameliorates cognitive impairment of Alzheimer's disease rat via inflammation inhibition and intestinal microbiota regulation

Author: Xiong, Wei, Zhao, Xiaoqin, Xu, Qing, Wei, Guihua, Zhang, Liudai, Fan, Yuqing, Wen, Lingmiao, Liu, Yanjun, Zhang, Tinglan, Zhang, Li, Tong, Yan, Yin, Qiaozhi, Zhang, Tian-e, and Yan, Zhiyong
Published: 2022
Full Text: View/download PDF

45. Effect of Hydrothermal Aging on Mechanical Properties and Long-Term Durability of Recycled Polysulfone

Author: Li, Zhijiang, Yu, Lichao, and Yan, Zhiyong
Published: 2022
Full Text: View/download PDF

46. CED: Consistent Ensemble Distillation for Audio Tagging

Author: Dinkel, Heinrich, primary, Wang, Yongqing, additional, Yan, Zhiyong, additional, Zhang, Junbo, additional, and Wang, Yujun, additional
Published: 2024
Full Text: View/download PDF

47. Effect of Cucumber Continuous Monocropping on Traditional Chinese Medicine Residue through Analysis of Physicochemical Characteristics and Microbial Diversity

Author: Zhao, Qingsong, primary, Dong, Jingjing, additional, Yan, Zhiyong, additional, Xu, Ling, additional, and Liu, Ake, additional
Published: 2024
Full Text: View/download PDF

48. Ameliorative effects of escin on neuropathic pain induced by chronic constriction injury of sciatic nerve

Author: Zhang, Liudai, Chen, Xiu, Wu, Lanlan, Li, Yongbiao, Wang, Liwen, Zhao, Xiaoqin, Zhao, Tingting, Zhang, Li, Yan, Zhiyong, and Wei, Guihua
Published: 2021
Full Text: View/download PDF

49. Precise analysis of T4 polynucleotide kinase and inhibition by coupling personal glucose meter with split DNAzyme and ligation-triggered DNA walker

Author: Yan, Zhiyong, Shen, Xiaoyan, Zhou, Baolong, Pan, Ruiyan, Zhang, Bo, Zhao, Chunzhen, Ren, Lanhui, and Ming, Jingjing
Published: 2021
Full Text: View/download PDF

50. Deciphering Subduction Polarity During Ancient Arc‐Continent Collisions.

Author: Yan, Zhiyong, Chen, Lin, Zuza, Andrew V., Xiang, Xiao, Xie, Renxian, and Ai, Sanxi
Subjects: *SUBDUCTION zones, *ISLAND arcs, *METAMORPHIC rocks, *CONTINENTAL margins, *TOPOGRAPHY
Abstract: The closure of an ancient ocean basin via oceanic arc‐continent collision has two subduction styles with opposite polarities, which may proceed via subduction polarity reversal (SPR) or a subduction zone jump (SZJ). Interpreting the geometry or kinematic evolution of ancient collisional zones, especially the original subduction polarity, can be challenging. Here we used 2D thermo‐mechanical modeling to investigate the dynamic evolution process of SPR versus SZJ. Our modeling predicts different structural, topographic, magmatic, and basin histories for SPR and SZJ, which can be compared against, and help interpret, the geologic record past sites of oceanic closure during collisional orogens. Our results match geologic observations of past collisions in Kamchatka, eastern Russia, and the Banda Arc, eastern Indonesia, and thus our results can help effectively decode the evolutionary history of past arc‐continent collisions. Plain Language Summary: Determining the geometry and kinematic evolution of ancient subduction zones that experienced collision with an oceanic island arc can be challenging based on the surface geology along. Such collisions usually result in different dynamical evolution processes, namely subduction polarity reversal (SPR) or a subduction zone jump (SZJ). Here we conducted numerical modeling of oceanic island arcs that collide with a continental margin to explore the dynamic evolution process of different subduction styles. Our results reveal geologic indicators to decipher SPR versus SZJ in natural oceanic arc‐continent collisions, such as the distribution of thrust faults, metamorphic rocks, magmatism, crustal thickness, and topography. The numerical simulations help explain the geologic history of Kamchatka in eastern Russia and Banda Arc in eastern Indonesia. This study provides provide new insights and implications diagnosing the polarity of vanished subduction in arc‐continental systems. Key Points: New numerical models with convergent velocity boundary condition for deciphering subduction polarity during arc‐continent collisionsThrust faults, metamorphic rocks, magmatism, topography and Moho morphology can be used as indicators to diagnose subduction polarityThe evolution of subduction polarity reversal well explains the tectonic activities of the Kamchatka and Banda Arc in the Cenozoic [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

497 results on '"Yan, Zhiyong"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources