Author: "Zhao, Wenlong" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Zhao, Wenlong"' showing total 498 results

Start Over Author "Zhao, Wenlong"

498 results on '"Zhao, Wenlong"'

1. Quantifying reliance on external information over parametric knowledge during Retrieval Augmented Generation (RAG) using mechanistic analysis

Author: Ghosh, Reshmi, Seetharaman, Rahul, Wadhwa, Hitesh, Aggarwal, Somyaa, Basu, Samyadeep, Srinivasan, Soundararajan, Zhao, Wenlong, Chaudhari, Shreyas, and Aghazadeh, Ehsan
Subjects: Computer Science - Computation and Language
Abstract: Retrieval Augmented Generation (RAG) is a widely used approach for leveraging external context in several natural language applications such as question answering and information retrieval. Yet, the exact nature in which a Language Model (LM) leverages this non-parametric memory or retrieved context isn't clearly understood. This paper mechanistically examines the RAG pipeline to highlight that LMs demonstrate a "shortcut'' effect and have a strong bias towards utilizing the retrieved context to answer questions, while relying minimally on model priors. We propose (a) Causal Mediation Analysis; for proving that parametric memory is minimally utilized when answering a question and (b) Attention Contributions and Knockouts for showing the last token residual stream do not get enriched from the subject token in the question, but gets enriched from tokens of RAG-context. We find this pronounced "shortcut'' behaviour to be true across both LLMs (e.g.,LlaMa) and SLMs (e.g., Phi), Comment: Accepted to Blackbox NLP @ EMNLP 2024
Published: 2024

2. Different influence of gas accretion on the evolution of star-forming and non-star-forming galaxies

Author: Bao, Min, Zhao, Wenlong, and Yuan, Qirong
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: Using integral field spectroscopic data from the Mapping Nearby Galaxies at Apache Point Observatory survey, we investigate the spatially resolved properties and empirical relations of a star-forming galaxy and a non-star-forming galaxy hosting counter-rotating stellar disks (CRDs). The DESI $g, r, z$ color images reveal no evidence of merger remnants in either galaxy, suggesting that gas accretion fuels the formation of CRDs. Based on the visible counter-rotation in the stellar velocity field, we can fit a spatial boundary to distinguish the inner and outer regions dominated by two stellar disks in each galaxy. In the inner region of the star-forming CRDs, stars are co-rotating with ionized gas, and the stellar population is younger. Comparison of the star-forming main sequence relations between the inner and outer regions reveals enhanced star formation in the inner region. Given the abundant pre-existing gas in the star-forming galaxy, collisions between pre-existing and external gas efficiently consume angular momentum, triggering star formation in the inner region. Conversely, in the outer region of the non-star-forming CRDs, stars are co-rotating with ionized gas, and the stellar population is younger. Comparison of the stellar mass-metallicity relations between the inner and outer regions indicates enriched gas-phase metallicity in the outer region. Considering the less abundant pre-existing gas in the non-star-forming galaxy, external gas could preserve angular momentum, fueling star formation in the outer region. Overall, gas accretion exhibits different influence on the evolution of star-forming and non-star-forming galaxies., Comment: Accepted for publication in ApJ, 14 pages, 11 figures
Published: 2024

3. From RAGs to rich parameters: Probing how language models utilize external knowledge over parametric information for factual queries

Author: Wadhwa, Hitesh, Seetharaman, Rahul, Aggarwal, Somyaa, Ghosh, Reshmi, Basu, Samyadeep, Srinivasan, Soundararajan, Zhao, Wenlong, Chaudhari, Shreyas, and Aghazadeh, Ehsan
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Retrieval Augmented Generation (RAG) enriches the ability of language models to reason using external context to augment responses for a given user prompt. This approach has risen in popularity due to practical applications in various applications of language models in search, question/answering, and chat-bots. However, the exact nature of how this approach works isn't clearly understood. In this paper, we mechanistically examine the RAG pipeline to highlight that language models take shortcut and have a strong bias towards utilizing only the context information to answer the question, while relying minimally on their parametric memory. We probe this mechanistic behavior in language models with: (i) Causal Mediation Analysis to show that the parametric memory is minimally utilized when answering a question and (ii) Attention Contributions and Knockouts to show that the last token residual stream do not get enriched from the subject token in the question, but gets enriched from other informative tokens in the context. We find this pronounced shortcut behaviour true across both LLaMa and Phi family of models.
Published: 2024

4. Comparing Neighbors Together Makes it Easy: Jointly Comparing Multiple Candidates for Efficient and Effective Retrieval

Author: Song, Jonghyun, Jin, Cheyon, Zhao, Wenlong, McCallum, Andrew, and Lee, Jay-Yoon
Subjects: Computer Science - Computation and Language, Computer Science - Information Retrieval, Computer Science - Machine Learning
Abstract: A common retrieve-and-rerank paradigm involves retrieving relevant candidates from a broad set using a fast bi-encoder (BE), followed by applying expensive but accurate cross-encoders (CE) to a limited candidate set. However, relying on this small subset is often susceptible to error propagation from the bi-encoders, which limits the overall performance. To address these issues, we propose the Comparing Multiple Candidates (CMC) framework. CMC compares a query and multiple embeddings of similar candidates (i.e., neighbors) through shallow self-attention layers, delivering rich representations contextualized to each other. Furthermore, CMC is scalable enough to handle multiple comparisons simultaneously. For example, comparing ~10K candidates with CMC takes a similar amount of time as comparing 16 candidates with CE. Experimental results on the ZeSHEL dataset demonstrate that CMC, when plugged in between bi-encoders and cross-encoders as a seamless intermediate reranker (BE-CMC-CE), can effectively improve recall@k (+4.8%-p, +3.5%-p for R@16, R@64) compared to using only bi-encoders (BE-CE), with negligible slowdown (<7%). Additionally, to verify CMC's effectiveness as the final-stage reranker in improving top-1 accuracy, we conduct experiments on downstream tasks such as entity, passage, and dialogue ranking. The results indicate that CMC is not only faster (11x) but also often more effective than CE, with improved prediction accuracy in Wikipedia entity linking (+0.7%-p) and DSTC7 dialogue ranking (+3.3%-p)., Comment: accepted to EMNLP 2024 main track
Published: 2024

5. WorldValuesBench: A Large-Scale Benchmark Dataset for Multi-Cultural Value Awareness of Language Models

Author: Zhao, Wenlong, Mondal, Debanjan, Tandon, Niket, Dillion, Danica, Gray, Kurt, and Gu, Yuling
Subjects: Computer Science - Computation and Language, Computer Science - Computers and Society
Abstract: The awareness of multi-cultural human values is critical to the ability of language models (LMs) to generate safe and personalized responses. However, this awareness of LMs has been insufficiently studied, since the computer science community lacks access to the large-scale real-world data about multi-cultural values. In this paper, we present WorldValuesBench, a globally diverse, large-scale benchmark dataset for the multi-cultural value prediction task, which requires a model to generate a rating response to a value question based on demographic contexts. Our dataset is derived from an influential social science project, World Values Survey (WVS), that has collected answers to hundreds of value questions (e.g., social, economic, ethical) from 94,728 participants worldwide. We have constructed more than 20 million examples of the type "(demographic attributes, value question) $\rightarrow$ answer" from the WVS responses. We perform a case study using our dataset and show that the task is challenging for strong open and closed-source models. On merely $11.1\%$, $25.0\%$, $72.2\%$, and $75.0\%$ of the questions, Alpaca-7B, Vicuna-7B-v1.5, Mixtral-8x7B-Instruct-v0.1, and GPT-3.5 Turbo can respectively achieve $<0.2$ Wasserstein 1-distance from the human normalized answer distributions. WorldValuesBench opens up new research avenues in studying limitations and opportunities in multi-cultural value awareness of LMs., Comment: Accepted at LREC-COLING 2024. Wenlong and Debanjan contributed equally
Published: 2024

6. Incidence and risk factors for cardiac rupture after ST-segment elevation myocardial infarction in contemporary era: findings from the improving care for cardiovascular disease in China-Acute Coronary Syndrome project

Author: Yang, Na, Zhao, Wenlong, Hao, Yongchen, Liu, Jun, Liu, Jing, Zhao, Xuedong, Yan, Yan, Nie, Shaoping, and Gong, Wei
Published: 2024
Full Text: View/download PDF

7. Atomic-scale investigation on diffusion mechanism of immiscible Mo/Cu system under different temperatures and electric field

Author: Lu, Yunqi, Guo, Xiaolin, Zhao, Wenlong, Wang, Yumeng, Du, Jinlong, Mao, Chunliang, Li, Ming, Wang, Pengfei, Zhang, Lu, and Tong, Jiaqian
Published: 2024
Full Text: View/download PDF

8. Enhanced K-NN with Bayesian optimization algorithm for predicting energy efficiency of smart grids in IoT

Author: Zhao, Wenlong, Hu, Yuanchao, Yan, Xiaopeng, Liu, Xiaowei, Ding, Rixian, Dai, Chaofeng, and Cao, Ying
Published: 2024
Full Text: View/download PDF

9. Multistage Collaborative Knowledge Distillation from a Large Language Model for Semi-Supervised Sequence Generation

Author: Zhao, Jiachen, Zhao, Wenlong, Drozdov, Andrew, Rozonoyer, Benjamin, Sultan, Md Arafat, Lee, Jay-Yoon, Iyyer, Mohit, and McCallum, Andrew
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: We study semi-supervised sequence generation tasks, where the few labeled examples are too scarce to finetune a model, and meanwhile, few-shot prompted large language models (LLMs) exhibit room for improvement. In this paper, we present the discovery that a student model distilled from a few-shot prompted LLM can commonly generalize better than its teacher to unseen examples on such tasks. We find that the student is able to learn a general pattern from the high-quality pseudolabels produced by the teacher during knowledge distillation (KD), and favorably not a general pattern from the low-quality pseudolables. Leveraging this discovery, we propose a new method, Multistage Collaborative Knowledge Distillation from an LLM (MCKD), for these tasks. MCKD first few-shot prompts an LLM to produce pseudolabels for unlabeled data. Then at each stage of an iterative KD process, a new pair of students is trained on disjoint partitions of the pseudolabeled data, and produces new and improved pseudolabels for their unseen partitions. We conduct extensive experiments on four syntactic and semantic parsing datasets and show the effectiveness of MCKD for low-resource semi-supervised sequence generation. On CRAFT biomedical parsing, for example, 3-stage MCKD with 50 labeled examples outperforms an LLM teacher and vanilla KD by 7.5% and 3.7% parsing F1, respectively, and matches the performance of supervised finetuning with 500 labeled examples., Comment: ACL 2024
Published: 2023

10. Changes of morphological, physiological, and biochemical indices during seed development of Notopterygium incisum Ting ex H.T.Chang

Author: GENG Beirui, GAO Jie, GU Xiaoying, JIN Ling, CHEN Honggang, and ZHAO Wenlong
Subjects: notopterygium incisum ting ex h. t. chang, seed development, physiological and biochemical indicators, appearance and morphology, Biology (General), QH301-705.5, Botany, QK1-989
Abstract: Abstract [Objective] The study aims to explore the changes of morphological, physiological, and biochemical indexes during the development of Notopterygium incisum seeds, and to provide theoretical basis for the introduction and domestication of N . incisum and the cultivation of high-quality germplasm resources. [Methods] The morphological, physiological, and biochemical indexes of N . incisum seeds at different developmental stages were determined by observation samplings and the frozen section method. [Results] During the development of N . incisum seeds, seed size and wing thickness were first increased and then decreased, while wing width was increased. The color of the outer seed coat was changed from bright blue-green to yellow-brown. The contents of starch, soluble sugar, and soluble protein were increased. Amylase activity was fluctuated in a small range; succinate dehydrogenase activity was increased; glucose-6-phosphate dehydrogenase activity was increased and decreased alternately; pyruvate kinase activity showed “M”-shape changes. POD activity was decreased while SOD activity was decreased. GA3, IAA, and ZR contents were decreased, while ABA contents were increased. The morphological indexes of the seeds were negatively correlated with the activity of SOD. The content of the storage materials was negatively correlated with the seed thickness, but positively correlated with the width of the fruit wings. [Conclusion] When N . incisum embryo is developed to pre-cardioid embryo, the outer seed coat is changed from blue-green to yellow-brown, the content of storage material is increased, and the activity of key enzymes is fluctuated and decreased. The content of growth-promoting hormones is fluctuated and decreased, while the content of stress hormones is increased.
Published: 2024
Full Text: View/download PDF

11. Prognostic implications of obstructive sleep apnea in patients with unstable angina stratified by remnant cholesterol and triglyceride: a prospective cohort study

Author: Zhou, Yun, Gong, Wei, Yan, Yan, Wang, Xiao, Zheng, Wen, Que, Bin, Li, Siyi, Zhang, Zekun, Chen, Xiuhuan, Fan, Jingyao, Zhao, Wenlong, Xin, Qingjie, Ai, Hui, and Nie, Shaoping
Published: 2024
Full Text: View/download PDF

12. Complexities of peroxyacetyl nitrate photochemistry and its control strategies in contrasting environments in the Pearl River Delta region

Author: Liu, Tao, Wang, Yu, Cai, Huang, Wang, Hao, Zhang, Chunlin, Chen, Jun, Dai, Yan, Zhao, Wenlong, Li, Jiangyong, Gong, Daocheng, Chen, Duohong, Zhai, Yuhong, Zhou, Yan, Liao, Tong, and Wang, Boguang
Published: 2024
Full Text: View/download PDF

13. Editing Common Sense in Transformers

Author: Gupta, Anshita, Mondal, Debanjan, Sheshadri, Akshay Krishna, Zhao, Wenlong, Li, Xiang Lorraine, Wiegreffe, Sarah, and Tandon, Niket
Subjects: Computer Science - Computation and Language
Abstract: Editing model parameters directly in Transformers makes updating open-source transformer-based models possible without re-training (Meng et al., 2023). However, these editing methods have only been evaluated on statements about encyclopedic knowledge with a single correct answer. Commonsense knowledge with multiple correct answers, e.g., an apple can be green or red but not transparent, has not been studied but is as essential for enhancing transformers' reliability and usefulness. In this paper, we investigate whether commonsense judgments are causally associated with localized, editable parameters in Transformers, and we provide an affirmative answer. We find that directly applying the MEMIT editing algorithm results in sub-par performance and improve it for the commonsense domain by varying edit tokens and improving the layer selection strategy, i.e., $MEMIT_{CSK}$. GPT-2 Large and XL models edited using $MEMIT_{CSK}$ outperform best-fine-tuned baselines by 10.97% and 10.73% F1 scores on PEP3k and 20Q datasets. In addition, we propose a novel evaluation dataset, PROBE SET, that contains unaffected and affected neighborhoods, affected paraphrases, and affected reasoning challenges. $MEMIT_{CSK}$ performs well across the metrics while fine-tuning baselines show significant trade-offs between unaffected and affected metrics. These results suggest a compelling future direction for incorporating feedback about common sense into Transformers through direct model editing., Comment: Accepted to EMNLP 2023 Main Conference. Anshita, Debanjan, Akshay are co-first authors. Code and datasets for all experiments are available at https://github.com/anshitag/memit_csk
Published: 2023

14. Machine Reading Comprehension using Case-based Reasoning

Author: Thai, Dung, Agarwal, Dhruv, Chaudhary, Mudit, Zhao, Wenlong, Das, Rajarshi, Zaheer, Manzil, Lee, Jay-Yoon, Hajishirzi, Hannaneh, and McCallum, Andrew
Subjects: Computer Science - Computation and Language, Computer Science - Information Retrieval
Abstract: We present an accurate and interpretable method for answer extraction in machine reading comprehension that is reminiscent of case-based reasoning (CBR) from classical AI. Our method (CBR-MRC) builds upon the hypothesis that contextualized answers to similar questions share semantic similarities with each other. Given a test question, CBR-MRC first retrieves a set of similar cases from a nonparametric memory and then predicts an answer by selecting the span in the test context that is most similar to the contextualized representations of answers in the retrieved cases. The semi-parametric nature of our approach allows it to attribute a prediction to the specific set of evidence cases, making it a desirable choice for building reliable and debuggable QA systems. We show that CBR-MRC provides high accuracy comparable with large reader models and outperforms baselines by 11.5 and 8.4 EM on NaturalQuestions and NewsQA, respectively. Further, we demonstrate the ability of CBR-MRC in identifying not just the correct answer tokens but also the span with the most relevant supporting evidence. Lastly, we observe that contexts for certain question types show higher lexical diversity than others and find that CBR-MRC is robust to these variations while performance using fully-parametric methods drops., Comment: 9 pages, 2 figures
Published: 2023

15. SAD: Semi-Supervised Anomaly Detection on Dynamic Graphs

Author: Tian, Sheng, Dong, Jihai, Li, Jintang, Zhao, Wenlong, Xu, Xiaolong, wang, Baokun, Song, Bowen, Meng, Changhua, Zhang, Tianyi, and Chen, Liang
Subjects: Computer Science - Machine Learning, Computer Science - Social and Information Networks
Abstract: Anomaly detection aims to distinguish abnormal instances that deviate significantly from the majority of benign ones. As instances that appear in the real world are naturally connected and can be represented with graphs, graph neural networks become increasingly popular in tackling the anomaly detection problem. Despite the promising results, research on anomaly detection has almost exclusively focused on static graphs while the mining of anomalous patterns from dynamic graphs is rarely studied but has significant application value. In addition, anomaly detection is typically tackled from semi-supervised perspectives due to the lack of sufficient labeled data. However, most proposed methods are limited to merely exploiting labeled data, leaving a large number of unlabeled samples unexplored. In this work, we present semi-supervised anomaly detection (SAD), an end-to-end framework for anomaly detection on dynamic graphs. By a combination of a time-equipped memory bank and a pseudo-label contrastive learning module, SAD is able to fully exploit the potential of large unlabeled samples and uncover underlying anomalies on evolving graph streams. Extensive experiments on four real-world datasets demonstrate that SAD efficiently discovers anomalies from dynamic graphs and outperforms existing advanced methods even when provided with only little labeled data., Comment: Accepted to IJCAI'23. Code will be available at https://github.com/D10Andy/SAD
Published: 2023

16. GRANDE: a neural model over directed multigraphs with application to anti-money laundering

Author: Wu, Ruofan, Ma, Boqun, Jin, Hong, Zhao, Wenlong, Wang, Weiqiang, and Zhang, Tianyi
Subjects: Computer Science - Machine Learning
Abstract: The application of graph representation learning techniques to the area of financial risk management (FRM) has attracted significant attention recently. However, directly modeling transaction networks using graph neural models remains challenging: Firstly, transaction networks are directed multigraphs by nature, which could not be properly handled with most of the current off-the-shelf graph neural networks (GNN). Secondly, a crucial problem in FRM scenarios like anti-money laundering (AML) is to identify risky transactions and is most naturally cast into an edge classification problem with rich edge-level features, which are not fully exploited by the prevailing GNN design that follows node-centric message passing protocols. In this paper, we present a systematic investigation of design aspects of neural models over directed multigraphs and develop a novel GNN protocol that overcomes the above challenges via efficiently incorporating directional information, as well as proposing an enhancement that targets edge-related tasks using a novel message passing scheme over an extension of edge-to-node dual graph. A concrete GNN architecture called GRANDE is derived using the proposed protocol, with several further improvements and generalizations to temporal dynamic graphs. We apply the GRANDE model to both a real-world anti-money laundering task and public datasets. Experimental evaluations show the superiority of the proposed GRANDE architecture over recent state-of-the-art models on dynamic graph modeling and directed graph modeling., Comment: Accepted as regular paper at ICDM 2022
Published: 2023

17. ConReader: Exploring Implicit Relations in Contracts for Contract Clause Extraction

Author: Xu, Weiwen, Deng, Yang, Lei, Wenqiang, Zhao, Wenlong, Chua, Tat-Seng, and Lam, Wai
Subjects: Computer Science - Computation and Language
Abstract: We study automatic Contract Clause Extraction (CCE) by modeling implicit relations in legal contracts. Existing CCE methods mostly treat contracts as plain text, creating a substantial barrier to understanding contracts of high complexity. In this work, we first comprehensively analyze the complexity issues of contracts and distill out three implicit relations commonly found in contracts, namely, 1) Long-range Context Relation that captures the correlations of distant clauses; 2) Term-Definition Relation that captures the relation between important terms with their corresponding definitions; and 3) Similar Clause Relation that captures the similarities between clauses of the same type. Then we propose a novel framework ConReader to exploit the above three relations for better contract understanding and improving CCE. Experimental results show that ConReader makes the prediction more interpretable and achieves new state-of-the-art on two CCE tasks in both conventional and zero-shot settings., Comment: To appear at EMNLP 2022 main conference
Published: 2022

18. ezCoref: Towards Unifying Annotation Guidelines for Coreference Resolution

Author: Gupta, Ankita, Karpinska, Marzena, Zhao, Wenlong, Krishna, Kalpesh, Merullo, Jack, Yeh, Luke, Iyyer, Mohit, and O'Connor, Brendan
Subjects: Computer Science - Computation and Language
Abstract: Large-scale, high-quality corpora are critical for advancing research in coreference resolution. However, existing datasets vary in their definition of coreferences and have been collected via complex and lengthy guidelines that are curated for linguistic experts. These concerns have sparked a growing interest among researchers to curate a unified set of guidelines suitable for annotators with various backgrounds. In this work, we develop a crowdsourcing-friendly coreference annotation methodology, ezCoref, consisting of an annotation tool and an interactive tutorial. We use ezCoref to re-annotate 240 passages from seven existing English coreference datasets (spanning fiction, news, and multiple other domains) while teaching annotators only cases that are treated similarly across these datasets. Surprisingly, we find that reasonable quality annotations were already achievable (>90% agreement between the crowd and expert annotations) even without extensive training. On carefully analyzing the remaining disagreements, we identify the presence of linguistic cases that our annotators unanimously agree upon but lack unified treatments (e.g., generic pronouns, appositives) in existing datasets. We propose the research community should revisit these phenomena when curating future unified annotation guidelines., Comment: preprint (19 pages), code in https://github.com/gnkitaa/ezCoref
Published: 2022

19. Carbon Dots with Antioxidant Capacity for Detecting Glucose by Fluorescence and Repairing High-Glucose Damaged Glial Cells

Author: Zhao, Wenlong, Zhang, Menghan, Zhang, Liang, Deng, Xiaoqin, Wang, Yao, Chen, Yiping, and Weng, Shaohuang
Published: 2024
Full Text: View/download PDF

20. Endogenous Hormones Improve Lodging Tolerance of Maize (Zea mays L.) by Regulating Stalk Structure Under Elevated Temperature

Author: Wang, Jingjing, Li, Rongfa, Liu, Dongyao, Zhao, Wenlong, Miao, Zhengyan, Jia, Xucun, Dong, Pengfei, and Wang, Qun
Published: 2024
Full Text: View/download PDF

21. Antibiotic resistance genes in plague ecosystems: Threatening the emergence of resistant plague

Author: Ma, Yixin, Li, Shixiong, Liu, Jingpeng, Sun, Yuechen, Wu, Yi, Chen, Huangliang, Mao, Meihan, Zhao, Wenlong, Deng, Chengcheng, Gao, Jiefei, Zhang, Qitao, Quan, Jinrou, and Qi, Rui
Published: 2024
Full Text: View/download PDF

22. Nanoarchitectonics of few-layer Ni3Fe nanosheets embedded porous nitrogen-doped carbon derived from asphalt waste: An efficient electrocatalyst for oxygen evolution reaction

Author: Zhao, Wenlong, Shangguan, Huangbin, Liu, Zhiyi, Lin, Jing, Huang, Congshu, Chen, Hongyi, Zhang, Xiaoxiao, Wang, Jianzhi, and Yu, Faquan
Published: 2024
Full Text: View/download PDF

23. Microstructure evolution, elemental diffusion behavior, and bonding strength of TA1/AZ31B laminated composite fabricated by hot pressing

Author: Yang, Bin, Zhao, Wenlong, Guo, Xiaolin, Lu, Yunqi, Xiao, Hong, and Du, Jinlong
Published: 2024
Full Text: View/download PDF

24. Fast identification of flammable chemicals based on broad learning system

Author: Zhao, Wenlong, Wang, Xue, Li, Wang, Peng, Xiaoyan, Feng, Peter, Duan, Shukai, Wang, Lidan, and Chu, Jin
Published: 2024
Full Text: View/download PDF

25. A mixed gas concentration regression prediction method based on RESHA-ALW

Author: Wu, Jilong, Zhao, Wenlong, Wu, Fan, Yan, Jia, Feng, Peter, Cui, Hao, Duan, Shukai, and Peng, Xiaoyan
Published: 2024
Full Text: View/download PDF

26. High-precision regression prediction of HCHO concentration based on gas sensors and FPNet

Author: Ma, Ruilong, Wu, Fan, Yue, Jiaxin, Zhao, Wenlong, Yan, Jia, Cui, Hao, Feng, Peter, and Peng, Xiaoyan
Published: 2024
Full Text: View/download PDF

27. Closed wellbore integrity failure induced by casing corrosion based on solid-chemical coupling model in CO2 sequestration

Author: Peng, Haoyan, Xu, Zhao-Dong, Xia, Zhiheng, Zang, Xulei, Xi, Dongmin, Jiang, Xinmeng, Zhao, Wenlong, and Lu, Hongfang
Published: 2024
Full Text: View/download PDF

28. A VMD-LSTNet-Attention model for concentration prediction of mixed gases

Author: Gan, Wenchao, Ma, Ruilong, Zhao, Wenlong, Peng, Xiaoyan, Cui, Hao, Yan, Jia, Duan, Shukai, Wang, Lidan, Feng, Peter, and Chu, Jin
Published: 2025
Full Text: View/download PDF

29. Toward Compact Parameter Representations for Architecture-Agnostic Neural Network Compression

Author: Sun, Yuezhou, Zhao, Wenlong, Zhang, Lijun, Liu, Xiao, Guan, Hui, and Zaharia, Matei
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: This paper investigates deep neural network (DNN) compression from the perspective of compactly representing and storing trained parameters. We explore the previously overlooked opportunity of cross-layer architecture-agnostic representation sharing for DNN parameters. To do this, we decouple feedforward parameters from DNN architectures and leverage additive quantization, an extreme lossy compression method invented for image descriptors, to compactly represent the parameters. The representations are then finetuned on task objectives to improve task accuracy. We conduct extensive experiments on MobileNet-v2, VGG-11, ResNet-50, Feature Pyramid Networks, and pruned DNNs trained for classification, detection, and segmentation tasks. The conceptually simple scheme consistently outperforms iterative unstructured pruning. Applied to ResNet-50 with 76.1% top-1 accuracy on the ILSVRC12 classification challenge, it achieves a $7.2\times$ compression ratio with no accuracy loss and a $15.3\times$ compression ratio at 74.79% accuracy. Further analyses suggest that representation sharing can frequently happen across network layers and that learning shared representations for an entire DNN can achieve better accuracy at the same compression ratio than compressing the model as multiple separate parts. We release PyTorch code to facilitate DNN deployment on resource-constrained devices and spur future research on efficient representations and storage of DNN parameters.
Published: 2021

30. Experimental and numerical studies on influences of wedge reinforcement on seismic performance of loose penetrated mortise-tenon joints

Author: He, Junxiao, Liu, Ke, Xie, Linlin, Wang, Xiangjin, Yang, Wenyi, and Zhao, Wenlong
Published: 2024
Full Text: View/download PDF

31. Exploring the anti-ischemic stroke potential of wogonoside: Insights from Nrf2/Sirt3 signaling pathway and UPLC-TripleTOF-MS/MS-based metabolomics

Author: Xu, Di, Zhang, Lin, Meng, Huihui, Zhao, Wenlong, Hu, Ziyun, and Wang, Junsong
Published: 2024
Full Text: View/download PDF

32. Microplastics Aloft: A comprehensive exploration of sources, transport, variations, interactions and their implications on human health in the atmospheric realm

Author: Nafea, Taiseer Hussain, Chan, Faith Ka Shun, Xu, Yuyao, Wang, Chengjun, Wang, Xiaoyuan, Zhao, Wenlong, Ji, Dongsheng, Xiao, Hang, and He, Jun
Published: 2024
Full Text: View/download PDF

33. Spatiotemporal ozone concentration variations and their influencing factors in Zhejiang, China, 2018–2022: A geographically and temporally weighted regression

Author: Li, Yi, Hou, Qing, Xu, Bingye, Zhao, Wenlong, Ding, Jinmei, Jin, Lingling, Zou, Qiaoli, Tian, Xudong, Xu, Shengchen, Sun, Xin, and Chao, Na
Published: 2024
Full Text: View/download PDF

34. A novel model for predicting prolonged stay of patients with type-2 diabetes mellitus: a 13-year (2010–2022) multicenter retrospective case–control study

Author: Tan, Juntao, Zhang, Zhengyu, He, Yuxin, Yu, Yue, Zheng, Jing, Liu, Yunyu, Gong, Jun, Li, Jianjun, Wu, Xin, Zhang, Shengying, Lin, Xiantian, Zhao, Yuxi, Wu, Xiaoxin, Tang, Songjia, Chen, Jingjing, and Zhao, Wenlong
Published: 2023
Full Text: View/download PDF

35. A novel high accuracy fast gas detection algorithm based on multi-task learning

Author: Wang, Xue, Zhao, Wenlong, Ma, Ruilong, Zhuo, Junwei, Zeng, Yuanhu, Wu, Pengcheng, and Chu, Jin
Published: 2024
Full Text: View/download PDF

36. Model predictive motion control of blanket remote maintenance mover

Author: Li, Dongyi, Lu, Kun, Cheng, Yong, Wu, Huapeng, Handroos, Heikki, Zhao, Wenlong, Zhang, Xuanchen, Guo, Xinpeng, Yang, Songzhu, Zhang, Yu, and Pan, Hongtao
Published: 2024
Full Text: View/download PDF

37. IGA : An Intent-Guided Authoring Assistant

Author: Sun, Simeng, Zhao, Wenlong, Manjunatha, Varun, Jain, Rajiv, Morariu, Vlad, Dernoncourt, Franck, Srinivasan, Balaji Vasan, and Iyyer, Mohit
Subjects: Computer Science - Computation and Language
Abstract: While large-scale pretrained language models have significantly improved writing assistance functionalities such as autocomplete, more complex and controllable writing assistants have yet to be explored. We leverage advances in language modeling to build an interactive writing assistant that generates and rephrases text according to fine-grained author specifications. Users provide input to our Intent-Guided Assistant (IGA) in the form of text interspersed with tags that correspond to specific rhetorical directives (e.g., adding description or contrast, or rephrasing a particular sentence). We fine-tune a language model on a dataset heuristically-labeled with author intent, which allows IGA to fill in these tags with generated text that users can subsequently edit to their liking. A series of automatic and crowdsourced evaluations confirm the quality of IGA's generated outputs, while a small-scale user study demonstrates author preference for IGA over baseline methods in a creative writing task. We release our dataset, code, and demo to spur further research into AI-assisted writing., Comment: EMNLP2021
Published: 2021

38. Intravenous Tirofiban Versus Alteplase Before Endovascular Treatment in Acute Ischemic Stroke: A Pooled Analysis of the DEVT and RESCUE BT Trials

Author: Sang, Hongfei, Cao, Zhihua, Du, Jie, Nguyen, Thanh N., Saver, Jeffrey L., Mao, An, Nogueira, Raul G., Tao, Zhaojun, Zhou, Simin, Han, Qin, Sun, Dong, Lei, Bo, Liu, Shudong, Zeng, Guoyong, Yin, Congguo, Xie, Dongjing, Luo, Weidong, Jin, Zhenglong, Qiu, Zhongming, Yang, Qingwu, Wu, Deping, Shi, Zhonghua, Huang, Wenguo, Liu, Wenhua, Qiu, Tao, Zhou, Peiyang, Wang, Li, Fu, Xinmin, Yang, Shiquan, Zhang, Shuai, Zhou, Zhiming, Wang, Yan, Luo, Jun, Bai, Yongjie, Zhang, Min, Wu, Youlin, Wan, Yue, Wen, Changming, Wen, Hongbin, Ling, Wentong, Chen, Zhuo., Peng, Miao, Ai, Zhibing, Guo, Fuqiang, Guo, Jing, Guan, Haitao, Wang, Zhiyi, Liu, Yong, Pu, Jie, Wang, Zhen, Yang, Qingwu, Liu, Wenhua, Li, Huagang, Huang, Wenguo, Zhang, Min, Zhou, Peiyang, Zhou, Zhiming, Song, Bo, Wen, Changming, Yuan, Guangxiong, Luo, Jun, Zheng, Chong, Wang, Shouchun, Wang, Tao, Qi, Li, Wan, Yue, Wu, Youlin, Peng, Daizhou, Chang, Mingze, Shi, Zhonghua, Jin, Ying, Huang, Chuming, Liu, Tianzhu, Zhao, Wenlong, Wang, Jian, Wang, Li, Liu, Shugai, Luo, Shiwei, Pu, Jie, Chen, Shengli, Sun, Yaxuan, Jiang, Shunfu, Wei, Liping, Fu, Xinmin, Bai, Yongjie, Yang, Shunyu, Hu, Wei, Zhang, Guling, Pan, Chengde, Zhang, Shuai, Wang, Yan, Cao, Wenfeng, Yang, Shiquan, Zhang, Jun, Guo, Fuqiang, Wen, Hongbin, Zhang, Jinhua, and Lu, Mengjie
Published: 2024
Full Text: View/download PDF

39. Crack propagation and mechanical properties simulation of G/(HfNbTaTiZr)C–Al2O3 gradient composites

Author: Cao, Zhennan, Sun, Jialin, Zhang, Keguo, Zhao, Wenlong, and Yu, Ziqian
Published: 2024
Full Text: View/download PDF

40. Association between intravenous tirofiban and intracranial hemorrhage in acute large vessel occlusion stroke: insight from the RESCUE BT randomized placebo-controlled trial

Author: Sang, Hongfei, Huang, Jiacheng, Jiang, Bingwu, Guo, Qifeng, Nguyen, Thanh N., Abdalkader, Mohamad, Han, Qin, Zhou, Simin, Tao, Zhaojun, Mao, An, Yan, Zhizhong, Du, Jie, Jin, Ying, Huang, Chuming, Liu, Tianzhu, Zhao, Wenlong, Gu, Gangfeng, Wang, Li, Liu, Shugai, Luo, Shiwei, Pu, Jie, Hu, Jinrong, Yang, Jie, Li, Fengli, Zi, Wenjie, Hu, Xiaogang, and Qiu, Zhongming
Published: 2023
Full Text: View/download PDF

41. Compressing Transformer-Based Semantic Parsing Models using Compositional Code Embeddings

Author: Prakash, Prafull, Shashidhar, Saurabh Kumar, Zhao, Wenlong, Rongali, Subendhu, Khan, Haidar, and Kayser, Michael
Subjects: Computer Science - Computation and Language
Abstract: The current state-of-the-art task-oriented semantic parsing models use BERT or RoBERTa as pretrained encoders; these models have huge memory footprints. This poses a challenge to their deployment for voice assistants such as Amazon Alexa and Google Assistant on edge devices with limited memory budgets. We propose to learn compositional code embeddings to greatly reduce the sizes of BERT-base and RoBERTa-base. We also apply the technique to DistilBERT, ALBERT-base, and ALBERT-large, three already compressed BERT variants which attain similar state-of-the-art performances on semantic parsing with much smaller model sizes. We observe 95.15% ~ 98.46% embedding compression rates and 20.47% ~ 34.22% encoder compression rates, while preserving greater than 97.5% semantic parsing performances. We provide the recipe for training and analyze the trade-off between code embedding sizes and downstream performances., Comment: Accepted at EMNLP 2020 (Findings); 7 Pages
Published: 2020

42. Global Impact of COVID-19 on Stroke Care and IV Thrombolysis.

Author: Nogueira, Raul, Qureshi, Muhammad, Abdalkader, Mohamad, Martins, Sheila, Yamagami, Hiroshi, Qiu, Zhongming, Mansour, Ossama, Sathya, Anvitha, Czlonkowska, Anna, Tsivgoulis, Georgios, Aguiar de Sousa, Diana, Demeestere, Jelle, Mikulik, Robert, Vanacker, Peter, Siegler, James, Kõrv, Janika, Biller, Jose, Liang, Conrad, Sangha, Navdeep, Zha, Alicia, Czap, Alexandra, Holmstedt, Christine, Turan, Tanya, Ntaios, George, Malhotra, Konark, Tayal, Ashis, Loochtan, Aaron, Ranta, Annamarei, Mistry, Eva, Alexandrov, Anne, Huang, David, Yaghi, Shadi, Raz, Eytan, Sheth, Sunil, Mohammaden, Mahmoud, Frankel, Michael, Bila Lamou, Eric, Aref, Hany, Elbassiouny, Ahmed, Hassan, Farouk, Menecie, Tarek, Mustafa, Wessam, Shokri, Hossam, Roushdy, Tamer, Sarfo, Fred, Alabi, Tolulope, Arabambi, Babawale, Nwazor, Ernest, Sunmonu, Taofiki, Wahab, Kolawole, Yaria, Joseph, Mohammed, Haytham, Adebayo, Philip, Riahi, Anis, Sassi, Samia, Gwaunza, Lenon, Ngwende, Gift, Sahakyan, David, Rahman, Aminur, Ai, Zhibing, Bai, Fanghui, Duan, Zhenhui, Hao, Yonggang, Huang, Wenguo, Li, Guangwen, Li, Wei, Liu, Ganzhe, Luo, Jun, Shang, Xianjin, Sui, Yi, Tian, Ling, Wen, Hongbin, Wu, Bo, Yan, Yuying, Yuan, Zhengzhou, Zhang, Hao, Zhang, Jun, Zhao, Wenlong, Zi, Wenjie, Leung, Thomas, Chugh, Chandril, Huded, Vikram, Menon, Bindu, Pandian, Jeyaraj, Sylaja, P, Usman, Fritz, Farhoudi, Mehdi, Hokmabadi, Elyar, Horev, Anat, Reznik, Anna, Sivan Hoffmann, Rotem, Ohara, Nobuyuki, Sakai, Nobuyuki, Watanabe, Daisuke, Yamamoto, Ryoo, Doijiri, Ryosuke, Tokuda, Naoki, Yamada, Takehiro, Terasaki, Tadashi, and Yazawa, Yukako
Subjects: COVID-19, Cross-Sectional Studies, Hospitalization, Humans, Pandemics, Retrospective Studies, SARS-CoV-2, Stroke, Thrombolytic Therapy
Abstract: OBJECTIVE: To measure the global impact of COVID-19 pandemic on volumes of IV thrombolysis (IVT), IVT transfers, and stroke hospitalizations over 4 months at the height of the pandemic (March 1 to June 30, 2020) compared with 2 control 4-month periods. METHODS: We conducted a cross-sectional, observational, retrospective study across 6 continents, 70 countries, and 457 stroke centers. Diagnoses were identified by their ICD-10 codes or classifications in stroke databases. RESULTS: There were 91,373 stroke admissions in the 4 months immediately before compared to 80,894 admissions during the pandemic months, representing an 11.5% (95% confidence interval [CI] -11.7 to -11.3, p < 0.0001) decline. There were 13,334 IVT therapies in the 4 months preceding compared to 11,570 procedures during the pandemic, representing a 13.2% (95% CI -13.8 to -12.7, p < 0.0001) drop. Interfacility IVT transfers decreased from 1,337 to 1,178, or an 11.9% decrease (95% CI -13.7 to -10.3, p = 0.001). Recovery of stroke hospitalization volume (9.5%, 95% CI 9.2-9.8, p < 0.0001) was noted over the 2 later (May, June) vs the 2 earlier (March, April) pandemic months. There was a 1.48% stroke rate across 119,967 COVID-19 hospitalizations. Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection was noted in 3.3% (1,722/52,026) of all stroke admissions. CONCLUSIONS: The COVID-19 pandemic was associated with a global decline in the volume of stroke hospitalizations, IVT, and interfacility IVT transfers. Primary stroke centers and centers with higher COVID-19 inpatient volumes experienced steeper declines. Recovery of stroke hospitalization was noted in the later pandemic months.
Published: 2021

43. Ground-based formaldehyde across the Pearl River Delta: A snapshot and meta-analysis study

Author: Mo, Xujun, Gong, Daocheng, Liu, Yunfeng, Li, Jiangyong, Zhao, Yiming, Zhao, Wenlong, Shen, Jin, Liao, Tong, Wang, Hao, and Wang, Boguang
Published: 2023
Full Text: View/download PDF

44. Effects of Spinning Process on Intergranular Corrosion Behavior of 5A06 Aluminum Alloy

Author: Zhang, Qianjun, Fu, Yongkang, Zhao, Wenlong, Li, Qing, Zhang, Rulin, Guo, Yong, and Li, Rongbin
Published: 2023
Full Text: View/download PDF

45. Dynamics analysis and vibration suppression of a spatial rigid-flexible link manipulator based on transfer matrix method of multibody system

Author: Shi, Mingming, Rong, Bao, Liang, Jing, Zhao, Wenlong, and Pan, Hongtao
Published: 2023
Full Text: View/download PDF

46. Rethinking Exposure Bias In Language Modeling

Author: Xu, Yifan, Zhang, Kening, Dong, Haoyu, Sun, Yuezhou, Zhao, Wenlong, and Tu, Zhuowen
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Exposure bias describes the phenomenon that a language model trained under the teacher forcing schema may perform poorly at the inference stage when its predictions are conditioned on its previous predictions unseen from the training corpus. Recently, several generative adversarial networks (GANs) and reinforcement learning (RL) methods have been introduced to alleviate this problem. Nonetheless, a common issue in RL and GANs training is the sparsity of reward signals. In this paper, we adopt two simple strategies, multi-range reinforcing, and multi-entropy sampling, to amplify and denoise the reward signal. Our model produces an improvement over competing models with regards to BLEU scores and road exam, a new metric we designed to measure the robustness against exposure bias in language models.
Published: 2019

47. Visualization of Thromboinflammation by 18F-DPA-714 PET in a Stroke Patient

Author: Lin, Qianqian, Wang, Youliang, Zhao, Wenlong, and Yao, Shaobo
Published: 2023
Full Text: View/download PDF

48. Research on fault diagnosis method for hydraulic system of CFETR blanket transfer device based on CNN-LSTM

Author: Guo, Xinpeng, Lu, Kun, Cheng, Yong, Zhao, Wenlong, Wu, Huapeng, Li, Dongyi, Li, Junwei, Yang, Songzhu, and Zhang, Yu
Published: 2022
Full Text: View/download PDF

49. A transcriptome-based analysis reveals functional differences among Dendrobium officinale Kimura & Migo species from different growing regions and with different quality levels

Author: Dang, Honglei, Zhang, Xiaoling, Ma, Chengmei, Zhao, Wenlong, Guo, Hongyan, Huang, Guoliang, and Xie, Lan
Published: 2022
Full Text: View/download PDF

50. Missing data imputation framework for bridge structural health monitoring based on slim generative adversarial networks

Author: Gao, Shuai, Zhao, Wenlong, Wan, Chunfeng, Jiang, Huachen, Ding, Youliang, and Xue, Songtao
Published: 2022
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

498 results on '"Zhao, Wenlong"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources