Author: "Ke, Pei" / Publication Year Range: Last 10 years - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Ke, Pei"' showing total 458 results

Start Over Author "Ke, Pei" Publication Year Range Last 10 years

458 results on '"Ke, Pei"'

1. Benchmarking Complex Instruction-Following with Multiple Constraints Composition

Author: Wen, Bosi, Ke, Pei, Gu, Xiaotao, Wu, Lindong, Huang, Hao, Zhou, Jinfeng, Li, Wenchuang, Hu, Binxin, Gao, Wendy, Xu, Jiaxin, Liu, Yiming, Tang, Jie, Wang, Hongning, and Huang, Minlie
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Instruction following is one of the fundamental capabilities of large language models (LLMs). As the ability of LLMs is constantly improving, they have been increasingly applied to deal with complex human instructions in real-world scenarios. Therefore, how to evaluate the ability of complex instruction-following of LLMs has become a critical research problem. Existing benchmarks mainly focus on modeling different types of constraints in human instructions while neglecting the composition of different constraints, which is an indispensable constituent in complex instructions. To this end, we propose ComplexBench, a benchmark for comprehensively evaluating the ability of LLMs to follow complex instructions composed of multiple constraints. We propose a hierarchical taxonomy for complex instructions, including 4 constraint types, 19 constraint dimensions, and 4 composition types, and manually collect a high-quality dataset accordingly. To make the evaluation reliable, we augment LLM-based evaluators with rules to effectively verify whether generated texts can satisfy each constraint and composition. Furthermore, we obtain the final evaluation score based on the dependency structure determined by different composition types. ComplexBench identifies significant deficiencies in existing LLMs when dealing with complex instructions with multiple constraints composition., Comment: NeurIPS 2024 Datasets and Benchmarks Track
Published: 2024

2. Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks

Author: Zhang, Zhexin, Yang, Junxiao, Ke, Pei, Cui, Shiyao, Zheng, Chujie, Wang, Hongning, and Huang, Minlie
Subjects: Computer Science - Cryptography and Security, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: LLMs are known to be vulnerable to jailbreak attacks, even after safety alignment. An important observation is that, while different types of jailbreak attacks can generate significantly different queries, they mostly result in similar responses that are rooted in the same harmful knowledge (e.g., detailed steps to make a bomb). Therefore, we conjecture that directly unlearn the harmful knowledge in the LLM can be a more effective way to defend against jailbreak attacks than the mainstream supervised fine-tuning (SFT) based approaches. Our extensive experiments confirmed our insight and suggested surprising generalizability of our unlearning-based approach: using only 20 raw harmful questions \emph{without} any jailbreak prompt during training, our solution reduced the Attack Success Rate (ASR) in Vicuna-7B on \emph{out-of-distribution} (OOD) harmful questions wrapped with various complex jailbreak prompts from 82.6\% to 7.7\%. This significantly outperforms Llama2-7B-Chat, which is fine-tuned on about 0.1M safety alignment samples but still has an ASR of 21.9\% even under the help of an additional safety system prompt. Further analysis reveals that the generalization ability of our solution stems from the intrinsic relatedness among harmful responses across harmful questions (e.g., response patterns, shared steps and actions, and similarity among their learned representations in the LLM). Our code is available at \url{https://github.com/thu-coai/SafeUnlearning}., Comment: 15 pages
Published: 2024

3. AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models

Author: Cheng, Jiale, Lu, Yida, Gu, Xiaotao, Ke, Pei, Liu, Xiao, Dong, Yuxiao, Wang, Hongning, Tang, Jie, and Huang, Minlie
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Although Large Language Models (LLMs) are becoming increasingly powerful, they still exhibit significant but subtle weaknesses, such as mistakes in instruction-following or coding tasks. As these unexpected errors could lead to severe consequences in practical deployments, it is crucial to investigate the limitations within LLMs systematically. Traditional benchmarking approaches cannot thoroughly pinpoint specific model deficiencies, while manual inspections are costly and not scalable. In this paper, we introduce a unified framework, AutoDetect, to automatically expose weaknesses in LLMs across various tasks. Inspired by the educational assessment process that measures students' learning outcomes, AutoDetect consists of three LLM-powered agents: Examiner, Questioner, and Assessor. The collaboration among these three agents is designed to realize comprehensive and in-depth weakness identification. Our framework demonstrates significant success in uncovering flaws, with an identification success rate exceeding 30% in prominent models such as ChatGPT and Claude. More importantly, these identified weaknesses can guide specific model improvements, proving more effective than untargeted data augmentation methods like Self-Instruct. Our approach has led to substantial enhancements in popular LLMs, including the Llama series and Mistral-7b, boosting their performance by over 10% across several benchmarks. Code and data are publicly available at https://github.com/thu-coai/AutoDetect.
Published: 2024

4. Learning Task Decomposition to Assist Humans in Competitive Programming

Author: Wen, Jiaxin, Zhong, Ruiqi, Ke, Pei, Shao, Zhihong, Wang, Hongning, and Huang, Minlie
Subjects: Computer Science - Computation and Language, Computer Science - Programming Languages
Abstract: When using language models (LMs) to solve complex problems, humans might struggle to understand the LM-generated solutions and repair the flawed ones. To assist humans in repairing them, we propose to automatically decompose complex solutions into multiple simpler pieces that correspond to specific subtasks. We introduce a novel objective for learning task decomposition, termed assistive value (AssistV), which measures the feasibility and speed for humans to repair the decomposed solution. We collect a dataset of human repair experiences on different decomposed solutions. Utilizing the collected data as in-context examples, we then learn to critique, refine, and rank decomposed solutions to improve AssistV. We validate our method under competitive programming problems: under 177 hours of human study, our method enables non-experts to solve 33.3\% more problems, speeds them up by 3.3x, and empowers them to match unassisted experts., Comment: ACL 2024 Main Conference
Published: 2024

5. Perception of Knowledge Boundary for Large Language Models through Semi-open-ended Question Answering

Author: Wen, Zhihua, Tian, Zhiliang, Jian, Zexin, Huang, Zhen, Ke, Pei, Gao, Yifu, Huang, Minlie, and Li, Dongsheng
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Large Language Models (LLMs) are widely used for knowledge-seeking yet suffer from hallucinations. The knowledge boundary (KB) of an LLM limits its factual understanding, beyond which it may begin to hallucinate. Investigating the perception of LLMs' KB is crucial for detecting hallucinations and LLMs' reliable generation. Current studies perceive LLMs' KB on questions with a concrete answer (close-ended questions) while paying limited attention to semi-open-ended questions (SoeQ) that correspond to many potential answers. Some researchers achieve it by judging whether the question is answerable or not. However, this paradigm is unsuitable for SoeQ, which are usually partially answerable, containing both answerable and ambiguous (unanswerable) answers. Ambiguous answers are essential for knowledge-seeking, but they may go beyond the KB of LLMs. In this paper, we perceive the LLMs' KB with SoeQ by discovering more ambiguous answers. First, we apply an LLM-based approach to construct SoeQ and obtain answers from a target LLM. Unfortunately, the output probabilities of mainstream black-box LLMs are inaccessible to sample for low-probability ambiguous answers. Therefore, we apply an open-sourced auxiliary model to explore ambiguous answers for the target LLM. We calculate the nearest semantic representation for existing answers to estimate their probabilities, with which we reduce the generation probability of high-probability answers to achieve a more effective generation. Finally, we compare the results from the RAG-based evaluation and LLM self-evaluation to categorize four types of ambiguous answers that are beyond the KB of the target LLM. Following our method, we construct a dataset to perceive the KB for GPT-4. We find that GPT-4 performs poorly on SoeQ and is often unaware of its KB. Besides, our auxiliary model, LLaMA-2-13B, is effective in discovering more ambiguous answers.
Published: 2024

6. Establishing Classifiers With Clinical Laboratory Indicators to Distinguish COVID-19 From Community-Acquired Pneumonia: Retrospective Cohort Study

Author: Dai, Wanfa, Ke, Pei-Feng, Li, Zhen-Zhen, Zhuang, Qi-Zhen, Huang, Wei, Wang, Yi, Xiong, Yujuan, and Huang, Xian-Zhang
Subjects: Computer applications to medicine. Medical informatics, R858-859.7, Public aspects of medicine, RA1-1270
Abstract: BackgroundThe initial symptoms of patients with COVID-19 are very much like those of patients with community-acquired pneumonia (CAP); it is difficult to distinguish COVID-19 from CAP with clinical symptoms and imaging examination. ObjectiveThe objective of our study was to construct an effective model for the early identification of COVID-19 that would also distinguish it from CAP. MethodsThe clinical laboratory indicators (CLIs) of 61 COVID-19 patients and 60 CAP patients were analyzed retrospectively. Random combinations of various CLIs (ie, CLI combinations) were utilized to establish COVID-19 versus CAP classifiers with machine learning algorithms, including random forest classifier (RFC), logistic regression classifier, and gradient boosting classifier (GBC). The performance of the classifiers was assessed by calculating the area under the receiver operating characteristic curve (AUROC) and recall rate in COVID-19 prediction using the test data set. ResultsThe classifiers that were constructed with three algorithms from 43 CLI combinations showed high performance (recall rate >0.9 and AUROC >0.85) in COVID-19 prediction for the test data set. Among the high-performance classifiers, several CLIs showed a high usage rate; these included procalcitonin (PCT), mean corpuscular hemoglobin concentration (MCHC), uric acid, albumin, albumin to globulin ratio (AGR), neutrophil count, red blood cell (RBC) count, monocyte count, basophil count, and white blood cell (WBC) count. They also had high feature importance except for basophil count. The feature combination (FC) of PCT, AGR, uric acid, WBC count, neutrophil count, basophil count, RBC count, and MCHC was the representative one among the nine FCs used to construct the classifiers with an AUROC equal to 1.0 when using the RFC or GBC algorithms. Replacing any CLI in these FCs would lead to a significant reduction in the performance of the classifiers that were built with them. ConclusionsThe classifiers constructed with only a few specific CLIs could efficiently distinguish COVID-19 from CAP, which could help clinicians perform early isolation and centralized management of COVID-19 patients.
Published: 2021
Full Text: View/download PDF

7. ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors

Author: Zhang, Zhexin, Lu, Yida, Ma, Jingyuan, Zhang, Di, Li, Rui, Ke, Pei, Sun, Hao, Sha, Lei, Sui, Zhifang, Wang, Hongning, and Huang, Minlie
Subjects: Computer Science - Computation and Language
Abstract: The safety of Large Language Models (LLMs) has gained increasing attention in recent years, but there still lacks a comprehensive approach for detecting safety issues within LLMs' responses in an aligned, customizable and explainable manner. In this paper, we propose ShieldLM, an LLM-based safety detector, which aligns with general human safety standards, supports customizable detection rules, and provides explanations for its decisions. To train ShieldLM, we compile a large bilingual dataset comprising 14,387 query-response pairs, annotating the safety of responses based on various safety standards. Through extensive experiments, we demonstrate that ShieldLM surpasses strong baselines across four test sets, showcasing remarkable customizability and explainability. Besides performing well on standard detection datasets, ShieldLM has also been shown to be effective in real-world situations as a safety evaluator for advanced LLMs. We release ShieldLM at \url{https://github.com/thu-coai/ShieldLM} to support accurate and explainable safety detection under various safety standards, contributing to the ongoing efforts to enhance the safety of LLMs., Comment: 17 pages
Published: 2024

8. Towards Efficient Exact Optimization of Language Model Alignment

Author: Ji, Haozhe, Lu, Cheng, Niu, Yilin, Ke, Pei, Wang, Hongning, Zhu, Jun, Tang, Jie, and Huang, Minlie
Subjects: Computer Science - Computation and Language
Abstract: The alignment of language models with human preferences is vital for their application in real-world tasks. The problem is formulated as optimizing the model's policy to maximize the expected reward that reflects human preferences with minimal deviation from the initial policy. While considered as a straightforward solution, reinforcement learning (RL) suffers from high variance in policy updates, which impedes efficient policy improvement. Recently, direct preference optimization (DPO) was proposed to directly optimize the policy from preference data. However, we show that DPO derived based on the optimal solution of the problem leads to a compromised mean-seeking approximation of the optimal solution in practice. In this paper, we propose efficient exact optimization (EXO) of the alignment objective. EXO is guaranteed to optimize in the same direction as RL algorithms asymptotically for arbitrary policy parametrization. This leads to the same mode-seeking solution, while enables efficient optimization by circumventing the complexities of RL. We also compare our method to DPO with both theoretical and empirical analyses, and further demonstrate the advantages of our method over existing approaches on realistic human preference data. Code is available at https://github.com/haozheji/exact-optimization., Comment: 24 pages, 9 figures
Published: 2024

9. Assessing the range of blasting-induced cracks in the surrounding rock of deeply buried tunnels based on the unified strength theory

Author: Li, Liang, Chen, Jia-jun, Zhao, Lian-heng, He, Ke-pei, Hu, Shi-hong, and Li, Hua-long
Published: 2024
Full Text: View/download PDF

10. CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation

Author: Ke, Pei, Wen, Bosi, Feng, Zhuoer, Liu, Xiao, Lei, Xuanyu, Cheng, Jiale, Wang, Shengyuan, Zeng, Aohan, Dong, Yuxiao, Wang, Hongning, Tang, Jie, and Huang, Minlie
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Since the natural language processing (NLP) community started to make large language models (LLMs) act as a critic to evaluate the quality of generated texts, most of the existing works train a critique generation model on the evaluation data labeled by GPT-4's direct prompting. We observe that these models lack the ability to generate informative critiques in both pointwise grading and pairwise comparison especially without references. As a result, their generated critiques cannot provide fine-grained distinguishability on generated texts, causing unsatisfactory evaluation performance. In this paper, we propose a simple yet effective method called Eval-Instruct, which can first acquire pointwise grading critiques with pseudo references and then revise these critiques via multi-path prompting to obtain informative evaluation data in different tasks and settings, including pointwise grading and pairwise comparison with / without references. After fine-tuning on these data, the resulting model CritiqueLLM is empirically shown to outperform ChatGPT and all the open-source baselines and even achieve comparable evaluation performance to GPT-4 in system-level correlations of pointwise grading. We also demonstrate that our generated critiques can act as scalable feedback to further improve the generation quality of strong LLMs like ChatGPT., Comment: Accepted by ACL 2024 (Main Conference)
Published: 2023

11. AlignBench: Benchmarking Chinese Alignment of Large Language Models

Author: Liu, Xiao, Lei, Xuanyu, Wang, Shengyuan, Huang, Yue, Feng, Zhuoer, Wen, Bosi, Cheng, Jiale, Ke, Pei, Xu, Yifan, Tam, Weng Lam, Zhang, Xiaohan, Sun, Lichao, Gu, Xiaotao, Wang, Hongning, Zhang, Jing, Huang, Minlie, Dong, Yuxiao, and Tang, Jie
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Alignment has become a critical step for instruction-tuned Large Language Models (LLMs) to become helpful assistants. However, the effective evaluation of alignment for emerging Chinese LLMs is still largely unexplored. To fill in this gap, we introduce AlignBench, a comprehensive multi-dimensional benchmark for evaluating LLMs' alignment in Chinese. We design a human-in-the-loop data curation pipeline, containing eight main categories, 683 real-scenario rooted queries and corresponding human verified references. To ensure the correctness of references, each knowledge-intensive query is accompanied with evidences collected from reliable web sources (including URLs and quotations) by our annotators. For automatic evaluation, our benchmark employs a rule-calibrated multi-dimensional LLM-as-Judge~\cite{zheng2023judging} approach with Chain-of-Thought to generate explanations and final ratings, ensuring high reliability and interpretability. All evaluation code, data, and LLM generations are available at \url{https://github.com/THUDM/AlignBench}. Since its release, AlignBench has been adopted by top (Chinese) LLMs for evaluating their alignment capabilities in Chinese, including ChatGLM, Qwen, DeepSeek, Yi, Baichuan, and Abab., Comment: Accepted to ACL 2024
Published: 2023

12. Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization

Author: Zhang, Zhexin, Yang, Junxiao, Ke, Pei, Mi, Fei, Wang, Hongning, and Huang, Minlie
Subjects: Computer Science - Computation and Language
Abstract: While significant attention has been dedicated to exploiting weaknesses in LLMs through jailbreaking attacks, there remains a paucity of effort in defending against these attacks. We point out a pivotal factor contributing to the success of jailbreaks: the intrinsic conflict between the goals of being helpful and ensuring safety. Accordingly, we propose to integrate goal prioritization at both training and inference stages to counteract. Implementing goal prioritization during inference substantially diminishes the Attack Success Rate (ASR) of jailbreaking from 66.4% to 3.6% for ChatGPT. And integrating goal prioritization into model training reduces the ASR from 71.0% to 6.6% for Llama2-13B. Remarkably, even in scenarios where no jailbreaking samples are included during training, our approach slashes the ASR by half. Additionally, our findings reveal that while stronger LLMs face greater safety risks, they also possess a greater capacity to be steered towards defending against such attacks, both because of their stronger ability in instruction following. Our work thus contributes to the comprehension of jailbreaking attacks and defenses, and sheds light on the relationship between LLMs' capability and safety. Our code is available at \url{https://github.com/thu-coai/JailbreakDefense_GoalPriority}., Comment: ACL 2024 Main Conference
Published: 2023

13. Black-Box Prompt Optimization: Aligning Large Language Models without Model Training

Author: Cheng, Jiale, Liu, Xiao, Zheng, Kehan, Ke, Pei, Wang, Hongning, Dong, Yuxiao, Tang, Jie, and Huang, Minlie
Subjects: Computer Science - Computation and Language
Abstract: Large language models (LLMs) have shown impressive success in various applications. However, these models are often not well aligned with human intents, which calls for additional treatments on them; that is, the alignment problem. To make LLMs better follow user instructions, existing alignment methods primarily focus on further training them. However, the extra training of LLMs is usually expensive in terms of GPU computing; even worse, some LLMs are not accessible for user-demanded training, such as GPTs. In this work, we take a different perspective -- Black-Box Prompt Optimization (BPO) -- to perform alignments. The idea is to optimize user prompts to suit LLMs' input understanding, so as to best realize users' intents without updating LLMs' parameters. BPO leverages human preferences to optimize prompts, thus making it superior to LLM (e.g., ChatGPT) as a prompt engineer. Moreover, BPO is model-agnostic, and the empirical results demonstrate that the BPO-aligned ChatGPT yields a 22% increase in the win rate against its original version and 10% for GPT-4. Notably, the BPO-aligned LLMs can outperform the same models aligned by PPO and DPO, and it also brings additional performance gains when combining BPO with PPO or DPO. Code and datasets are released at https://github.com/thu-coai/BPO., Comment: Accepted to ACL 2024
Published: 2023

14. Language Model Decoding as Direct Metrics Optimization

Author: Ji, Haozhe, Ke, Pei, Wang, Hongning, and Huang, Minlie
Subjects: Computer Science - Computation and Language
Abstract: Despite the remarkable advances in language modeling, current mainstream decoding methods still struggle to generate texts that align with human texts across different aspects. In particular, sampling-based methods produce less-repetitive texts which are often disjunctive in discourse, while search-based methods maintain topic coherence at the cost of increased repetition. Overall, these methods fall short in achieving holistic alignment across a broad range of aspects. In this work, we frame decoding from a language model as an optimization problem with the goal of strictly matching the expected performance with human texts measured by multiple metrics of desired aspects simultaneously. The resulting decoding distribution enjoys an analytical solution that scales the input language model distribution via a sequence-level energy function defined by these metrics. And most importantly, we prove that this induced distribution is guaranteed to improve the perplexity on human texts, which suggests a better approximation to the underlying distribution of human texts. To facilitate tractable sampling from this globally normalized distribution, we adopt the Sampling-Importance-Resampling technique. Experiments on various domains and model scales demonstrate the superiority of our method in metrics alignment with human texts and human evaluation over strong baselines., Comment: 33 pages, 3 figures
Published: 2023

15. DecompEval: Evaluating Generated Texts as Unsupervised Decomposed Question Answering

Author: Ke, Pei, Huang, Fei, Mi, Fei, Wang, Yasheng, Liu, Qun, Zhu, Xiaoyan, and Huang, Minlie
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Existing evaluation metrics for natural language generation (NLG) tasks face the challenges on generalization ability and interpretability. Specifically, most of the well-performed metrics are required to train on evaluation datasets of specific NLG tasks and evaluation dimensions, which may cause over-fitting to task-specific datasets. Furthermore, existing metrics only provide an evaluation score for each dimension without revealing the evidence to interpret how this score is obtained. To deal with these challenges, we propose a simple yet effective metric called DecompEval. This metric formulates NLG evaluation as an instruction-style question answering task and utilizes instruction-tuned pre-trained language models (PLMs) without training on evaluation datasets, aiming to enhance the generalization ability. To make the evaluation process more interpretable, we decompose our devised instruction-style question about the quality of generated texts into the subquestions that measure the quality of each sentence. The subquestions with their answers generated by PLMs are then recomposed as evidence to obtain the evaluation result. Experimental results show that DecompEval achieves state-of-the-art performance in untrained metrics for evaluating text summarization and dialogue generation, which also exhibits strong dimension-level / task-level generalization ability and interpretability., Comment: Accepted by ACL 2023 (Main Conference)
Published: 2023

16. Click: Controllable Text Generation with Sequence Likelihood Contrastive Learning

Author: Zheng, Chujie, Ke, Pei, Zhang, Zheng, and Huang, Minlie
Subjects: Computer Science - Computation and Language
Abstract: It has always been an important yet challenging problem to control language models to avoid generating texts with undesirable attributes, such as toxic language and unnatural repetition. We introduce Click for controllable text generation, which needs no modification to the model architecture and facilitates out-of-the-box use of trained models. It employs a contrastive loss on sequence likelihood, which fundamentally decreases the generation probability of negative samples (i.e., generations with undesirable attributes). It also adopts a novel likelihood ranking-based strategy to construct contrastive samples from model generations. On the tasks of language detoxification, sentiment steering, and repetition reduction, we show that Click outperforms strong baselines of controllable text generation and demonstrate the superiority of Click's sample construction strategy., Comment: Findings of ACL 2023
Published: 2023

17. Directed Acyclic Transformer Pre-training for High-quality Non-autoregressive Text Generation

Author: Huang, Fei, Ke, Pei, and Huang, Minlie
Subjects: Computer Science - Computation and Language
Abstract: Non-AutoRegressive (NAR) text generation models have drawn much attention because of their significantly faster decoding speed and good generation quality in machine translation. However, in a wider range of text generation tasks, existing NAR models lack proper pre-training, making them still far behind the pre-trained autoregressive models. In this paper, we propose Pre-trained Directed Acyclic Transformer (PreDAT) and a novel pre-training task to promote prediction consistency in NAR generation. Experiments on five text generation tasks show that our PreDAT remarkably outperforms existing pre-trained NAR models (+4.2 scores on average) and even achieves better results than pre-trained autoregressive baselines in n-gram-based metrics, along with 17 times speedup in throughput. Further analysis shows that PreDAT benefits from the unbiased prediction order that alleviates the error accumulation problem in autoregressive generation, which provides new insights into the advantages of NAR generation., Comment: Accepted at Transactions of the Association for Computational Linguistics
Published: 2023

18. Tailoring Language Generation Models under Total Variation Distance

Author: Ji, Haozhe, Ke, Pei, Hu, Zhipeng, Zhang, Rongsheng, and Huang, Minlie
Subjects: Computer Science - Computation and Language
Abstract: The standard paradigm of neural language generation adopts maximum likelihood estimation (MLE) as the optimizing method. From a distributional view, MLE in fact minimizes the Kullback-Leibler divergence (KLD) between the distribution of the real data and that of the model. However, this approach forces the model to distribute non-zero (sometimes large) probability mass to all training samples regardless of their quality. Moreover, in the attempt to cover the low-probability regions in the data distribution, the model systematically overestimates the probability of corrupted text sequences, which we conjecture is one of the main reasons for text degeneration during autoregressive decoding. To remedy this problem, we leverage the total variation distance (TVD) with its robustness to outliers, and develop practical bounds to apply it to language generation. Then, we introduce the TaiLr objective that balances the tradeoff of estimating TVD. Intuitively, TaiLr downweights real data samples that have low model probabilities with tunable penalization intensity. Experimental results show that our method alleviates the overestimation of degenerated sequences without sacrificing diversity and improves generation quality on a wide range of text generation tasks., Comment: Published in ICLR 2023 (notable-top-5%)
Published: 2023

19. Technical Report: Automating Vehicle SOA Threat Analysis using a Model-Based Methodology

Author: Dantas, Yuri Gil, Barner, Simon, Ke, Pei, Nigam, Vivek, and Schoepp, Ulrich
Subjects: Computer Science - Logic in Computer Science
Abstract: While the adoption of Service-Oriented Architectures (SOA) eases the implementation of features such as autonomous driving and over-the-air updates, it also increases the vehicle's exposure to attacks that may place road-users in harm. To address this problem, standards (ISO 21434/UNECE) expect manufacturers to produce security arguments and evidence by carrying out appropriate threat analysis. As key threat analysis steps, e.g., damage/threat scenario and attack path enumeration, are often carried out manually and not rigorously, security arguments lack precise guarantees, e.g., traceability w.r.t. safety goals, especially under system updates. This article proposes automated methods for threat analysis using a model-based engineering methodology that provides precise guarantees with respect to safety goals. This is accomplished by proposing an intruder model for automotive SOA which together with the system architecture and the loss scenarios identified by safety analysis are used as input for computing assets, impact rating, damage/threat scenarios, and attack paths. To validate the proposed methodology, we developed a faithful model of the autonomous driving functions of the Apollo framework, a widely used open-source autonomous driving stack. The proposed machinery automatically enumerates several attack paths on Apollo, including attack paths not reported in the literature.
Published: 2022

20. Learning Instructions with Unlabeled Data for Zero-Shot Cross-Task Generalization

Author: Gu, Yuxian, Ke, Pei, Zhu, Xiaoyan, and Huang, Minlie
Subjects: Computer Science - Computation and Language
Abstract: Training language models to learn from human instructions for zero-shot cross-task generalization has attracted much attention in NLP communities. Recently, instruction tuning (IT), which fine-tunes a pre-trained language model on a massive collection of tasks described via human-craft instructions, has been shown effective in instruction learning for unseen tasks. However, IT relies on a large amount of human-annotated samples, which restricts its generalization. Unlike labeled data, unlabeled data are often massive and cheap to obtain. In this work, we study how IT can be improved with unlabeled data. We first empirically explore the IT performance trends versus the number of labeled data, instructions, and training tasks. We find it critical to enlarge the number of training instructions, and the instructions can be underutilized due to the scarcity of labeled data. Then, we propose Unlabeled Data Augmented Instruction Tuning (UDIT) to take better advantage of the instructions during IT by constructing pseudo-labeled data from unlabeled plain texts. We conduct extensive experiments to show UDIT's effectiveness in various scenarios of tasks and datasets. We also comprehensively analyze the key factors of UDIT to investigate how to better improve IT with unlabeled data. The code is publicly available at https://github.com/thu-coai/UDIT., Comment: Accepted by the main conference of EMNLP 2022
Published: 2022

21. ChatGPT: potential, prospects, and limitations

Author: Zhou, Jie, Ke, Pei, Qiu, Xipeng, Huang, Minlie, and Zhang, Junping
Published: 2024
Full Text: View/download PDF

22. Inexact line search method in full waveform inversion

Author: Ma, Xiaona, Xu, Shan-hui, Ke, Pei, and Zhang, Hong-tao
Published: 2023
Full Text: View/download PDF

23. Distinct skyrmion phases at room temperature in two-dimensional ferromagnet Fe3GaTe2

Author: Xiaowei Lv, Hualiang Lv, Yalei Huang, Ruixuan Zhang, Guanhua Qin, Yihui Dong, Min Liu, Ke Pei, Guixin Cao, Jincang Zhang, Yuxiang Lai, and Renchao Che
Subjects: Science
Abstract: Abstract Distinct skyrmion phases at room temperature hosted by one material offer additional degree of freedom for the design of topology-based compact and energetically-efficient spintronic devices. The field has been extended to low-dimensional magnets with the discovery of magnetism in two-dimensional van der Waals magnets. However, creating multiple skyrmion phases in 2D magnets, especially above room temperature, remains a major challenge. Here, we report the experimental observation of mixed-type skyrmions, exhibiting both Bloch and hybrid characteristics, in a room-temperature ferromagnet Fe3GaTe2. Analysis of the magnetic intensities under varied imaging conditions coupled with complementary simulations reveal that spontaneous Bloch skyrmions exist as the magnetic ground state with the coexistence of hybrid stripes domain, on account of the interplay between the dipolar interaction and the Dzyaloshinskii-Moriya interaction. Moreover, hybrid skyrmions are created and their coexisting phases with Bloch skyrmions exhibit considerably high thermostability, enduring up to 328 K. The findings open perspectives for 2D spintronic devices incorporating distinct skyrmion phases at room temperature.
Published: 2024
Full Text: View/download PDF

24. Unveiling the functional heterogeneity of cytokine-primed human umbilical cord mesenchymal stem cells through single-cell RNA sequencing

Author: Zhiwei Hu, Duanduan Li, Shiduo Wu, Ke Pei, Zeqin Fu, Yulin Yang, Yinfu Huang, Jian Yang, Chuntao Liu, Junyuan Hu, Cheguo Cai, and Yan Liao
Subjects: Mesenchymal stem cells (MSCs), Single-cell RNA sequencing (scRNA-seq), Cell heterogeneity, Cytokine priming, Functional subpopulations, Biotechnology, TP248.13-248.65, Biology (General), QH301-705.5, Biochemistry, QD415-436
Abstract: Abstract Background Mesenchymal stem cells (MSCs) hold immense promise for use in immunomodulation and regenerative medicine. However, their inherent heterogeneity makes it difficult to achieve optimal therapeutic outcomes for a specific clinical disease. Primed MSCs containing a certain cytokine can enhance their particular functions, thereby increasing their therapeutic potential for related diseases. Therefore, understanding the characteristic changes and underlying mechanisms of MSCs primed by various cytokines is highly important. Results In this study, we aimed to reveal the cellular heterogeneity, functional subpopulations, and molecular mechanisms of MSCs primed with IFN-γ, TNF-α, IL-4, IL-6, IL-15, and IL-17 using single-cell RNA sequencing (scRNA-seq). Our results demonstrated that cytokine priming minimized the heterogeneity of the MSC transcriptome, while the expression of MSC surface markers exhibited only slight changes. Notably, compared to IL-6, IL-15, and IL-17; IFN-γ, TNF-α, and IL-4 priming, which stimulated a significantly greater number of differentially expressed genes (DEGs). Functional analysis, which included Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses, indicated that IFN-γ, TNF-α, and IL-4-primed hUC-MSCs are involved in interferon-mediated immune-related processes, leukocyte migration, chemotaxis potential, and extracellular matrix and cell adhesion, respectively. Moreover, an investigation of various biological function scores demonstrated that IFN-γ-primed hUC-MSCs exhibit strong immunomodulatory ability, TNF-α-primed hUC-MSCs exhibit high chemotaxis potential, and IL-4-primed hUC-MSCs express elevated amounts of collagen. Finally, we observed that cytokine priming alters the distribution of functional subpopulations of MSCs, and these subpopulations exhibit various potential biological functions. Taken together, our study revealed the distinct regulatory effects of cytokine priming on MSC heterogeneity, biological function, and functional subpopulations at the single-cell level. Conclusions These findings contribute to a comprehensive understanding of the inflammatory priming of MSCs, paving the way for their precise treatment in clinical applications.
Published: 2024
Full Text: View/download PDF

25. Curriculum-Based Self-Training Makes Better Few-Shot Learners for Data-to-Text Generation

Author: Ke, Pei, Ji, Haozhe, Yang, Zhenyu, Huang, Yi, Feng, Junlan, Zhu, Xiaoyan, and Huang, Minlie
Subjects: Computer Science - Computation and Language
Abstract: Despite the success of text-to-text pre-trained models in various natural language generation (NLG) tasks, the generation performance is largely restricted by the number of labeled data in downstream tasks, particularly in data-to-text generation tasks. Existing works mostly utilize abundant unlabeled structured data to conduct unsupervised pre-training for task adaption, which fail to model the complex relationship between source structured data and target texts. Thus, we introduce self-training as a better few-shot learner than task-adaptive pre-training, which explicitly captures this relationship via pseudo-labeled data generated by the pre-trained model. To alleviate the side-effect of low-quality pseudo-labeled data during self-training, we propose a novel method called Curriculum-Based Self-Training (CBST) to effectively leverage unlabeled data in a rearranged order determined by the difficulty of text generation. Experimental results show that our method can outperform fine-tuning and task-adaptive pre-training methods, and achieve state-of-the-art performance in the few-shot setting of data-to-text generation., Comment: Accepted by IJCAI 2022
Published: 2022

26. Mechanism of Yishen Chuchan decoction intervention of Parkinson's disease based on network pharmacology and experimental verification

Author: Dong Di, Chencheng Zhang, Suping Sun, Ke Pei, Renjun Gu, Yan Sun, Shihan Zhou, Yanqing Wang, Xinyi Chen, Shan Jiang, Haoxin Wu, Boran Zhu, and Xu Xu
Subjects: Parkinson's disease, p38 MAPK, Medicated serum, Traditional Chinese medicine, Network pharmacology, Rotenone, Science (General), Q1-390, Social sciences (General), H1-99
Abstract: The incidence of Parkinson's disease (PD) rises rapidly with the increase of age. With the advent of global aging, the number of patients with PD is rising along with the elderly population, especially in China. Previously, we found that Yishen chuchan decoction (YCD), prescribed based on clinical experience, has the potential of alleviating symptoms, delaying the progression, and controlling the development of PD. Nonetheless, the underlying mechanistic role is yet to be explored. Aim: This research examined the possible therapeutic effects of YCD in alleviating PD via a systematic approach with network pharmacology and experimental validation, aiming at providing a new understanding of traditional Chinese medicine management regarding PD. Methods: The chemical structure and properties of YCD were adopted from Traditional Chinese Medicine System Pharmacology Database (TCMSP), SwissADME, PubChem, and PubMed. The potential targets for YCD and PD were identified using Swiss Target Prediction, GeneCard, PubChem, and UniProt. The herbal-component-target network was created via the Cytoscape software. Moreover, by using the STRING database, the protein-protein interaction (PPI) network was screened. Gene function GO and KEGG pathway enrichment analyses were performed via the Metascape database. YCD-medicated Rat Serum from Sprague-Dawley (SD) Rats was prepared, and SH-SY5Y cells were preconditioned with rotenone to develop the PD model. To examine the impact of YCD on these cells and explore the mechanistic role of the p38 mitogen-activated protein kinase (MAPK) pathway, the cells were pretreated with either serum or a p38 MAPK pathway inhibitor. This study employed the Cell Counting Kit (CCK)-8 assay and Hoechst 33,342 staining to evaluate the viability and morphological changes induced by the YCD-medicated rat serum on rotenone-treated SH-SY5Y cells. Apoptosis was assessed by Flow cytometry. Immunofluorescence staining assessed the microtubule-associated protein 2 (MAP2) level. Enzyme-linked immunosorbent assay (ELISA) was employed to quantify the concentrations of inflammatory mediators interleukin-1β (IL-1β), interleukin-6 (IL-6) and tumor necrosis factor-alpha (TNF-α). Also, reactive oxygen species (ROS) and superoxide dismutase (SOD) levels were determined. Western Blotting measured the expression of total and phospho-p38 MAPK (p-p38). Results: This study identified 65 active components in YCD, which were found to target 801 specific genes. By screening, 63 potential core targets were identified from a pool of 172 overlapping targets between PD and YCD. These targets were examined by GO and KEGG analyses revealing their substantial correlation to MAPK, PI3K-Akt signaling pathways, positively controlling protein phosphorylation, and pathways of neurodegenerative diseases. SH-SY5Y cells were treated with 2 μM rotenone for 48 h, which reduced cell viability to 50 %, and reduced MAP2 expression, increased the rate of apoptosis, oxidative stress, inflammation, and p-p38 expressions. YCD-medicated rat serum significantly improved the viability, reduced the apoptosis rate, and increased the MAP2 expression. YCD-medicated serum increased SOD, reduced ROS and suppressed IL-6, IL-1β and TNF-α levels, thus inhibiting oxidative stress and inflammation in rotenone-treated SH-SY5Y cells. Moreover, YCD-medicated serum substantially lowered the p-p38 expression induced by rotenone. SB203580, a specific inhibitor of p38 MAPK, could also inhibit the p-p38 expression, apoptosis, and restore morphological damage of cells, also improve inflammation and oxidative stress. Conclusion: YCD enhanced cell viability and reduced apoptosis rate, inflammation, and oxidative stress in vitro. These beneficial effects could potentially involve the suppression of p38 pathway and suppressed the phosphorylation of p38 MAPK.
Published: 2024
Full Text: View/download PDF

27. CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text Generation

Author: Ke, Pei, Zhou, Hao, Lin, Yankai, Li, Peng, Zhou, Jie, Zhu, Xiaoyan, and Huang, Minlie
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Existing reference-free metrics have obvious limitations for evaluating controlled text generation models. Unsupervised metrics can only provide a task-agnostic evaluation result which correlates weakly with human judgments, whereas supervised ones may overfit task-specific data with poor generalization ability to other datasets. In this paper, we propose an unsupervised reference-free metric called CTRLEval, which evaluates controlled text generation from different aspects by formulating each aspect into multiple text infilling tasks. On top of these tasks, the metric assembles the generation probabilities from a pre-trained language model without any model training. Experimental results show that our metric has higher correlations with human judgments than other baselines, while obtaining better generalization of evaluating generated texts from different models and with different qualities., Comment: Accepted by ACL 2022 (Main Conference)
Published: 2022

28. EVA2.0: Investigating Open-Domain Chinese Dialogue Systems with Large-Scale Pre-Training

Author: Gu, Yuxian, Wen, Jiaxin, Sun, Hao, Song, Yi, Ke, Pei, Zheng, Chujie, Zhang, Zheng, Yao, Jianzhu, Liu, Lei, Zhu, Xiaoyan, and Huang, Minlie
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Large-scale pre-training has shown remarkable performance in building open-domain dialogue systems. However, previous works mainly focus on showing and evaluating the conversational performance of the released dialogue model, ignoring the discussion of some key factors towards a powerful human-like chatbot, especially in Chinese scenarios. In this paper, we conduct extensive experiments to investigate these under-explored factors, including data quality control, model architecture designs, training approaches, and decoding strategies. We propose EVA2.0, a large-scale pre-trained open-domain Chinese dialogue model with 2.8 billion parameters, and will make our models and codes publicly available. Automatic and human evaluations show that EVA2.0 significantly outperforms other open-source counterparts. We also discuss the limitations of this work by presenting some failure cases and pose some future research directions on large-scale Chinese open-domain dialogue systems., Comment: Machine Intelligence Research. https://link.springer.com/article/10.1007/s11633-022-1387-3 . 12 pages, 5 figures. The code and pre-trained models are publicly available at https://github.com/thu-coai/EVA
Published: 2022
Full Text: View/download PDF

29. Rethinking and Refining the Distinct Metric

Author: Liu, Siyang, Sabour, Sahand, Zheng, Yinhe, Ke, Pei, Zhu, Xiaoyan, and Huang, Minlie
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, I.2.7
Abstract: Distinct-$n$ score\cite{Li2016} is a widely used automatic metric for evaluating diversity in language generation tasks. However, we observed that the original approach for calculating distinct scores has evident biases that tend to assign higher penalties to longer sequences. We refine the calculation of distinct scores by scaling the number of distinct tokens based on their expectations. We provide both empirical and theoretical evidence to show that our method effectively removes the biases existing in the original distinct score. Our experiments show that our proposed metric, \textit{Expectation-Adjusted Distinct (EAD)}, correlates better with human judgment in evaluating response diversity. To foster future research, we provide an example implementation at \url{https://github.com/lsy641/Expectation-Adjusted-Distinct}., Comment: 4 pages, to be published at ACL2022
Published: 2022

30. Effect of postoperative oxygen therapy regimen modification on oxygenation in patients with acute type A aortic dissection

Author: Li Wang, Xinyan Pang, Shouluan Ding, Ke Pei, Zijia Li, and Jianhong Wan
Subjects: Extracorporeal circulation, Mechanical ventilation, Oxygen concentration, Positive end-expiratory pressure, Science (General), Q1-390, Social sciences (General), H1-99
Abstract: Objective: In this study, we investigated the effect of various oxygen therapy regimens on oxygenation in patients with acute type A aortic dissection (AAD). Methods: A quasi-randomized controlled trial was conducted, in which patients with AAD hospitalized for surgery from June to September 2021 were assigned to the control group (patients received conventional oxygen therapy after postoperative mechanical ventilation, weaning, and extubation) and those who were admitted from October to December 2021 were assigned to the observation group [patients underwent optimally adjusted therapy based on the treatment of the control group, which mainly included prioritized elevation of positive end-expiratory pressure (PEEP) and restricted use of the fraction of inspired oxygen (FiO2)].The postoperative oxygenation index, blood gas analysis, and duration of mechanical ventilation were compared between the two groups. Results: There were significant differences in oxygenation observed at 2 h postoperatively between the groups. 12, 24, and 72 h postoperatively, the oxygenation index varied significantly between the two groups. There were statistically significant differences in the time effects of the oxygenation index and PaO2 between the two groups, as well as significant differences in the length of stay in the intensive care unit. Conclusion: For the postoperative care of patients with AAD, it is suggested that the minimum FiO2 required for oxygenation of patients be maintained. In addition, it is possible to enhance PEEP as a priority when PaO2 is low.
Published: 2024
Full Text: View/download PDF

31. EVA: An Open-Domain Chinese Dialogue System with Large-Scale Generative Pre-Training

Author: Zhou, Hao, Ke, Pei, Zhang, Zheng, Gu, Yuxian, Zheng, Yinhe, Zheng, Chujie, Wang, Yida, Wu, Chen Henry, Sun, Hao, Yang, Xiaocong, Wen, Bosi, Zhu, Xiaoyan, Huang, Minlie, and Tang, Jie
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Although pre-trained language models have remarkably enhanced the generation ability of dialogue systems, open-domain Chinese dialogue systems are still limited by the dialogue data and the model size compared with English ones. In this paper, we propose EVA, a Chinese dialogue system that contains the largest Chinese pre-trained dialogue model with 2.8B parameters. To build this model, we collect the largest Chinese dialogue dataset named WDC-Dialogue from various public social media. This dataset contains 1.4B context-response pairs and is used as the pre-training corpus of EVA. Extensive experiments on automatic and human evaluation show that EVA outperforms other Chinese pre-trained dialogue models especially in the multi-turn interaction of human-bot conversations., Comment: 8 pages, 4 figures
Published: 2021

32. CPM-2: Large-scale Cost-effective Pre-trained Language Models

Author: Zhang, Zhengyan, Gu, Yuxian, Han, Xu, Chen, Shengqi, Xiao, Chaojun, Sun, Zhenbo, Yao, Yuan, Qi, Fanchao, Guan, Jian, Ke, Pei, Cai, Yanzheng, Zeng, Guoyang, Tan, Zhixing, Liu, Zhiyuan, Huang, Minlie, Han, Wentao, Liu, Yang, Zhu, Xiaoyan, and Sun, Maosong
Subjects: Computer Science - Computation and Language
Abstract: In recent years, the size of pre-trained language models (PLMs) has grown by leaps and bounds. However, efficiency issues of these large-scale PLMs limit their utilization in real-world scenarios. We present a suite of cost-effective techniques for the use of PLMs to deal with the efficiency issues of pre-training, fine-tuning, and inference. (1) We introduce knowledge inheritance to accelerate the pre-training process by exploiting existing PLMs instead of training models from scratch. (2) We explore the best practice of prompt tuning with large-scale PLMs. Compared with conventional fine-tuning, prompt tuning significantly reduces the number of task-specific parameters. (3) We implement a new inference toolkit, namely InfMoE, for using large-scale PLMs with limited computational resources. Based on our cost-effective pipeline, we pre-train two models: an encoder-decoder bilingual model with 11 billion parameters (CPM-2) and its corresponding MoE version with 198 billion parameters. In our experiments, we compare CPM-2 with mT5 on downstream tasks. Experimental results show that CPM-2 has excellent general language intelligence. Moreover, we validate the efficiency of InfMoE when conducting inference of large-scale models having tens of billions of parameters on a single GPU. All source code and model parameters are available at https://github.com/TsinghuaAI/CPM.
Published: 2021

33. JointGT: Graph-Text Joint Representation Learning for Text Generation from Knowledge Graphs

Author: Ke, Pei, Ji, Haozhe, Ran, Yu, Cui, Xin, Wang, Liwei, Song, Linfeng, Zhu, Xiaoyan, and Huang, Minlie
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Existing pre-trained models for knowledge-graph-to-text (KG-to-text) generation simply fine-tune text-to-text pre-trained models such as BART or T5 on KG-to-text datasets, which largely ignore the graph structure during encoding and lack elaborate pre-training tasks to explicitly model graph-text alignments. To tackle these problems, we propose a graph-text joint representation learning model called JointGT. During encoding, we devise a structure-aware semantic aggregation module which is plugged into each Transformer layer to preserve the graph structure. Furthermore, we propose three new pre-training tasks to explicitly enhance the graph-text alignment including respective text / graph reconstruction, and graph-text alignment in the embedding space via Optimal Transport. Experiments show that JointGT obtains new state-of-the-art performance on various KG-to-text datasets., Comment: ACL 2021 (Findings)
Published: 2021

34. Semantic-Enhanced Explainable Finetuning for Open-Domain Dialogues

Author: Zheng, Yinhe, Wang, Yida, Ke, Pei, Yang, Zhenyu, and Huang, Minlie
Subjects: Computer Science - Computation and Language
Abstract: This paper propose to combine pretrained language models with the modular dialogue paradigm for open-domain dialogue modeling. Our method, semantic-enhanced finetuning, instantiates conversation understanding, planning, and response generation as a language model finetuning task. At inference, we disentangle semantic and token variations by specifying sampling methods and constraints for each module separately. For training and evaluation, we present X-Weibo, a Chinese multi-turn open-domain dialogue dataset with automatic annotation for emotions, DAs, and topical words. Experiments show that semantic-enhanced finetuning outperforms strong baselines on non-semantic and semantic metrics, improves the human-evaluated relevance, coherence, and informativeness, and exhibits considerable controllability over semantic variables., Comment: Under review
Published: 2021

35. Evaluation of the injection and plugging ability of a novel epoxy resin in cement cracks

Author: Leng, Guang-Yao, Yan, Wei, Ye, Hai-Mu, Yao, Er-Dong, Duan, Ji-Bin, Xu, Zheng-Xian, Li, Ke-Pei, Zhang, Jing-Ru, and Li, Zhong
Published: 2024
Full Text: View/download PDF

36. CPM: A Large-scale Generative Chinese Pre-trained Language Model

Author: Zhang, Zhengyan, Han, Xu, Zhou, Hao, Ke, Pei, Gu, Yuxian, Ye, Deming, Qin, Yujia, Su, Yusheng, Ji, Haozhe, Guan, Jian, Qi, Fanchao, Wang, Xiaozhi, Zheng, Yanan, Zeng, Guoyang, Cao, Huanqi, Chen, Shengqi, Li, Daixuan, Sun, Zhenbo, Liu, Zhiyuan, Huang, Minlie, Han, Wentao, Tang, Jie, Li, Juanzi, Zhu, Xiaoyan, and Sun, Maosong
Subjects: Computer Science - Computation and Language
Abstract: Pre-trained Language Models (PLMs) have proven to be beneficial for various downstream NLP tasks. Recently, GPT-3, with 175 billion parameters and 570GB training data, drew a lot of attention due to the capacity of few-shot (even zero-shot) learning. However, applying GPT-3 to address Chinese NLP tasks is still challenging, as the training corpus of GPT-3 is primarily English, and the parameters are not publicly available. In this technical report, we release the Chinese Pre-trained Language Model (CPM) with generative pre-training on large-scale Chinese training data. To the best of our knowledge, CPM, with 2.6 billion parameters and 100GB Chinese training data, is the largest Chinese pre-trained language model, which could facilitate several downstream Chinese NLP tasks, such as conversation, essay generation, cloze test, and language understanding. Extensive experiments demonstrate that CPM achieves strong performance on many NLP tasks in the settings of few-shot (even zero-shot) learning. The code and parameters are available at https://github.com/TsinghuaAI/CPM-Generate.
Published: 2020

37. Tanshinone IIA modulates cancer cell morphology and movement via Rho GTPases-mediated actin cytoskeleton remodeling

Author: Liang, En-yu, Huang, Meng-he, Chen, Ying-ting, Zhang, Peng-wei, Shen, Yan, Tu, Xiao-xin, Chen, Wei-ye, Wang, Yi, Yan, Jun, Wang, Hong-yu, Ke, Pei-feng, and Huang, Xian-zhang
Published: 2024
Full Text: View/download PDF

38. Identification of pattern recognition receptor genes in peripheral blood mononuclear cells and monocytes as biomarkers for the diagnosis of lupus nephritis

Author: Ke, Pei-Feng, Zhu, Yan-Ting, Cao, Song-Ling, Wang, Yi, Wu, Shi-Ting, He, Qian-Qian, Liang, Li-Fang, and Li, Ji-Cheng
Published: 2024
Full Text: View/download PDF

39. Triptolide inhibits the proinflammatory potential of myeloid-derived suppressor cells via reducing Arginase-1 in rheumatoid arthritis

Author: Zhao, Ziling, Huang, Huijie, Ke, Sikai, Deng, Bishun, Wang, Yun-xiu, Xu, Ning, Peng, Anping, Han, Guang, Liang, Enyu, He, Xiaohong, He, Qinglian, Ke, Pei-feng, Huang, Xian-zhang, and He, Min
Published: 2024
Full Text: View/download PDF

40. EVA2.0: Investigating Open-domain Chinese Dialogue Systems with Large-scale Pre-training

Author: Gu, Yuxian, Wen, Jiaxin, Sun, Hao, Song, Yi, Ke, Pei, Zheng, Chujie, Zhang, Zheng, Yao, Jianzhu, Liu, Lei, Zhu, Xiaoyan, and Huang, Minlie
Published: 2023
Full Text: View/download PDF

41. Research on geomagnetic indoor high-precision positioning algorithm based on generative model

Author: Shuai MA, Ke PEI, Huayan QI, Hang LI, Wen CAO, Hongmei WANG, Hailiang XIONG, and Shiyin LI
Subjects: deep learning, geomagnetic positioning, generative model, geomagnetic sequence, Telecommunication, TK5101-6720
Abstract: Aiming at the current bottleneck of constructing a fine geomagnetic fingerprint library that required a lot of labor costs, two generative models called the conditional variational autoencoder and the conditional confrontational generative network were proposed, which could collect a small number of data samples for a given location, and generate pseudo-label fingerprints.At the same time, in order to solve the problem of low positioning accuracy of single-point geomagnetic fingerprints, a geomagnetic sequence positioning algorithm based on attention mechanism of convolutional neural network-gated recurrent unit was designed, which could effectively use the spatial and temporal characteristics of fingerprints to achieve precise positioning.In addition, a real-time, portable mobile terminal data collection and positioning system was also designed and built.The actual test shows that the proposed model can effectively construct the available geomagnetic fingerprint database, and the average error of the proposed algorithm can reach 0.16 m.
Published: 2023
Full Text: View/download PDF

42. Generating Commonsense Explanation by Extracting Bridge Concepts from Reasoning Paths

Author: Ji, Haozhe, Ke, Pei, Huang, Shaohan, Wei, Furu, and Huang, Minlie
Subjects: Computer Science - Computation and Language
Abstract: Commonsense explanation generation aims to empower the machine's sense-making capability by generating plausible explanations to statements against commonsense. While this task is easy to human, the machine still struggles to generate reasonable and informative explanations. In this work, we propose a method that first extracts the underlying concepts which are served as \textit{bridges} in the reasoning chain and then integrates these concepts to generate the final explanation. To facilitate the reasoning process, we utilize external commonsense knowledge to build the connection between a statement and the bridge concepts by extracting and pruning multi-hop paths to build a subgraph. We design a bridge concept extraction model that first scores the triples, routes the paths in the subgraph, and further selects bridge concepts with weak supervision at both the triple level and the concept level. We conduct experiments on the commonsense explanation generation task and our model outperforms the state-of-the-art baselines in both automatic and human evaluation., Comment: Accepted by AACL-IJCNLP 2020
Published: 2020

43. Language Generation with Multi-Hop Reasoning on Commonsense Knowledge Graph

Author: Ji, Haozhe, Ke, Pei, Huang, Shaohan, Wei, Furu, Zhu, Xiaoyan, and Huang, Minlie
Subjects: Computer Science - Computation and Language
Abstract: Despite the success of generative pre-trained language models on a series of text generation tasks, they still suffer in cases where reasoning over underlying commonsense knowledge is required during generation. Existing approaches that integrate commonsense knowledge into generative pre-trained language models simply transfer relational knowledge by post-training on individual knowledge triples while ignoring rich connections within the knowledge graph. We argue that exploiting both the structural and semantic information of the knowledge graph facilitates commonsense-aware text generation. In this paper, we propose Generation with Multi-Hop Reasoning Flow (GRF) that enables pre-trained models with dynamic multi-hop reasoning on multi-relational paths extracted from the external commonsense knowledge graph. We empirically show that our model outperforms existing baselines on three text generation tasks that require reasoning over commonsense knowledge. We also demonstrate the effectiveness of the dynamic multi-hop reasoning module with reasoning paths inferred by the model that provide rationale to the generation., Comment: accepted by EMNLP 2020
Published: 2020

44. A Large-Scale Chinese Short-Text Conversation Dataset

Author: Wang, Yida, Ke, Pei, Zheng, Yinhe, Huang, Kaili, Jiang, Yong, Zhu, Xiaoyan, and Huang, Minlie
Subjects: Computer Science - Computation and Language
Abstract: The advancements of neural dialogue generation models show promising results on modeling short-text conversations. However, training such models usually needs a large-scale high-quality dialogue corpus, which is hard to access. In this paper, we present a large-scale cleaned Chinese conversation dataset, LCCC, which contains a base version (6.8million dialogues) and a large version (12.0 million dialogues). The quality of our dataset is ensured by a rigorous data cleaning pipeline, which is built based on a set of rules and a classifier that is trained on manually annotated 110K dialogue pairs. We also release pre-training dialogue models which are trained on LCCC-base and LCCC-large respectively. The cleaned dataset and the pre-training models will facilitate the research of short-text conversation modeling. All the models and datasets are available at https://github.com/thu-coai/CDial-GPT., Comment: Accepted by NLPCC 2020 (Best Student Paper)
Published: 2020

45. CoTK: An Open-Source Toolkit for Fast Development and Fair Evaluation of Text Generation

Author: Huang, Fei, Wan, Dazhen, Shao, Zhihong, Ke, Pei, Guan, Jian, Niu, Yilin, Zhu, Xiaoyan, and Huang, Minlie
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning, I.2.7
Abstract: In text generation evaluation, many practical issues, such as inconsistent experimental settings and metric implementations, are often ignored but lead to unfair evaluation and untenable conclusions. We present CoTK, an open-source toolkit aiming to support fast development and fair evaluation of text generation. In model development, CoTK helps handle the cumbersome issues, such as data processing, metric implementation, and reproduction. It standardizes the development steps and reduces human errors which may lead to inconsistent experimental settings. In model evaluation, CoTK provides implementation for many commonly used metrics and benchmark models across different experimental settings. As a unique feature, CoTK can signify when and which metric cannot be fairly compared. We demonstrate that it is convenient to use CoTK for model development and evaluation, particularly across different experimental settings., Comment: Submitting to ACL2020 demo
Published: 2020

46. SentiLARE: Sentiment-Aware Language Representation Learning with Linguistic Knowledge

Author: Ke, Pei, Ji, Haozhe, Liu, Siyang, Zhu, Xiaoyan, and Huang, Minlie
Subjects: Computer Science - Computation and Language
Abstract: Most of the existing pre-trained language representation models neglect to consider the linguistic knowledge of texts, which can promote language understanding in NLP tasks. To benefit the downstream tasks in sentiment analysis, we propose a novel language representation model called SentiLARE, which introduces word-level linguistic knowledge including part-of-speech tag and sentiment polarity (inferred from SentiWordNet) into pre-trained models. We first propose a context-aware sentiment attention mechanism to acquire the sentiment polarity of each word with its part-of-speech tag by querying SentiWordNet. Then, we devise a new pre-training task called label-aware masked language model to construct knowledge-aware language representation. Experiments show that SentiLARE obtains new state-of-the-art performance on a variety of sentiment analysis tasks., Comment: Accepted by EMNLP 2020 (Main Conference)
Published: 2019

47. ARAML: A Stable Adversarial Training Framework for Text Generation

Author: Ke, Pei, Huang, Fei, Huang, Minlie, and Zhu, Xiaoyan
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning, 68T50
Abstract: Most of the existing generative adversarial networks (GAN) for text generation suffer from the instability of reinforcement learning training algorithms such as policy gradient, leading to unstable performance. To tackle this problem, we propose a novel framework called Adversarial Reward Augmented Maximum Likelihood (ARAML). During adversarial training, the discriminator assigns rewards to samples which are acquired from a stationary distribution near the data rather than the generator's distribution. The generator is optimized with maximum likelihood estimation augmented by the discriminator's rewards instead of policy gradient. Experiments show that our model can outperform state-of-the-art text GANs with a more stable training process., Comment: Accepted by EMNLP 2019
Published: 2019

48. Monocular Vehicle 3D Bounding Box Estimation Using Homograhy and Geometry in Traffic Scene.

Author: Yiqiang Chen, Feng Liu 0040, and Ke Pei
Published: 2022
Full Text: View/download PDF

49. A modified LC-MS/MS method for the detection of whole blood tacrolimus and its clinical value in Chinese kidney transplant patients

Author: Yu, Ke-Wei, Li, Bing-Ling, Yuan, Ying-Shi, Liao, Jia-Min, Li, Wei-Kang, Dong, Heng, Ke, Pei-Feng, Jin, Xing, Chen, Lu, Zhao, Jing-Jing, Wang, Heng, Cao, Shun-Wang, Chen, Wei-Ye, Huang, Xian-Zhang, Zhao, Bei-Bei, and Kang, Chun-Min
Published: 2022
Full Text: View/download PDF

50. Ultrahigh Density of Atomic CoFe-Electron Synergy in Noncontinuous Carbon Matrix for Highly Efficient Magnetic Wave Adsorption

Author: Wenhuan Huang, Qiang Qiu, Xiufang Yang, Shouwei Zuo, Jianan Bai, Huabin Zhang, Ke Pei, and Renchao Che
Subjects: Electromagnetic wave-absorbing materials, Off-axis electron hologram, M–M’ interaction, Hierarchical porous structure, Energetic metal organic framework, Technology
Abstract: Abstract Improving the atom utilization of metals and clarifying the M–M’ interaction is both greatly significant in assembling high-performance ultra-light electromagnetic wave-absorbing materials. Herein, a high-temperature explosion strategy has been successfully applied to assemble the hierarchical porous carbon sponge with Co–Fe decoration via the pyrolysis of the energetic metal organic framework. The as-constructed hybrid displays a superior reflection loss (RL) value of − 57.7 dB and a specific RL value of − 192 dB mg−1 mm−1 at 12.08 GHz with a layer thickness of 2.0 mm (loading of 15 wt%). The off-axis electron hologram characterizes the highly distributed numerous polarized nanodomain variable capacitors, demonstrating the dipole and interfacial polarization along the edges of the nanopores. More importantly, the X-ray absorption spectroscopy analysis verifies the mutual interaction between the metal cluster and carbon matrix and the electronic coupling responsible for the greatly improved electromagnetic wave absorption.
Published: 2022
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

458 results on '"Ke, Pei"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources