Author: "Liu, Xuebo" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Liu, Xuebo"' showing total 1,665 results

Start Over Author "Liu, Xuebo"

1,665 results on '"Liu, Xuebo"'

1. NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual Updates

Author: Deng, Hexuan, Jiao, Wenxiang, Liu, Xuebo, Zhang, Min, and Tu, Zhaopeng
Subjects: Computer Science - Computation and Language
Abstract: Despite their remarkable abilities in various tasks, large language models (LLMs) still struggle with real-time information (e.g., new facts and terms) due to the knowledge cutoff in their development process. However, existing benchmarks focus on outdated content and limited fields, facing difficulties in real-time updating and leaving new terms unexplored. To address this problem, we propose an adaptive benchmark, NewTerm, for real-time evaluation of new terms. We design a highly automated construction method to ensure high-quality benchmark construction with minimal human effort, allowing flexible updates for real-time information. Empirical results on various LLMs demonstrate over 20% performance reduction caused by new terms. Additionally, while updates to the knowledge cutoff of LLMs can cover some of the new terms, they are unable to generalize to more distant new terms. We also analyze which types of terms are more challenging and why LLMs struggle with new terms, paving the way for future research. Finally, we construct NewTerm 2022 and 2023 to evaluate the new terms updated each year and will continue updating annually. The benchmark and codes can be found at https://github.com/hexuandeng/NewTerm., Comment: Accepted to NeurIPS 2024 Datasets and Benchmarks Track
Published: 2024

2. DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory

Author: Wang, Yutong, Zeng, Jiali, Liu, Xuebo, Wong, Derek F., Meng, Fandong, Zhou, Jie, and Zhang, Min
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Large language models (LLMs) have achieved reasonable quality improvements in machine translation (MT). However, most current research on MT-LLMs still faces significant challenges in maintaining translation consistency and accuracy when processing entire documents. In this paper, we introduce DelTA, a Document-levEL Translation Agent designed to overcome these limitations. DelTA features a multi-level memory structure that stores information across various granularities and spans, including Proper Noun Records, Bilingual Summary, Long-Term Memory, and Short-Term Memory, which are continuously retrieved and updated by auxiliary LLM-based components. Experimental results indicate that DelTA significantly outperforms strong baselines in terms of translation consistency and quality across four open/closed-source LLMs and two representative document translation datasets, achieving an increase in consistency scores by up to 4.58 percentage points and in COMET scores by up to 3.16 points on average. DelTA employs a sentence-by-sentence translation strategy, ensuring no sentence omissions and offering a memory-efficient solution compared to the mainstream method. Furthermore, DelTA improves pronoun translation accuracy, and the summary component of the agent also shows promise as a tool for query-based summarization tasks. We release our code and data at https://github.com/YutongWang1216/DocMTAgent.
Published: 2024

3. LPZero: Language Model Zero-cost Proxy Search from Zero

Author: Dong, Peijie, Li, Lujun, Liu, Xiang, Tang, Zhenheng, Liu, Xuebo, Wang, Qiang, and Chu, Xiaowen
Subjects: Computer Science - Computation and Language
Abstract: In spite of the outstanding performance, Neural Architecture Search (NAS) is criticized for massive computation. Recently, Zero-shot NAS has emerged as a promising approach by exploiting Zero-cost (ZC) proxies, which markedly reduce computational demands. Despite this, existing ZC proxies heavily rely on expert knowledge and incur significant trial-and-error costs. Particularly in NLP tasks, most existing ZC proxies fail to surpass the performance of the naive baseline. To address these challenges, we introduce a novel framework, \textbf{LPZero}, which is the first to automatically design ZC proxies for various tasks, achieving higher ranking consistency than human-designed proxies. Specifically, we model the ZC proxy as a symbolic equation and incorporate a unified proxy search space that encompasses existing ZC proxies, which are composed of a predefined set of mathematical symbols. To heuristically search for the best ZC proxy, LPZero incorporates genetic programming to find the optimal symbolic composition. We propose a \textit{Rule-based Pruning Strategy (RPS),} which preemptively eliminates unpromising proxies, thereby mitigating the risk of proxy degradation. Extensive experiments on FlexiBERT, GPT-2, and LLaMA-7B demonstrate LPZero's superior ranking ability and performance on downstream tasks compared to current approaches., Comment: 8 pages, 7 figures, 10 appendix
Published: 2024

4. Self-Powered LLM Modality Expansion for Large Speech-Text Models

Author: Yu, Tengfei, Liu, Xuebo, Hou, Zhiyi, Ding, Liang, Tao, Dacheng, and Zhang, Min
Subjects: Computer Science - Computation and Language, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Large language models (LLMs) exhibit remarkable performance across diverse tasks, indicating their potential for expansion into large speech-text models (LSMs) by integrating speech capabilities. Although unified speech-text pre-training and multimodal data instruction-tuning offer considerable benefits, these methods generally entail significant resource demands and tend to overfit specific tasks. This study aims to refine the use of speech datasets for LSM training by addressing the limitations of vanilla instruction tuning. We explore the instruction-following dynamics within LSMs, identifying a critical issue termed speech anchor bias-a tendency for LSMs to over-rely on speech inputs, mistakenly interpreting the entire speech modality as directives, thereby neglecting textual instructions. To counteract this bias, we introduce a self-powered LSM that leverages augmented automatic speech recognition data generated by the model itself for more effective instruction tuning. Our experiments across a range of speech-based tasks demonstrate that self-powered LSM mitigates speech anchor bias and improves the fusion of speech and text modalities in LSMs. Data, code and scripts are freely available at https://github.com/ytf-philp/Self-powered-LSM., Comment: Accepted to EMNLP 2024
Published: 2024

5. CommonIT: Commonality-Aware Instruction Tuning for Large Language Models via Data Partitions

Author: Rao, Jun, Liu, Xuebo, Lian, Lian, Cheng, Shengjun, Liao, Yunjie, and Zhang, Min
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: With instruction tuning, Large Language Models (LLMs) can enhance their ability to adhere to commands. Diverging from most works focusing on data mixing, our study concentrates on enhancing the model's capabilities from the perspective of data sampling during training. Drawing inspiration from the human learning process, where it is generally easier to master solutions to similar topics through focused practice on a single type of topic, we introduce a novel instruction tuning strategy termed CommonIT: Commonality-aware Instruction Tuning. Specifically, we cluster instruction datasets into distinct groups with three proposed metrics (Task, Embedding and Length). We ensure each training mini-batch, or "partition", consists solely of data from a single group, which brings about both data randomness across mini-batches and intra-batch data similarity. Rigorous testing on LLaMa models demonstrates CommonIT's effectiveness in enhancing the instruction-following capabilities of LLMs through IT datasets (FLAN, CoT, and Alpaca) and models (LLaMa2-7B, Qwen2-7B, LLaMa 13B, and BLOOM 7B). CommonIT consistently boosts an average improvement of 2.1\% on the general domain (i.e., the average score of Knowledge, Reasoning, Multilinguality and Coding) with the Length metric, and 5.2\% on the special domain (i.e., GSM, Openfunctions and Code) with the Task metric, and 3.8\% on the specific tasks (i.e., MMLU) with the Embedding metric. Code is available at \url{https://github.com/raojay7/CommonIT}., Comment: Accepted to EMNLP 2024
Published: 2024

6. Exploring and Enhancing the Transfer of Distribution in Knowledge Distillation for Autoregressive Language Models

Author: Rao, Jun, Liu, Xuebo, Lin, Zepeng, Ding, Liang, Li, Jing, Tao, Dacheng, and Zhang, Min
Subjects: Computer Science - Computation and Language
Abstract: Knowledge distillation (KD) is a technique that compresses large teacher models by training smaller student models to mimic them. The success of KD in auto-regressive language models mainly relies on Reverse KL for mode-seeking and student-generated output (SGO) to combat exposure bias. Our theoretical analyses and experimental validation reveal that while Reverse KL effectively mimics certain features of the teacher distribution, it fails to capture most of its behaviors. Conversely, SGO incurs higher computational costs and presents challenges in optimization, particularly when the student model is significantly smaller than the teacher model. These constraints are primarily due to the immutable distribution of the teacher model, which fails to adjust adaptively to models of varying sizes. We introduce Online Knowledge Distillation (OKD), where the teacher network integrates small online modules to concurrently train with the student model. This strategy abolishes the necessity for on-policy sampling and merely requires minimal updates to the parameters of the teacher's online module during training, thereby allowing dynamic adaptation to the student's distribution to make distillation better. Extensive results across multiple generation datasets show that OKD achieves or exceeds the performance of leading methods in various model architectures and sizes, reducing training time by up to fourfold.
Published: 2024

7. TasTe: Teaching Large Language Models to Translate through Self-Reflection

Author: Wang, Yutong, Zeng, Jiali, Liu, Xuebo, Meng, Fandong, Zhou, Jie, and Zhang, Min
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Large language models (LLMs) have exhibited remarkable performance in various natural language processing tasks. Techniques like instruction tuning have effectively enhanced the proficiency of LLMs in the downstream task of machine translation. However, the existing approaches fail to yield satisfactory translation outputs that match the quality of supervised neural machine translation (NMT) systems. One plausible explanation for this discrepancy is that the straightforward prompts employed in these methodologies are unable to fully exploit the acquired instruction-following capabilities. To this end, we propose the TasTe framework, which stands for translating through self-reflection. The self-reflection process includes two stages of inference. In the first stage, LLMs are instructed to generate preliminary translations and conduct self-assessments on these translations simultaneously. In the second stage, LLMs are tasked to refine these preliminary translations according to the evaluation results. The evaluation results in four language directions on the WMT22 benchmark reveal the effectiveness of our approach compared to existing methods. Our work presents a promising approach to unleash the potential of LLMs and enhance their capabilities in MT. The codes and datasets are open-sourced at https://github.com/YutongWang1216/ReflectionLLMMT., Comment: This paper has been accepted to the ACL 2024 main conference
Published: 2024

8. Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore

Author: Wu, Junchao, Zhan, Runzhe, Wong, Derek F., Yang, Shu, Liu, Xuebo, Chao, Lidia S., and Zhang, Min
Subjects: Computer Science - Computation and Language
Abstract: The efficacy of an large language model (LLM) generated text detector depends substantially on the availability of sizable training data. White-box zero-shot detectors, which require no such data, are nonetheless limited by the accessibility of the source model of the LLM-generated text. In this paper, we propose an simple but effective black-box zero-shot detection approach, predicated on the observation that human-written texts typically contain more grammatical errors than LLM-generated texts. This approach entails computing the Grammar Error Correction Score (GECScore) for the given text to distinguish between human-written and LLM-generated text. Extensive experimental results show that our method outperforms current state-of-the-art (SOTA) zero-shot and supervised methods, achieving an average AUROC of 98.7% and showing strong robustness against paraphrase and adversarial perturbation attacks.
Published: 2024

9. 3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset

Author: Ma, Xinyu, Liu, Xuebo, Wong, Derek F., Rao, Jun, Li, Bei, Ding, Liang, Chao, Lidia S., Tao, Dacheng, and Zhang, Min
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Multimodal machine translation (MMT) is a challenging task that seeks to improve translation quality by incorporating visual information. However, recent studies have indicated that the visual information provided by existing MMT datasets is insufficient, causing models to disregard it and overestimate their capabilities. This issue presents a significant obstacle to the development of MMT research. This paper presents a novel solution to this issue by introducing 3AM, an ambiguity-aware MMT dataset comprising 26,000 parallel sentence pairs in English and Chinese, each with corresponding images. Our dataset is specifically designed to include more ambiguity and a greater variety of both captions and images than other MMT datasets. We utilize a word sense disambiguation model to select ambiguous data from vision-and-language datasets, resulting in a more challenging dataset. We further benchmark several state-of-the-art MMT models on our proposed dataset. Experimental results show that MMT models trained on our dataset exhibit a greater ability to exploit visual information than those trained on other MMT datasets. Our work provides a valuable resource for researchers in the field of multimodal learning and encourages further exploration in this area. The data, code and scripts are freely available at https://github.com/MaxyLee/3AM.
Published: 2024

10. Improving Attributed Text Generation of Large Language Models via Preference Learning

Author: Li, Dongfang, Sun, Zetian, Hu, Baotian, Liu, Zhenyu, Hu, Xinshuo, Liu, Xuebo, and Zhang, Min
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Large language models have been widely adopted in natural language processing, yet they face the challenge of generating unreliable content. Recent works aim to reduce misinformation and hallucinations by resorting to attribution as a means to provide evidence (i.e., citations). However, current attribution methods usually focus on the retrieval stage and automatic evaluation that neglect mirroring the citation mechanisms in human scholarly writing to bolster credibility. In this paper, we address these challenges by modelling the attribution task as preference learning and introducing an Automatic Preference Optimization (APO) framework. First, we create a curated collection for post-training with 6,330 examples by collecting and filtering from existing datasets. Second, considering the high cost of labelling preference data, we further propose an automatic method to synthesize attribution preference data resulting in 95,263 pairs. Moreover, inspired by the human citation process, we further propose a progressive preference optimization method by leveraging fine-grained information. Extensive experiments on three datasets (i.e., ASQA, StrategyQA, and ELI5) demonstrate that APO achieves state-of-the-art citation F1 with higher answer quality., Comment: 23 pages, 15 tables, 2 figures
Published: 2024

11. SelectIT: Selective Instruction Tuning for Large Language Models via Uncertainty-Aware Self-Reflection

Author: Liu, Liangxin, Liu, Xuebo, Wong, Derek F., Li, Dongfang, Wang, Ziyi, Hu, Baotian, and Zhang, Min
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Instruction tuning (IT) is crucial to tailoring large language models (LLMs) towards human-centric interactions. Recent advancements have shown that the careful selection of a small, high-quality subset of IT data can significantly enhance the performance of LLMs. Despite this, common approaches often rely on additional models or data sets, which increases costs and limits widespread adoption. In this work, we propose a novel approach, termed SelectIT, that capitalizes on the foundational capabilities of the LLM itself. Specifically, we exploit the intrinsic uncertainty present in LLMs to more effectively select high-quality IT data, without the need for extra resources. Furthermore, we introduce a novel IT dataset, the Selective Alpaca, created by applying SelectIT to the Alpaca-GPT4 dataset. Empirical results demonstrate that IT using Selective Alpaca leads to substantial model ability enhancement. The robustness of SelectIT has also been corroborated in various foundation models and domain-specific tasks. Our findings suggest that longer and more computationally intensive IT data may serve as superior sources of IT, offering valuable insights for future research in this area. Data, code, and scripts are freely available at https://github.com/Blue-Raincoat/SelectIT.
Published: 2024

12. DB-LLM: Accurate Dual-Binarization for Efficient LLMs

Author: Chen, Hong, Lv, Chengtao, Ding, Liang, Qin, Haotong, Zhou, Xiabin, Ding, Yifu, Liu, Xuebo, Zhang, Min, Guo, Jinyang, Liu, Xianglong, and Tao, Dacheng
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: Large language models (LLMs) have significantly advanced the field of natural language processing, while the expensive memory and computation consumption impede their practical deployment. Quantization emerges as one of the most effective methods for improving the computational efficiency of LLMs. However, existing ultra-low-bit quantization always causes severe accuracy drops. In this paper, we empirically relieve the micro and macro characteristics of ultra-low bit quantization and present a novel Dual-Binarization method for LLMs, namely DB-LLM. For the micro-level, we take both the accuracy advantage of 2-bit-width and the efficiency advantage of binarization into account, introducing Flexible Dual Binarization (FDB). By splitting 2-bit quantized weights into two independent sets of binaries, FDB ensures the accuracy of representations and introduces flexibility, utilizing the efficient bitwise operations of binarization while retaining the inherent high sparsity of ultra-low bit quantization. For the macro-level, we find the distortion that exists in the prediction of LLM after quantization, which is specified as the deviations related to the ambiguity of samples. We propose the Deviation-Aware Distillation (DAD) method, enabling the model to focus differently on various samples. Comprehensive experiments show that our DB-LLM not only significantly surpasses the current State-of-The-Art (SoTA) in ultra-low bit quantization (eg, perplexity decreased from 9.64 to 7.23), but also achieves an additional 20\% reduction in computational consumption compared to the SOTA method under the same bit-width. Our code will be released soon.
Published: 2024

13. Revisiting Demonstration Selection Strategies in In-Context Learning

Author: Peng, Keqin, Ding, Liang, Yuan, Yancheng, Liu, Xuebo, Zhang, Min, Ouyang, Yuanxin, and Tao, Dacheng
Subjects: Computer Science - Computation and Language
Abstract: Large language models (LLMs) have shown an impressive ability to perform a wide range of tasks using in-context learning (ICL), where a few examples are used to describe a task to the model. However, the performance of ICL varies significantly with the choice of demonstrations, and it is still unclear why this happens or what factors will influence its choice. In this work, we first revisit the factors contributing to this variance from both data and model aspects, and find that the choice of demonstration is both data- and model-dependent. We further proposed a data- and model-dependent demonstration selection method, \textbf{TopK + ConE}, based on the assumption that \textit{the performance of a demonstration positively correlates with its contribution to the model's understanding of the test samples}, resulting in a simple and effective recipe for ICL. Empirically, our method yields consistent improvements in both language understanding and generation tasks with different model scales. Further analyses confirm that, besides the generality and stability under different circumstances, our method provides a unified explanation for the effectiveness of previous methods. Code will be released., Comment: ACL 2024
Published: 2024

14. Clustering Pseudo Language Family in Multilingual Translation Models with Fisher Information Matrix

Author: Ma, Xinyu, Liu, Xuebo, and Zhang, Min
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: In multilingual translation research, the comprehension and utilization of language families are of paramount importance. Nevertheless, clustering languages based solely on their ancestral families can yield suboptimal results due to variations in the datasets employed during the model's training phase. To mitigate this challenge, we introduce an innovative method that leverages the fisher information matrix (FIM) to cluster language families, anchored on the multilingual translation model's characteristics. We hypothesize that language pairs with similar effects on model parameters exhibit a considerable degree of linguistic congruence and should thus be grouped cohesively. This concept has led us to define pseudo language families. We provide an in-depth discussion regarding the inception and application of these pseudo language families. Empirical evaluations reveal that employing these pseudo language families enhances performance over conventional language families in adapting a multilingual translation model to unfamiliar language pairs. The proposed methodology may also be extended to scenarios requiring language similarity measurements. The source code and associated scripts can be accessed at https://github.com/ecoli-hit/PseudoFamily., Comment: Accepted to EMNLP 2023
Published: 2023

15. EvalCrafter: Benchmarking and Evaluating Large Video Generation Models

Author: Liu, Yaofang, Cun, Xiaodong, Liu, Xuebo, Wang, Xintao, Zhang, Yong, Chen, Haoxin, Liu, Yang, Zeng, Tieyong, Chan, Raymond, and Shan, Ying
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The vision and language generative models have been overgrown in recent years. For video generation, various open-sourced models and public-available services have been developed to generate high-quality videos. However, these methods often use a few metrics, e.g., FVD or IS, to evaluate the performance. We argue that it is hard to judge the large conditional generative models from the simple metrics since these models are often trained on very large datasets with multi-aspect abilities. Thus, we propose a novel framework and pipeline for exhaustively evaluating the performance of the generated videos. Our approach involves generating a diverse and comprehensive list of 700 prompts for text-to-video generation, which is based on an analysis of real-world user data and generated with the assistance of a large language model. Then, we evaluate the state-of-the-art video generative models on our carefully designed benchmark, in terms of visual qualities, content qualities, motion qualities, and text-video alignment with 17 well-selected objective metrics. To obtain the final leaderboard of the models, we further fit a series of coefficients to align the objective metrics to the users' opinions. Based on the proposed human alignment method, our final score shows a higher correlation than simply averaging the metrics, showing the effectiveness of the proposed evaluation method., Comment: Technical Report, Project page: https://evalcrafter.github.io/
Published: 2023

16. Holistic Exploration on Universal Decompositional Semantic Parsing: Architecture, Data Augmentation, and LLM Paradigm

Author: Deng, Hexuan, Zhang, Xin, Zhang, Meishan, Liu, Xuebo, and Zhang, Min
Subjects: Computer Science - Computation and Language
Abstract: In this paper, we conduct a holistic exploration of the Universal Decompositional Semantic (UDS) Parsing. We first introduce a cascade model for UDS parsing that decomposes the complex parsing task into semantically appropriate subtasks. Our approach outperforms the prior models, while significantly reducing inference time. We also incorporate syntactic information and further optimized the architecture. Besides, different ways for data augmentation are explored, which further improve the UDS Parsing. Lastly, we conduct experiments to investigate the efficacy of ChatGPT in handling the UDS task, revealing that it excels in attribute parsing but struggles in relation parsing, and using ChatGPT for data augmentation yields suboptimal results. Our code is available at https://github.com/hexuandeng/HExp4UDS., Comment: 12 pages, 7 figures, 3 tables
Published: 2023

17. Pluggable Neural Machine Translation Models via Memory-augmented Adapters

Author: Xu, Yuzhuang, Wang, Shuo, Li, Peng, Liu, Xuebo, Wang, Xiaolong, Liu, Weidong, and Liu, Yang
Subjects: Computer Science - Computation and Language
Abstract: Although neural machine translation (NMT) models perform well in the general domain, it remains rather challenging to control their generation behavior to satisfy the requirement of different users. Given the expensive training cost and the data scarcity challenge of learning a new model from scratch for each user requirement, we propose a memory-augmented adapter to steer pretrained NMT models in a pluggable manner. Specifically, we construct a multi-granular memory based on the user-provided text samples and propose a new adapter architecture to combine the model representations and the retrieved results. We also propose a training strategy using memory dropout to reduce spurious dependencies between the NMT model and the memory. We validate our approach on both style- and domain-specific experiments and the results indicate that our method can outperform several representative pluggable baselines., Comment: Accepted by LREC-COLING 2024
Published: 2023

18. Revisiting Token Dropping Strategy in Efficient BERT Pretraining

Author: Zhong, Qihuang, Ding, Liang, Liu, Juhua, Liu, Xuebo, Zhang, Min, Du, Bo, and Tao, Dacheng
Subjects: Computer Science - Computation and Language
Abstract: Token dropping is a recently-proposed strategy to speed up the pretraining of masked language models, such as BERT, by skipping the computation of a subset of the input tokens at several middle layers. It can effectively reduce the training time without degrading much performance on downstream tasks. However, we empirically find that token dropping is prone to a semantic loss problem and falls short in handling semantic-intense tasks. Motivated by this, we propose a simple yet effective semantic-consistent learning method (ScTD) to improve the token dropping. ScTD aims to encourage the model to learn how to preserve the semantic information in the representation space. Extensive experiments on 12 tasks show that, with the help of our ScTD, token dropping can achieve consistent and significant performance gains across all task types and model sizes. More encouragingly, ScTD saves up to 57% of pretraining time and brings up to +1.56% average improvement over the vanilla token dropping., Comment: Accepted to ACL2023 Main Conference
Published: 2023

19. Can LMs Generalize to Future Data? An Empirical Analysis on Text Summarization

Author: Cheang, Chi Seng, Chan, Hou Pong, Wong, Derek F., Liu, Xuebo, Li, Zhaocong, Sun, Yanming, Liu, Shudong, and Chao, Lidia S.
Subjects: Computer Science - Computation and Language
Abstract: Recent pre-trained language models (PLMs) achieve promising results in existing abstractive summarization datasets. However, existing summarization benchmarks overlap in time with the standard pre-training corpora and finetuning datasets. Hence, the strong performance of PLMs may rely on the parametric knowledge that is memorized during pre-training and fine-tuning. Moreover, the knowledge memorized by PLMs may quickly become outdated, which affects the generalization performance of PLMs on future data. In this work, we propose TempoSum, a novel benchmark that contains data samples from 2010 to 2022, to understand the temporal generalization ability of abstractive summarization models. Through extensive human evaluation, we show that parametric knowledge stored in summarization models significantly affects the faithfulness of the generated summaries on future data. Moreover, existing faithfulness enhancement methods cannot reliably improve the faithfulness of summarization models on future data. Finally, we discuss several recommendations to the research community on how to evaluate and improve the temporal generalization capability of text summarization models., Comment: Accepted at EMNLP 2023
Published: 2023

20. Towards Making the Most of ChatGPT for Machine Translation

Author: Peng, Keqin, Ding, Liang, Zhong, Qihuang, Shen, Li, Liu, Xuebo, Zhang, Min, Ouyang, Yuanxin, and Tao, Dacheng
Subjects: Computer Science - Computation and Language
Abstract: ChatGPT shows remarkable capabilities for machine translation (MT). Several prior studies have shown that it achieves comparable results to commercial systems for high-resource languages, but lags behind in complex tasks, e.g., low-resource and distant-language-pairs translation. However, they usually adopt simple prompts which can not fully elicit the capability of ChatGPT. In this paper, we aim to further mine ChatGPT's translation ability by revisiting several aspects: temperature, task information, and domain information, and correspondingly propose an optimal temperature setting and two (simple but effective) prompts: Task-Specific Prompts (TSP) and Domain-Specific Prompts (DSP). We show that: 1) The performance of ChatGPT depends largely on temperature, and a lower temperature usually can achieve better performance; 2) Emphasizing the task information can further improve ChatGPT's performance, particularly in complex MT tasks; 3) Introducing domain information can elicit ChatGPT's generalization ability and improve its performance in the specific domain; 4) ChatGPT tends to generate hallucinations for non-English-centric MT tasks, which can be partially addressed by our proposed prompts but still need to be highlighted for the MT/NLP community. We also explore the effects of advanced in-context learning strategies and find a (negative but interesting) observation: the powerful chain-of-thought prompt leads to word-by-word translation behavior, thus bringing significant translation degradation., Comment: EMNLP 2023 (findings)
Published: 2023

21. ConsistTL: Modeling Consistency in Transfer Learning for Low-Resource Neural Machine Translation

Author: Li, Zhaocong, Liu, Xuebo, Wong, Derek F., Chao, Lidia S., and Zhang, Min
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Transfer learning is a simple and powerful method that can be used to boost model performance of low-resource neural machine translation (NMT). Existing transfer learning methods for NMT are static, which simply transfer knowledge from a parent model to a child model once via parameter initialization. In this paper, we propose a novel transfer learning method for NMT, namely ConsistTL, which can continuously transfer knowledge from the parent model during the training of the child model. Specifically, for each training instance of the child model, ConsistTL constructs the semantically-equivalent instance for the parent model and encourages prediction consistency between the parent and child for this instance, which is equivalent to the child model learning each instance under the guidance of the parent model. Experimental results on five low-resource NMT tasks demonstrate that ConsistTL results in significant improvements over strong transfer learning baselines, with a gain up to 1.7 BLEU over the existing back-translation model on the widely-used WMT17 Turkish-English benchmark. Further analysis reveals that ConsistTL can improve the inference calibration of the child model. Code and scripts are freely available at https://github.com/NLP2CT/ConsistTL., Comment: Accepted to EMNLP 2022
Published: 2022

22. Improving Simultaneous Machine Translation with Monolingual Data

Author: Deng, Hexuan, Ding, Liang, Liu, Xuebo, Zhang, Meishan, Tao, Dacheng, and Zhang, Min
Subjects: Computer Science - Computation and Language
Abstract: Simultaneous machine translation (SiMT) is usually done via sequence-level knowledge distillation (Seq-KD) from a full-sentence neural machine translation (NMT) model. However, there is still a significant performance gap between NMT and SiMT. In this work, we propose to leverage monolingual data to improve SiMT, which trains a SiMT student on the combination of bilingual data and external monolingual data distilled by Seq-KD. Preliminary experiments on En-Zh and En-Ja news domain corpora demonstrate that monolingual data can significantly improve translation quality (e.g., +3.15 BLEU on En-Zh). Inspired by the behavior of human simultaneous interpreters, we propose a novel monolingual sampling strategy for SiMT, considering both chunk length and monotonicity. Experimental results show that our sampling strategy consistently outperforms the random sampling strategy (and other conventional typical NMT monolingual sampling strategies) by avoiding the key problem of SiMT -- hallucination, and has better scalability. We achieve +0.72 BLEU improvements on average against random sampling on En-Zh and En-Ja. Data and codes can be found at https://github.com/hexuandeng/Mono4SiMT., Comment: Accepted by AAAI 2023. Extended version includes supplementary material. 10 pages, 4 figures, 8 tables
Published: 2022

23. Breaking the Representation Bottleneck of Chinese Characters: Neural Machine Translation with Stroke Sequence Modeling

Author: Wang, Zhijun, Liu, Xuebo, and Zhang, Min
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Existing research generally treats Chinese character as a minimum unit for representation. However, such Chinese character representation will suffer two bottlenecks: 1) Learning bottleneck, the learning cannot benefit from its rich internal features (e.g., radicals and strokes); and 2) Parameter bottleneck, each individual character has to be represented by a unique vector. In this paper, we introduce a novel representation method for Chinese characters to break the bottlenecks, namely StrokeNet, which represents a Chinese character by a Latinized stroke sequence (e.g., "ao1 (concave)" to "ajaie" and "tu1 (convex)" to "aeaqe"). Specifically, StrokeNet maps each stroke to a specific Latin character, thus allowing similar Chinese characters to have similar Latin representations. With the introduction of StrokeNet to neural machine translation (NMT), many powerful but not applicable techniques to non-Latin languages (e.g., shared subword vocabulary learning and ciphertext-based data augmentation) can now be perfectly implemented. Experiments on the widely-used NIST Chinese-English, WMT17 Chinese-English and IWSLT17 Japanese-English NMT tasks show that StrokeNet can provide a significant performance boost over the strong baselines with fewer model parameters, achieving 26.5 BLEU on the WMT17 Chinese-English task which is better than any previously reported results without using monolingual data. Code and scripts are freely available at https://github.com/zjwang21/StrokeNet., Comment: Accepted to EMNLP 2022
Published: 2022

24. Revisiting Grammatical Error Correction Evaluation and Beyond

Author: Gong, Peiyuan, Liu, Xuebo, Huang, Heyan, and Zhang, Min
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Pretraining-based (PT-based) automatic evaluation metrics (e.g., BERTScore and BARTScore) have been widely used in several sentence generation tasks (e.g., machine translation and text summarization) due to their better correlation with human judgments over traditional overlap-based methods. Although PT-based methods have become the de facto standard for training grammatical error correction (GEC) systems, GEC evaluation still does not benefit from pretrained knowledge. This paper takes the first step towards understanding and improving GEC evaluation with pretraining. We first find that arbitrarily applying PT-based metrics to GEC evaluation brings unsatisfactory correlation results because of the excessive attention to inessential systems outputs (e.g., unchanged parts). To alleviate the limitation, we propose a novel GEC evaluation metric to achieve the best of both worlds, namely PT-M2 which only uses PT-based metrics to score those corrected parts. Experimental results on the CoNLL14 evaluation task show that PT-M2 significantly outperforms existing methods, achieving a new state-of-the-art result of 0.949 Pearson correlation. Further analysis reveals that PT-M2 is robust to evaluate competitive GEC systems. Source code and scripts are freely available at https://github.com/pygongnlp/PT-M2., Comment: Accepted to EMNLP 2022
Published: 2022

25. Comparison of the Effects of pH-Shifting, Acetic Acid Modification, and TGase Treatment on the Physicochemical and Functional Properties of Wheat Gluten Protein

Author: Dong, Mengxue, Sun, Yusha, Xiong, Dandan, Song, Qi, jia, Jie, Liu, Xuebo, Sheng, Long, and Duan, Xiang
Published: 2024
Full Text: View/download PDF

26. BLISS: Robust Sequence-to-Sequence Learning via Self-Supervised Input Representation

Author: Zhang, Zheng, Ding, Liang, Cheng, Dazhao, Liu, Xuebo, Zhang, Min, and Tao, Dacheng
Subjects: Computer Science - Computation and Language
Abstract: Data augmentations (DA) are the cores to achieving robust sequence-to-sequence learning on various natural language processing (NLP) tasks. However, most of the DA approaches force the decoder to make predictions conditioned on the perturbed input representation, underutilizing supervised information provided by perturbed input. In this work, we propose a framework-level robust sequence-to-sequence learning approach, named BLISS, via self-supervised input representation, which has the great potential to complement the data-level augmentation approaches. The key idea is to supervise the sequence-to-sequence framework with both the \textit{supervised} ("input$\rightarrow$output") and \textit{self-supervised} ("perturbed input$\rightarrow$input") information. We conduct comprehensive experiments to validate the effectiveness of BLISS on various tasks, including machine translation, grammatical error correction, and text summarization. The results show that BLISS outperforms significantly the vanilla Transformer and consistently works well across tasks than the other five contrastive baselines. Extensive analyses reveal that BLISS learns robust representations and rich linguistic knowledge, confirming our claim. Source code will be released upon publication.
Published: 2022

27. ODE Transformer: An Ordinary Differential Equation-Inspired Model for Sequence Generation

Author: Li, Bei, Du, Quan, Zhou, Tao, Jing, Yi, Zhou, Shuhan, Zeng, Xin, Xiao, Tong, Zhu, JingBo, Liu, Xuebo, and Zhang, Min
Subjects: Computer Science - Computation and Language
Abstract: Residual networks are an Euler discretization of solutions to Ordinary Differential Equations (ODE). This paper explores a deeper relationship between Transformer and numerical ODE methods. We first show that a residual block of layers in Transformer can be described as a higher-order solution to ODE. Inspired by this, we design a new architecture, {\it ODE Transformer}, which is analogous to the Runge-Kutta method that is well motivated in ODE. As a natural extension to Transformer, ODE Transformer is easy to implement and efficient to use. Experimental results on the large-scale machine translation, abstractive summarization, and grammar error correction tasks demonstrate the high genericity of ODE Transformer. It can gain large improvements in model performance over strong baselines (e.g., 30.77 and 44.11 BLEU scores on the WMT'14 English-German and English-French benchmarks) at a slight cost in inference efficiency., Comment: Long paper accepted by ACL2022 main conference. arXiv admin note: substantial text overlap with arXiv:2104.02308
Published: 2022

28. Exploiting the interactions between plant proteins and gut microbiota to promote intestinal health

Author: Jia, Jie, Dell’Olio, Andrea, Izquierdo-Sandoval, David, Capuano, Edoardo, Liu, Xuebo, Duan, Xiang, and Rubert, Josep
Published: 2024
Full Text: View/download PDF

29. Liberated bioactive bound phenolics during in vitro gastrointestinal digestion and colonic fermentation boost the prebiotic effects of triticale insoluble dietary fiber

Author: Hou, Chunyan, Zhao, Lianjia, Ji, Muhua, Yu, Jingjing, Di, Yan, Liu, Qian, Zhang, Zhengmao, Sun, Lijun, Liu, Xuebo, and Wang, Yutang
Published: 2024
Full Text: View/download PDF

30. Simulation of predicting atrial fibrosis in patients with paroxysmal atrial fibrillation during sinus node recovery time in optical imaging

Author: Chen, Zhisong, Liu, Hongwei, Liu, Xuebo, and Song, Haoming
Published: 2024
Full Text: View/download PDF

31. Addition of Cyperus esculentus (tiger nut) milk improved the flavor and gelation properties of set yogurt: The main contribution of volatile constituents, starch and proteins

Author: Liu, Chengcheng, Li, Wenyue, Li, Caixia, Zhang, Xia, Wang, Guidan, Shen, Yajun, Wang, Yutang, Liu, Xuebo, and Sun, Lijun
Published: 2024
Full Text: View/download PDF

32. Identification and functional characterization of differentially expressed circRNAs in high glucose treated endothelial cells: Construction of circRNA-miRNA-mRNA network

Author: Lin, Hao, Yao, Tongqing, Ding, Haoran, Chu, Jiapeng, Yuan, Deqiang, Ping, Fan, Chen, Fei, and Liu, Xuebo
Published: 2024
Full Text: View/download PDF

33. Increasing the pasteurization temperature of liquid egg white by phosphorylation modification: Exploring the mechanism and application

Author: Dong, Mengxue, Sun, Yusha, Cui, Lu, Ye, Qiuyu, Lv, Fengxiao, Han, Wenduo, Pei, Jiawen, Liu, Xuebo, and Duan, Xiang
Published: 2024
Full Text: View/download PDF

34. Synbiotics containing sea buckthorn polysaccharides ameliorate DSS-induced colitis in mice via regulating Th17/Treg homeostasis through intestinal microbiota and their production of BA metabolites and SCFAs

Author: Yuan, Mingyou, Chang, Lili, Gao, Pan, Li, Jing, Lu, Xinyuan, Hua, Mingfang, Li, Xiulian, Liu, Xuebo, and Lan, Ying
Published: 2024
Full Text: View/download PDF

35. Development of zein-based complexes and conjugates with enhanced surface hydrophilicity: Structure, emulsifying, foaming, and antioxidant properties

Author: Li, Yueting, Wan, Ziyan, Zhao, Sheliang, Lu, Hao, McClements, David Julian, Liu, Xuebo, and Liu, Fuguo
Published: 2024
Full Text: View/download PDF

36. Proteins make tea polyphenol EGCG not always develop satisfactory α-glucosidase inhibition: The influences of three proteins on α-glucosidase inhibitory activity of EGCG

Author: Li, Wenyue, Cao, Ruibo, Cao, Junwei, Zhang, Jifan, Yang, Xi, Liu, Xuebo, and Sun, Lijun
Published: 2025
Full Text: View/download PDF

37. Effects of combined hot alkaline and pH-shift treatments on structure and functionality of legume protein-EGCG conjugates: Soybean-, pea-, and chickpea protein-EGCG systems

Author: Zhang, Yifan, Wu, Chang, Shen, Xuelian, McClements, David Julian, Liu, Xuebo, and Liu, Fuguo
Published: 2025
Full Text: View/download PDF

38. Interfacial engineering of Pickering emulsions stabilized by pea protein-alginate microgels for encapsulation of hydrophobic bioactives

Author: Yan, Xiaojia, Peng, Xiaoke, McClements, David Julian, Ma, Cuicui, Liu, Xuebo, and Liu, Fuguo
Published: 2024
Full Text: View/download PDF

39. Variance-Aware Machine Translation Test Sets

Author: Zhan, Runzhe, Liu, Xuebo, Wong, Derek F., and Chao, Lidia S.
Subjects: Computer Science - Computation and Language
Abstract: We release 70 small and discriminative test sets for machine translation (MT) evaluation called variance-aware test sets (VAT), covering 35 translation directions from WMT16 to WMT20 competitions. VAT is automatically created by a novel variance-aware filtering method that filters the indiscriminative test instances of the current MT test sets without any human labor. Experimental results show that VAT outperforms the original WMT test sets in terms of the correlation with human judgement across mainstream language pairs and test sets. Further analysis on the properties of VAT reveals the challenging linguistic features (e.g., translation of low-frequency words and proper nouns) for competitive MT systems, providing guidance for constructing future MT test sets. The test sets and the code for preparing variance-aware MT test sets are freely available at https://github.com/NLP2CT/Variance-Aware-MT-Test-Sets ., Comment: Accepted to NeurIPS 2021 Datasets and Benchmarks Track
Published: 2021

40. On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation

Author: Liu, Xuebo, Wang, Longyue, Wong, Derek F., Ding, Liang, Chao, Lidia S., Shi, Shuming, and Tu, Zhaopeng
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Pre-training (PT) and back-translation (BT) are two simple and powerful methods to utilize monolingual data for improving the model performance of neural machine translation (NMT). This paper takes the first step to investigate the complementarity between PT and BT. We introduce two probing tasks for PT and BT respectively and find that PT mainly contributes to the encoder module while BT brings more benefits to the decoder. Experimental results show that PT and BT are nicely complementary to each other, establishing state-of-the-art performances on the WMT16 English-Romanian and English-Russian benchmarks. Through extensive analyses on sentence originality and word frequency, we also demonstrate that combining Tagged BT with PT is more helpful to their complementarity, leading to better translation quality. Source code is freely available at https://github.com/SunbowLiu/PTvsBT., Comment: Accepted to Findings of EMNLP 2021
Published: 2021

41. Mechanism of enhancing rehydration capacity and foaming property of spray-dried egg white powder by lactic acid fermentation

Author: Jia, Jie, Duan, Jiayi, Jia, Xin, Ge, Junlin, Dong, Mengxue, Liu, Xuebo, Yang, Xiujuan, and Duan, Xiang
Published: 2024
Full Text: View/download PDF

42. Milk casein hydrolysate peptides regulate starch digestion through inhibition of α-glucosidase: An insight into the active oligopeptide screening, enzyme inhibition behaviors, and oligopeptide-enzyme binding interactions

Author: Cao, Ruibo, Li, Wenyue, Zhang, Jifan, Bao, Xingyu, Feng, Haotian, Sun, Jiaqi, Liu, Xuebo, and Sun, Lijun
Published: 2024
Full Text: View/download PDF

43. Enhanced printability of food-grade edible inks: Emulsions formulated with modified pea protein and sodium alginate

Author: Ma, Cuicui, Yan, Jun, Li, Wenhan, Wang, Yutang, McClements, David Julian, Liu, Xuebo, and Liu, Fuguo
Published: 2024
Full Text: View/download PDF

44. Change of physiochemical characteristics, nutritional quality, and volatile compounds of Chenopodium quinoa Willd. during germination

Author: Lan, Yongli, Wang, Xinze, Wang, Lei, Zhang, Wengang, Song, Yujie, Zhao, Shiyang, Yang, Xijuan, and Liu, Xuebo
Published: 2024
Full Text: View/download PDF

45. Chemical acylation of pea protein isolate hydrolysate with fatty acid N-hydroxysuccinimide esters: Effect on structure and functional properties

Author: Xiao, Jing, Niu, Li, Tong, Zongbo, Jin, Shuxiu, Wang, Xiaomei, Liu, Xuebo, Xiao, Chunxia, and Fan, Huafang
Published: 2024
Full Text: View/download PDF

46. Advances in enhancing the mechanical properties of biopolymer hydrogels via multi-strategic approaches

Author: Yan, Xiaojia, Huang, Hechun, Bakry, Amr M., Wu, Wanqiang, Liu, Xuebo, and Liu, Fuguo
Published: 2024
Full Text: View/download PDF

47. High internal phase double emulsions stabilized by modified pea protein-alginate complexes: Application for co-encapsulation of riboflavin and β-carotene

Author: Yan, Xiaojia, Yan, Jun, Shi, Xinyue, Song, Yuying, McClements, David Julian, Ma, Cuicui, Liu, Xuebo, Chen, Shuai, Xu, Duoxia, and Liu, Fuguo
Published: 2024
Full Text: View/download PDF

48. Effects of dietary lipid levels on growth, lipid metabolism, fatty acid composition and antioxidant capacity of juvenile greasyback shrimp (Metapenaeus ensis)

Author: Chen, Chuan, Tian, Haoyu, Liu, Xuebo, Dai, Yishuang, Wen, Xiaobo, Zhao, Huihong, and Wu, Kun
Published: 2024
Full Text: View/download PDF

49. Modulating peppermint oil flavor release properties of emulsion-filled protein gels: Impact of cross-linking method and matrix composition

Author: Li, Siqi, Yan, Jun, Yang, Junhao, Chen, Guipan, McClements, David Julian, Ma, Cuicui, Liu, Xuebo, and Liu, Fuguo
Published: 2024
Full Text: View/download PDF

50. Life's essential 8 metrics and mortality outcomes in insulin resistance: The role of inflammation, vascular aging, and gender

Author: Feng, Yuntao, Lin, Hao, Tan, Hongwei, and Liu, Xuebo
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

1,665 results on '"Liu, Xuebo"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources