Author: "Chan, Chi-Min" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Chan, Chi-Min"' showing total 18 results

Start Over Author "Chan, Chi-Min"

18 results on '"Chan, Chi-Min"'

1. EVA: An Embodied World Model for Future Video Anticipation

Author: Chi, Xiaowei, Zhang, Hengyuan, Fan, Chun-Kai, Qi, Xingqun, Zhang, Rongyu, Chen, Anthony, Chan, Chi-min, Xue, Wei, Luo, Wenhan, Zhang, Shanghang, and Guo, Yike
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Multimedia, Computer Science - Robotics
Abstract: World models integrate raw data from various modalities, such as images and language to simulate comprehensive interactions in the world, thereby displaying crucial roles in fields like mixed reality and robotics. Yet, applying the world model for accurate video prediction is quite challenging due to the complex and dynamic intentions of the various scenes in practice. In this paper, inspired by the human rethinking process, we decompose the complex video prediction into four meta-tasks that enable the world model to handle this issue in a more fine-grained manner. Alongside these tasks, we introduce a new benchmark named Embodied Video Anticipation Benchmark (EVA-Bench) to provide a well-rounded evaluation. EVA-Bench focused on evaluating the video prediction ability of human and robot actions, presenting significant challenges for both the language model and the generation model. Targeting embodied video prediction, we propose the Embodied Video Anticipator (EVA), a unified framework aiming at video understanding and generation. EVA integrates a video generation model with a visual language model, effectively combining reasoning capabilities with high-quality generation. Moreover, to enhance the generalization of our framework, we tailor-designed a multi-stage pretraining paradigm that adaptatively ensembles LoRA to produce high-fidelity results. Extensive experiments on EVA-Bench highlight the potential of EVA to significantly improve performance in embodied scenes, paving the way for large-scale pre-trained models in real-world prediction tasks.
Published: 2024

2. HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts

Author: Liu, Xinyu, He, Yingqing, Guo, Lanqing, Li, Xiang, Jin, Bu, Li, Peng, Li, Yan, Chan, Chi-Min, Chen, Qifeng, Xue, Wei, Luo, Wenhan, Liu, Qifeng, and Guo, Yike
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The potential for higher-resolution image generation using pretrained diffusion models is immense, yet these models often struggle with issues of object repetition and structural artifacts especially when scaling to 4K resolution and higher. We figure out that the problem is caused by that, a single prompt for the generation of multiple scales provides insufficient efficacy. In response, we propose HiPrompt, a new tuning-free solution that tackles the above problems by introducing hierarchical prompts. The hierarchical prompts offer both global and local guidance. Specifically, the global guidance comes from the user input that describes the overall content, while the local guidance utilizes patch-wise descriptions from MLLMs to elaborately guide the regional structure and texture generation. Furthermore, during the inverse denoising process, the generated noise is decomposed into low- and high-frequency spatial components. These components are conditioned on multiple prompt levels, including detailed patch-wise descriptions and broader image-level prompts, facilitating prompt-guided denoising under hierarchical semantic guidance. It further allows the generation to focus more on local spatial regions and ensures the generated images maintain coherent local and global semantics, structures, and textures with high definition. Extensive experiments demonstrate that HiPrompt outperforms state-of-the-art works in higher-resolution image generation, significantly reducing object repetition and enhancing structural quality., Comment: https://liuxinyv.github.io/HiPrompt/
Published: 2024

3. AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems

Author: Chan, Chi-Min, Yu, Jianxuan, Chen, Weize, Jiang, Chunyang, Liu, Xinyu, Shi, Weijie, Liu, Zhiyuan, Xue, Wei, and Guo, Yike
Subjects: Computer Science - Computation and Language
Abstract: The rapid advancement of large language models (LLMs) has led to the rise of LLM-based agents. Recent research shows that multi-agent systems (MAS), where each agent plays a specific role, can outperform individual LLMs. However, configuring an MAS for a task remains challenging, with performance only observable post-execution. Inspired by scaling laws in LLM development, we investigate whether MAS performance can be predicted beforehand. We introduce AgentMonitor, a framework that integrates at the agent level to capture inputs and outputs, transforming them into statistics for training a regression model to predict task performance. Additionally, it can further apply real-time corrections to address security risks posed by malicious agents, mitigating negative impacts and enhancing MAS security. Experiments demonstrate that an XGBoost model achieves a Spearman correlation of 0.89 in-domain and 0.58 in more challenging scenarios. Furthermore, using AgentMonitor reduces harmful content by 6.2% and increases helpful content by 1.8% on average, enhancing safety and reliability. Code is available at \url{https://github.com/chanchimin/AgentMonitor}.
Published: 2024

4. Importance Weighting Can Help Large Language Models Self-Improve

Author: Jiang, Chunyang, Chan, Chi-min, Xue, Wei, Liu, Qifeng, and Guo, Yike
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Large language models (LLMs) have shown remarkable capability in numerous tasks and applications. However, fine-tuning LLMs using high-quality datasets under external supervision remains prohibitively expensive. In response, LLM self-improvement approaches have been vibrantly developed recently. The typical paradigm of LLM self-improvement involves training LLM on self-generated data, part of which may be detrimental and should be filtered out due to the unstable data quality. While current works primarily employs filtering strategies based on answer correctness, in this paper, we demonstrate that filtering out correct but with high distribution shift extent (DSE) samples could also benefit the results of self-improvement. Given that the actual sample distribution is usually inaccessible, we propose a new metric called DS weight to approximate DSE, inspired by the Importance Weighting methods. Consequently, we integrate DS weight with self-consistency to comprehensively filter the self-generated samples and fine-tune the language model. Experiments show that with only a tiny valid set (up to 5\% size of the training set) to compute DS weight, our approach can notably promote the reasoning ability of current LLM self-improvement methods. The resulting performance is on par with methods that rely on external supervision from pre-trained reward models.
Published: 2024

5. RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation

Author: Chan, Chi-Min, Xu, Chunpu, Yuan, Ruibin, Luo, Hongyin, Xue, Wei, Guo, Yike, and Fu, Jie
Subjects: Computer Science - Computation and Language
Abstract: Large Language Models (LLMs) exhibit remarkable capabilities but are prone to generating inaccurate or hallucinatory responses. This limitation stems from their reliance on vast pretraining datasets, making them susceptible to errors in unseen scenarios. To tackle these challenges, Retrieval-Augmented Generation (RAG) addresses this by incorporating external, relevant documents into the response generation process, thus leveraging non-parametric knowledge alongside LLMs' in-context learning abilities. However, existing RAG implementations primarily focus on initial input for context retrieval, overlooking the nuances of ambiguous or complex queries that necessitate further clarification or decomposition for accurate responses. To this end, we propose learning to Refine Query for Retrieval Augmented Generation (RQ-RAG) in this paper, endeavoring to enhance the model by equipping it with capabilities for explicit rewriting, decomposition, and disambiguation. Our experimental results indicate that our method, when applied to a 7B Llama2 model, surpasses the previous state-of-the-art (SOTA) by an average of 1.9\% across three single-hop QA datasets, and also demonstrates enhanced performance in handling complex, multi-hop QA datasets. Our code is available at https://github.com/chanchimin/RQ-RAG.
Published: 2024

6. AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors

Author: Chen, Weize, Su, Yusheng, Zuo, Jingwei, Yang, Cheng, Yuan, Chenfei, Chan, Chi-Min, Yu, Heyang, Lu, Yaxi, Hung, Yi-Hsin, Qian, Chen, Qin, Yujia, Cong, Xin, Xie, Ruobing, Liu, Zhiyuan, Sun, Maosong, and Zhou, Jie
Subjects: Computer Science - Computation and Language
Abstract: Autonomous agents empowered by Large Language Models (LLMs) have undergone significant improvements, enabling them to generalize across a broad spectrum of tasks. However, in real-world scenarios, cooperation among individuals is often required to enhance the efficiency and effectiveness of task accomplishment. Hence, inspired by human group dynamics, we propose a multi-agent framework \framework that can collaboratively and dynamically adjust its composition as a greater-than-the-sum-of-its-parts system. Our experiments demonstrate that \framework framework can effectively deploy multi-agent groups that outperform a single agent. Furthermore, we delve into the emergence of social behaviors among individual agents within a group during collaborative task accomplishment. In view of these behaviors, we discuss some possible strategies to leverage positive ones and mitigate negative ones for improving the collaborative potential of multi-agent groups. Our codes for \framework will soon be released at \url{https://github.com/OpenBMB/AgentVerse}., Comment: Under review. Code at https://github.com/OpenBMB/AgentVerse/
Published: 2023

7. ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate

Author: Chan, Chi-Min, Chen, Weize, Su, Yusheng, Yu, Jianxuan, Xue, Wei, Zhang, Shanghang, Fu, Jie, and Liu, Zhiyuan
Subjects: Computer Science - Computation and Language
Abstract: Text evaluation has historically posed significant challenges, often demanding substantial labor and time cost. With the emergence of large language models (LLMs), researchers have explored LLMs' potential as alternatives for human evaluation. While these single-agent-based approaches show promise, experimental results suggest that further advancements are needed to bridge the gap between their current effectiveness and human-level evaluation quality. Recognizing that best practices of human evaluation processes often involve multiple human annotators collaborating in the evaluation, we resort to a multi-agent debate framework, moving beyond single-agent prompting strategies. The multi-agent-based approach enables a group of LLMs to synergize with an array of intelligent counterparts, harnessing their distinct capabilities and expertise to enhance efficiency and effectiveness in handling intricate tasks. In this paper, we construct a multi-agent referee team called ChatEval to autonomously discuss and evaluate the quality of generated responses from different models on open-ended questions and traditional natural language generation (NLG) tasks. Our analysis shows that ChatEval transcends mere textual scoring, offering a human-mimicking evaluation process for reliable assessments. Our code is available at https://github.com/chanchimin/ChatEval.
Published: 2023

8. Exploring the Impact of Model Scaling on Parameter-Efficient Tuning

Author: Su, Yusheng, Chan, Chi-Min, Cheng, Jiali, Qin, Yujia, Lin, Yankai, Hu, Shengding, Yang, Zonghan, Ding, Ning, Sun, Xingzhi, Xie, Guotong, Liu, Zhiyuan, and Sun, Maosong
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Parameter-efficient tuning (PET) methods can effectively drive extremely large pre-trained language models (PLMs) by training only minimal parameters. Different PET methods utilize different manually designed tunable modules. In small PLMs, there are usually noticeable performance differences among PET methods. Nevertheless, as the model scale increases, the performance differences become marginal. Hence, we hypothesize that model scaling mitigates the impact of design differences on PET methods. To investigate this hypothesis, we introduce a more flexible PET method called Arbitrary PET (APET) method. The APET method is compatible with a tunable module, which consists of any number of parameters distributed in arbitrary positions. Then, we utilize it and conduct experiments on 11 NLP tasks across 3 representative PLMs. Our investigations reveal that model scaling (1) mitigates the effects of the positions of tunable parameters on performance, and (2) enables tuning methods to achieve performance comparable to full-parameter fine-tuning by optimizing fewer tunable parameters. Intriguingly, we also observe that tuning methods optimize the similar number of tunable parameters to exceed random guess performance on different tasks. We collectively discuss this phenomenon and the two aforementioned findings from an optimization perspective to understand the underlying mechanisms. These conclusions enhance our understanding of the impact of model scaling on PET and assist in designing more effective and efficient PET methods for PLMs of different scales. The source code can be obtained from this GitHub repository: \url{https://github.com/yushengsu-thu/PET_Scaling}.
Published: 2023
Full Text: View/download PDF

9. Plug-and-Play Document Modules for Pre-trained Models

Author: Xiao, Chaojun, Zhang, Zhengyan, Han, Xu, Chan, Chi-Min, Lin, Yankai, Liu, Zhiyuan, Li, Xiangyang, Li, Zhonghua, Cao, Zhao, and Sun, Maosong
Subjects: Computer Science - Computation and Language
Abstract: Large-scale pre-trained models (PTMs) have been widely used in document-oriented NLP tasks, such as question answering. However, the encoding-task coupling requirement results in the repeated encoding of the same documents for different tasks and queries, which is highly computationally inefficient. To this end, we target to decouple document encoding from downstream tasks, and propose to represent each document as a plug-and-play document module, i.e., a document plugin, for PTMs (PlugD). By inserting document plugins into the backbone PTM for downstream tasks, we can encode a document one time to handle multiple tasks, which is more efficient than conventional encoding-task coupling methods that simultaneously encode documents and input queries using task-specific encoders. Extensive experiments on 8 datasets of 4 typical NLP tasks show that PlugD enables models to encode documents once and for all across different scenarios. Especially, PlugD can save $69\%$ computational costs while achieving comparable performance to state-of-the-art encoding-task coupling methods. Additionally, we show that PlugD can serve as an effective post-processing way to inject knowledge into task-specific models, improving model performance without any additional model training., Comment: Accepted by ACL 2023
Published: 2023

10. Delta Tuning: A Comprehensive Study of Parameter Efficient Methods for Pre-trained Language Models

Author: Ding, Ning, Qin, Yujia, Yang, Guang, Wei, Fuchao, Yang, Zonghan, Su, Yusheng, Hu, Shengding, Chen, Yulin, Chan, Chi-Min, Chen, Weize, Yi, Jing, Zhao, Weilin, Wang, Xiaozhi, Liu, Zhiyuan, Zheng, Hai-Tao, Chen, Jianfei, Liu, Yang, Tang, Jie, Li, Juanzi, and Sun, Maosong
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Despite the success, the process of fine-tuning large-scale PLMs brings prohibitive adaptation costs. In fact, fine-tuning all the parameters of a colossal model and retaining separate instances for different tasks are practically infeasible. This necessitates a new branch of research focusing on the parameter-efficient adaptation of PLMs, dubbed as delta tuning in this paper. In contrast with the standard fine-tuning, delta tuning only fine-tunes a small portion of the model parameters while keeping the rest untouched, largely reducing both the computation and storage costs. Recent studies have demonstrated that a series of delta tuning methods with distinct tuned parameter selection could achieve performance on a par with full-parameter fine-tuning, suggesting a new promising way of stimulating large-scale PLMs. In this paper, we first formally describe the problem of delta tuning and then comprehensively review recent delta tuning approaches. We also propose a unified categorization criterion that divide existing delta tuning methods into three groups: addition-based, specification-based, and reparameterization-based methods. Though initially proposed as an efficient method to steer large models, we believe that some of the fascinating evidence discovered along with delta tuning could help further reveal the mechanisms of PLMs and even deep neural networks. To this end, we discuss the theoretical principles underlying the effectiveness of delta tuning and propose frameworks to interpret delta tuning from the perspective of optimization and optimal control, respectively. Furthermore, we provide a holistic empirical study of representative methods, where results on over 100 NLP tasks demonstrate a comprehensive performance comparison of different approaches. The experimental results also cover the analysis of combinatorial, scaling and transferable properties of delta tuning., Comment: 49 pages
Published: 2022

11. On Transferability of Prompt Tuning for Natural Language Processing

Author: Su, Yusheng, Wang, Xiaozhi, Qin, Yujia, Chan, Chi-Min, Lin, Yankai, Wang, Huadong, Wen, Kaiyue, Liu, Zhiyuan, Li, Peng, Li, Juanzi, Hou, Lei, Sun, Maosong, and Zhou, Jie
Subjects: Computer Science - Computation and Language
Abstract: Prompt tuning (PT) is a promising parameter-efficient method to utilize extremely large pre-trained language models (PLMs), which can achieve comparable performance to full-parameter fine-tuning by only tuning a few soft prompts. However, PT requires much more training time than fine-tuning. Intuitively, knowledge transfer can help to improve the efficiency. To explore whether we can improve PT via prompt transfer, we empirically investigate the transferability of soft prompts across different downstream tasks and PLMs in this work. We find that (1) in zero-shot setting, trained soft prompts can effectively transfer to similar tasks on the same PLM and also to other PLMs with a cross-model projector trained on similar tasks; (2) when used as initialization, trained soft prompts of similar tasks and projected prompts of other PLMs can significantly accelerate training and also improve the performance of PT. Moreover, to explore what decides prompt transferability, we investigate various transferability indicators and find that the overlapping rate of activated neurons strongly reflects the transferability, which suggests how the prompts stimulate PLMs is essential. Our findings show that prompt transfer is promising for improving PT, and further research shall focus more on prompts' stimulation to PLMs. The source code can be obtained from https://github.com/thunlp/Prompt-Transferability.
Published: 2021
Full Text: View/download PDF

12. Parameter-efficient fine-tuning of large-scale pre-trained language models

Author: Ding, Ning, Qin, Yujia, Yang, Guang, Wei, Fuchao, Yang, Zonghan, Su, Yusheng, Hu, Shengding, Chen, Yulin, Chan, Chi-Min, Chen, Weize, Yi, Jing, Zhao, Weilin, Wang, Xiaozhi, Liu, Zhiyuan, Zheng, Hai-Tao, Chen, Jianfei, Liu, Yang, Tang, Jie, Li, Juanzi, and Sun, Maosong
Published: 2023
Full Text: View/download PDF

13. Arbitrary Few Parameters are Good Enough for Adapting Large-scale Pre-trained Language Models

Author: Su, Yusheng, Chan, Chi-Min, Cheng, Jiali, Qin, Yujia, Lin, Yankai, Hu, Shengding, Yang, Zonghan, Ding, Ning, Liu, Zhiyuan, and Sun, Maosong
Subjects: FOS: Computer and information sciences, Artificial Intelligence (cs.AI), Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computation and Language (cs.CL)
Abstract: Parameter-efficient tuning (PET) methods can effectively drive extremely large pre-trained language models (PLMs) by only training minimal parameters. Different PET methods utilize different manually designed modules. In a small PLM, there are usually noticeable performance differences among PET methods. Nevertheless, when a PLM's scale grows up to tens of billions of parameters, all PET methods achieve almost the same performance and even perform on par with the full-parameter fine-tuning method. Hence, we hypothesize that model scaling can mitigate the design differences (the module structures and the number of trainable parameters) among PET methods. To study this hypothesis, we introduce a more flexible PET method - arbitrary PET (APET) method - to be compatible with arbitrary module structures and any number of trainable parameters. Then, we experiment on $11$ NLP tasks of $5$ types and $2$ representative PLMs. From our investigations, we find that the model scaling (1) mitigates the effects of the arbitrary module structure on the performance of tuning methods, and (2) enables the tuning methods to optimize fewer parameters to achieve the full-parameter fine-tuning performance. Intriguingly, we also observe that all tuning methods require almost the same number of trainable parameters to drive PLMs. We discuss this phenomenon and the above two findings collectively from optimization perspectives to fathom the mechanisms behind them. These conclusions not only demonstrate the positive impact of model scaling on tuning methods but disclose its mechanisms, which help us design more effective and efficient tuning methods on larger-scale PLMs.
Published: 2023

14. Exploring the Impact of Model Scaling on Parameter-Efficient Tuning

Author: Su, Yusheng, primary, Chan, Chi-Min, additional, Cheng, Jiali, additional, Qin, Yujia, additional, Lin, Yankai, additional, Hu, Shengding, additional, Yang, Zonghan, additional, Ding, Ning, additional, Sun, Xingzhi, additional, Xie, Guotong, additional, Liu, Zhiyuan, additional, and Sun, Maosong, additional
Published: 2023
Full Text: View/download PDF

15. Plug-and-Play Document Modules for Pre-trained Models

Author: Xiao, Chaojun, primary, Zhang, Zhengyan, additional, Han, Xu, additional, Chan, Chi-Min, additional, Lin, Yankai, additional, Liu, Zhiyuan, additional, Li, Xiangyang, additional, Li, Zhonghua, additional, Cao, Zhao, additional, and Sun, Maosong, additional
Published: 2023
Full Text: View/download PDF

16. Delta Tuning: A Comprehensive Study of Parameter Efficient Methods for Pre-trained Language Models

Author: Ding, Ning, primary, Qin, Yujia, additional, Yang, Guang, additional, Wei, Fuchao, additional, Yang, Zonghan, additional, Su, Yusheng, additional, Hu, Shengding, additional, Chen, Yulin, additional, Chan, Chi-Min, additional, Chen, Weize, additional, Yi, Jing, additional, Zhao, Weilin, additional, Wang, Xiaozhi, additional, Liu, Zhiyuan, additional, Zheng, Hai-Tao, additional, Chen, Jianfei, additional, Liu, Yang, additional, Tang, Jie, additional, Li, Juanzi, additional, and Sun, Maosong, additional
Published: 2022
Full Text: View/download PDF

17. On Transferability of Prompt Tuning for Natural Language Processing

Author: Su, Yusheng, primary, Wang, Xiaozhi, additional, Qin, Yujia, additional, Chan, Chi-Min, additional, Lin, Yankai, additional, Wang, Huadong, additional, Wen, Kaiyue, additional, Liu, Zhiyuan, additional, Li, Peng, additional, Li, Juanzi, additional, Hou, Lei, additional, Sun, Maosong, additional, and Zhou, Jie, additional
Published: 2022
Full Text: View/download PDF

18. Scapular Fracture Complicating Suprascapular Neuropathy: The Role of Computed Tomography with 3D Reconstruction

Author: Chan, Chi-Min, Chung, Chin-Teng, and Lan, Howard Haw-Chung
Published: 2009
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

18 results on '"Chan, Chi-Min"'

1. EVA: An Embodied World Model for Future Video Anticipation

2. HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts

3. AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems

4. Importance Weighting Can Help Large Language Models Self-Improve

5. RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation

6. AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors

7. ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate

8. Exploring the Impact of Model Scaling on Parameter-Efficient Tuning

9. Plug-and-Play Document Modules for Pre-trained Models

10. Delta Tuning: A Comprehensive Study of Parameter Efficient Methods for Pre-trained Language Models

11. On Transferability of Prompt Tuning for Natural Language Processing

12. Parameter-efficient fine-tuning of large-scale pre-trained language models

13. Arbitrary Few Parameters are Good Enough for Adapting Large-scale Pre-trained Language Models

14. Exploring the Impact of Model Scaling on Parameter-Efficient Tuning

15. Plug-and-Play Document Modules for Pre-trained Models

16. Delta Tuning: A Comprehensive Study of Parameter Efficient Methods for Pre-trained Language Models

17. On Transferability of Prompt Tuning for Natural Language Processing

18. Scapular Fracture Complicating Suprascapular Neuropathy: The Role of Computed Tomography with 3D Reconstruction

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

18 results on '"Chan, Chi-Min"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources