Author: "Do, Xuan Long" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Do, Xuan Long"' showing total 18 results

Start Over Author "Do, Xuan Long"

18 results on '"Do, Xuan Long"'

1. ToXCL: A Unified Framework for Toxic Speech Detection and Explanation

Author: Hoang, Nhat M., Do, Xuan Long, Do, Duc Anh, Vu, Duc Anh, and Tuan, Luu Anh
Subjects: Computer Science - Computation and Language, Computer Science - Computers and Society
Abstract: The proliferation of online toxic speech is a pertinent problem posing threats to demographic groups. While explicit toxic speech contains offensive lexical signals, implicit one consists of coded or indirect language. Therefore, it is crucial for models not only to detect implicit toxic speech but also to explain its toxicity. This draws a unique need for unified frameworks that can effectively detect and explain implicit toxic speech. Prior works mainly formulated the task of toxic speech detection and explanation as a text generation problem. Nonetheless, models trained using this strategy can be prone to suffer from the consequent error propagation problem. Moreover, our experiments reveal that the detection results of such models are much lower than those that focus only on the detection task. To bridge these gaps, we introduce ToXCL, a unified framework for the detection and explanation of implicit toxic speech. Our model consists of three modules: a (i) Target Group Generator to generate the targeted demographic group(s) of a given post; an (ii) Encoder-Decoder Model in which the encoder focuses on detecting implicit toxic speech and is boosted by a (iii) Teacher Classifier via knowledge distillation, and the decoder generates the necessary explanation. ToXCL achieves new state-of-the-art effectiveness, and outperforms baselines significantly., Comment: Accepted at NAACL 2024 (Main Conference)
Published: 2024

2. Accelerating Greedy Coordinate Gradient via Probe Sampling

Author: Zhao, Yiran, Zheng, Wenyue, Cai, Tianle, Do, Xuan Long, Kawaguchi, Kenji, Goyal, Anirudh, and Shieh, Michael
Subjects: Computer Science - Computation and Language
Abstract: Safety of Large Language Models (LLMs) has become a critical issue given their rapid progresses. Greedy Coordinate Gradient (GCG) is shown to be effective in constructing adversarial prompts to break the aligned LLMs, but optimization of GCG is time-consuming. To reduce the time cost of GCG and enable more comprehensive studies of LLM safety, in this work, we study a new algorithm called $\texttt{Probe sampling}$. At the core of the algorithm is a mechanism that dynamically determines how similar a smaller draft model's predictions are to the target model's predictions for prompt candidates. When the target model is similar to the draft model, we rely heavily on the draft model to filter out a large number of potential prompt candidates. Probe sampling achieves up to $5.6$ times speedup using Llama2-7b-chat and leads to equal or improved attack success rate (ASR) on the AdvBench. Furthermore, probe sampling is also able to accelerate other prompt optimization techniques and adversarial methods, leading to acceleration of $1.8\times$ for AutoPrompt, $2.4\times$ for APE and $2.4\times$ for AutoDAN.
Published: 2024

3. Do LLMs Work on Charts? Designing Few-Shot Prompts for Chart Question Answering and Summarization

Author: Do, Xuan Long, Hassanpour, Mohammad, Masry, Ahmed, Kavehzadeh, Parsa, Hoque, Enamul, and Joty, Shafiq
Subjects: Computer Science - Computation and Language
Abstract: A number of tasks have been proposed recently to facilitate easy access to charts such as chart QA and summarization. The dominant paradigm to solve these tasks has been to fine-tune a pretrained model on the task data. However, this approach is not only expensive but also not generalizable to unseen tasks. On the other hand, large language models (LLMs) have shown impressive generalization capabilities to unseen tasks with zero- or few-shot prompting. However, their application to chart-related tasks is not trivial as these tasks typically involve considering not only the underlying data but also the visual features in the chart image. We propose PromptChart, a multimodal few-shot prompting framework with LLMs for chart-related applications. By analyzing the tasks carefully, we have come up with a set of prompting guidelines for each task to elicit the best few-shot performance from LLMs. We further propose a strategy to inject visual information into the prompts. Our experiments on three different chart-related information consumption tasks show that with properly designed prompts LLMs can excel on the benchmarks, achieving state-of-the-art., Comment: 23 pages
Published: 2023

4. Prompt Optimization via Adversarial In-Context Learning

Author: Do, Xuan Long, Zhao, Yiran, Brown, Hannah, Xie, Yuxi, Zhao, James Xu, Chen, Nancy F., Kawaguchi, Kenji, Shieh, Michael, and He, Junxian
Subjects: Computer Science - Machine Learning, Computer Science - Computation and Language
Abstract: We propose a new method, Adversarial In-Context Learning (adv-ICL), to optimize prompt for in-context learning (ICL) by employing one LLM as a generator, another as a discriminator, and a third as a prompt modifier. As in traditional adversarial learning, adv-ICL is implemented as a two-player game between the generator and discriminator, where the generator tries to generate realistic enough output to fool the discriminator. In each round, given an input prefixed by task instructions and several exemplars, the generator produces an output. The discriminator is then tasked with classifying the generator input-output pair as model-generated or real data. Based on the discriminator loss, the prompt modifier proposes possible edits to the generator and discriminator prompts, and the edits that most improve the adversarial loss are selected. We show that adv-ICL results in significant improvements over state-of-the-art prompt optimization techniques for both open and closed-source models on 11 generation and classification tasks including summarization, arithmetic reasoning, machine translation, data-to-text generation, and the MMLU and big-bench hard benchmarks. In addition, because our method uses pre-trained models and updates only prompts rather than model parameters, it is computationally efficient, easy to extend to any LLM and task, and effective in low-resource settings., Comment: ACL 2024
Published: 2023

5. ChatGPT as a Math Questioner? Evaluating ChatGPT on Generating Pre-university Math Questions

Author: Van Long, Phuoc Pham, Vu, Duc Anh, Hoang, Nhat M., Do, Xuan Long, and Luu, Anh Tuan
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Mathematical questioning is crucial for assessing students problem-solving skills. Since manually creating such questions requires substantial effort, automatic methods have been explored. Existing state-of-the-art models rely on fine-tuning strategies and struggle to generate questions that heavily involve multiple steps of logical and arithmetic reasoning. Meanwhile, large language models(LLMs) such as ChatGPT have excelled in many NLP tasks involving logical and arithmetic reasoning. Nonetheless, their applications in generating educational questions are underutilized, especially in the field of mathematics. To bridge this gap, we take the first step to conduct an in-depth analysis of ChatGPT in generating pre-university math questions. Our analysis is categorized into two main settings: context-aware and context-unaware. In the context-aware setting, we evaluate ChatGPT on existing math question-answering benchmarks covering elementary, secondary, and ternary classes. In the context-unaware setting, we evaluate ChatGPT in generating math questions for each lesson from pre-university math curriculums that we crawl. Our crawling results in TopicMath, a comprehensive and novel collection of pre-university math curriculums collected from 121 math topics and 428 lessons from elementary, secondary, and tertiary classes. Through this analysis, we aim to provide insight into the potential of ChatGPT as a math questioner., Comment: Accepted at the 39th ACM/SIGAPP Symposium On Applied Computing (SAC 2024), Main Conference
Published: 2023

6. UniChart: A Universal Vision-language Pretrained Model for Chart Comprehension and Reasoning

Author: Masry, Ahmed, Kavehzadeh, Parsa, Do, Xuan Long, Hoque, Enamul, and Joty, Shafiq
Subjects: Computer Science - Computation and Language
Abstract: Charts are very popular for analyzing data, visualizing key insights and answering complex reasoning questions about data. To facilitate chart-based data analysis using natural language, several downstream tasks have been introduced recently such as chart question answering and chart summarization. However, most of the methods that solve these tasks use pretraining on language or vision-language tasks that do not attempt to explicitly model the structure of the charts (e.g., how data is visually encoded and how chart elements are related to each other). To address this, we first build a large corpus of charts covering a wide variety of topics and visual styles. We then present UniChart, a pretrained model for chart comprehension and reasoning. UniChart encodes the relevant text, data, and visual elements of charts and then uses a chart-grounded text decoder to generate the expected output in natural language. We propose several chart-specific pretraining tasks that include: (i) low-level tasks to extract the visual elements (e.g., bars, lines) and data from charts, and (ii) high-level tasks to acquire chart understanding and reasoning skills. We find that pretraining the model on a large corpus with chart-specific low- and high-level tasks followed by finetuning on three down-streaming tasks results in state-of-the-art performance on three downstream tasks.
Published: 2023

7. Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation

Author: Do, Xuan Long, Zou, Bowei, Joty, Shafiq, Tran, Anh Tai, Pan, Liangming, Chen, Nancy F., and Aw, Ai Ti
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Conversational Question Generation (CQG) is a critical task for machines to assist humans in fulfilling their information needs through conversations. The task is generally cast into two different settings: answer-aware and answer-unaware. While the former facilitates the models by exposing the expected answer, the latter is more realistic and receiving growing attentions recently. What-to-ask and how-to-ask are the two main challenges in the answer-unaware setting. To address the first challenge, existing methods mainly select sequential sentences in context as the rationales. We argue that the conversation generated using such naive heuristics may not be natural enough as in reality, the interlocutors often talk about the relevant contents that are not necessarily sequential in context. Additionally, previous methods decide the type of question to be generated (boolean/span-based) implicitly. Modeling the question type explicitly is crucial as the answer, which hints the models to generate a boolean or span-based question, is unavailable. To this end, we present SG-CQG, a two-stage CQG framework. For the what-to-ask stage, a sentence is selected as the rationale from a semantic graph that we construct, and extract the answer span from it. For the how-to-ask stage, a classifier determines the target answer type of the question via two explicit control signals before generating and filtering. In addition, we propose Conv-Distinct, a novel evaluation metric for CQG, to evaluate the diversity of the generated conversation from a context. Compared with the existing answer-unaware CQG models, the proposed SG-CQG achieves state-of-the-art performance., Comment: 17 pages, ACL 2023
Published: 2023

8. Retrieving Multimodal Information for Augmented Generation: A Survey

Author: Zhao, Ruochen, Chen, Hailin, Wang, Weishi, Jiao, Fangkai, Do, Xuan Long, Qin, Chengwei, Ding, Bosheng, Guo, Xiaobao, Li, Minzhi, Li, Xingxuan, and Joty, Shafiq
Subjects: Computer Science - Computation and Language
Abstract: As Large Language Models (LLMs) become popular, there emerged an important trend of using multimodality to augment the LLMs' generation ability, which enables LLMs to better interact with the world. However, there lacks a unified perception of at which stage and how to incorporate different modalities. In this survey, we review methods that assist and augment generative models by retrieving multimodal knowledge, whose formats range from images, codes, tables, graphs, to audio. Such methods offer a promising solution to important concerns such as factuality, reasoning, interpretability, and robustness. By providing an in-depth review, this survey is expected to provide scholars with a deeper understanding of the methods' applications and encourage them to adapt existing techniques to the fast-growing field of LLMs.
Published: 2023

9. xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval

Author: Khan, Mohammad Abdullah Matin, Bari, M Saiful, Do, Xuan Long, Wang, Weishi, Parvez, Md Rizwan, and Joty, Shafiq
Subjects: Computer Science - Computation and Language
Abstract: Recently, pre-trained large language models (LLMs) have shown impressive abilities in generating codes from natural language descriptions, repairing buggy codes, translating codes between languages, and retrieving relevant code segments. However, the evaluation of these models has often been performed in a scattered way on only one or two specific tasks, in a few languages, at a partial granularity (e.g., function) level, and in many cases without proper training data. Even more concerning is that in most cases the evaluation of generated codes has been done in terms of mere lexical overlap with a reference code rather than actual execution. We introduce xCodeEval, the largest executable multilingual multitask benchmark to date consisting of $25$M document-level coding examples ($16.5$B tokens) from about $7.5$K unique problems covering up to $11$ programming languages with execution-level parallelism. It features a total of $7$ tasks involving code understanding, generation, translation and retrieval. xCodeEval adopts an execution-based evaluation and offers a multilingual code execution engine, ExecEval that supports unit test based execution in all the $11$ languages. To address the challenge of balancing the distributions of text-code samples over multiple attributes in validation/test sets, we propose a novel data splitting and a data selection schema based on the geometric mean and graph-theoretic principle. Our experiments with OpenAI's LLMs (zero-shot) and open-LLMs (zero-shot and fine-tuned) on the tasks and languages demonstrate **xCodeEval** to be quite challenging as per the current advancements in language models., Comment: Code & Data available at https://github.com/ntunlp/xCodeEval, https://huggingface.co/datasets/NTU-NLP-sg/xCodeEval. Evaluation framework available at https://github.com/ntunlp/execeval
Published: 2023

10. OpenCQA: Open-ended Question Answering with Charts

Author: Kantharaj, Shankar, Do, Xuan Long, Leong, Rixie Tiffany Ko, Tan, Jia Qing, Hoque, Enamul, and Joty, Shafiq
Subjects: Computer Science - Machine Learning, Computer Science - Computation and Language
Abstract: Charts are very popular to analyze data and convey important insights. People often analyze visualizations to answer open-ended questions that require explanatory answers. Answering such questions are often difficult and time-consuming as it requires a lot of cognitive and perceptual efforts. To address this challenge, we introduce a new task called OpenCQA, where the goal is to answer an open-ended question about a chart with descriptive texts. We present the annotation process and an in-depth analysis of our dataset. We implement and evaluate a set of baselines under three practical settings. In the first setting, a chart and the accompanying article is provided as input to the model. The second setting provides only the relevant paragraph(s) to the chart instead of the entire article, whereas the third setting requires the model to generate an answer solely based on the chart. Our analysis of the results show that the top performing models generally produce fluent and coherent text while they struggle to perform complex logical and arithmetic reasoning.
Published: 2022

11. CoHS-CQG: Context and History Selection for Conversational Question Generation

Author: Do, Xuan Long, Zou, Bowei, Pan, Liangming, Chen, Nancy F., Joty, Shafiq, and Aw, Ai Ti
Subjects: Computer Science - Computation and Language
Abstract: Conversational question generation (CQG) serves as a vital task for machines to assist humans, such as interactive reading comprehension, through conversations. Compared to traditional single-turn question generation (SQG), CQG is more challenging in the sense that the generated question is required not only to be meaningful, but also to align with the occurred conversation history. While previous studies mainly focus on how to model the flow and alignment of the conversation, there has been no thorough study to date on which parts of the context and history are necessary for the model. We argue that shortening the context and history is crucial as it can help the model to optimise more on the conversational alignment property. To this end, we propose CoHS-CQG, a two-stage CQG framework, which adopts a CoHS module to shorten the context and history of the input. In particular, CoHS selects contiguous sentences and history turns according to their relevance scores by a top-p strategy. Our model achieves state-of-the-art performances on CoQA in both the answer-aware and answer-unaware settings., Comment: Accepted by 29th International Conference on Computational Linguistics (COLING 2022)
Published: 2022

12. UniChart: A Universal Vision-language Pretrained Model for Chart Comprehension and Reasoning.

Author: Ahmed Masry, Parsa Kavehzadeh, Do Xuan Long, Enamul Hoque, and Shafiq Joty
Published: 2023
Full Text: View/download PDF

13. Retrieving Multimodal Information for Augmented Generation: A Survey.

Author: Ruochen Zhao, Hailin Chen, Weishi Wang, Fangkai Jiao, Do Xuan Long, Chengwei Qin, Bosheng Ding, Xiaobao Guo, Minzhi Li, Xingxuan Li, and Shafiq Joty
Published: 2023
Full Text: View/download PDF

14. ChartQA: A Benchmark for Question Answering about Charts with Visual and Logical Reasoning.

Author: Ahmed Masry, Do Xuan Long, Jia Qing Tan, Shafiq R. Joty, and Enamul Hoque
Published: 2022
Full Text: View/download PDF

15. ChatGPT as a Math Questioner? Evaluating ChatGPT on Generating Pre-university Math Questions

Author: Pham, Phuoc Van Long, primary, Duc, Anh Vu, additional, Hoang, Nhat Minh, additional, Do, Xuan Long, additional, and Luu, Anh Tuan, additional
Published: 2024
Full Text: View/download PDF

16. ChOiRe: Characterizing and Predicting Human Opinions with Chain of Opinion Reasoning

Author: Do, Xuan Long, Kawaguchi, Kenji, Kan, Min-Yen, Chen, Nancy F., Do, Xuan Long, Kawaguchi, Kenji, Kan, Min-Yen, and Chen, Nancy F.
Abstract: Aligning language models (LMs) with human opinion is challenging yet vital to enhance their grasp of human values, preferences, and beliefs. We present ChOiRe, a four-step framework to predict human opinion which differentially models the user explicit personae (i.e. demographic or ideological attributes) that are manually declared, and implicit personae inferred from user historical opinions. ChOiRe consists of (i) an LM analyzing the user explicit personae to filter out irrelevant attributes; (ii) the LM ranking the implicit persona opinions into a preferential list; (iii) Chain-of-Opinion (CoO) reasoning, where the LM sequentially analyzes the explicit personae and the most relevant implicit personae to perform opinion prediction; (iv) and where ChOiRe executes Step (iii) CoO multiple times with increasingly larger lists of implicit personae to overcome insufficient personae information to infer a final result. ChOiRe achieves new state-of-the-art effectiveness with limited inference calls, improving previous techniques significantly by 3.22%. We also show that ChOiRe Steps (i) and (ii) can significantly better fine-tune opinion-aligned models, by up to 18.44%., Comment: 22 pages
Published: 2023

17. Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation

Author: Do, Xuan Long, primary, Zou, Bowei, additional, Joty, Shafiq, additional, Tai, Tran, additional, Pan, Liangming, additional, Chen, Nancy, additional, and Aw, Ai Ti, additional
Published: 2023
Full Text: View/download PDF

18. OpenCQA: Open-ended Question Answering with Charts

Author: Kantharaj, Shankar, primary, Do, Xuan Long, additional, Leong, Rixie Tiffany, additional, Tan, Jia Qing, additional, Hoque, Enamul, additional, and Joty, Shafiq, additional
Published: 2022
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

18 results on '"Do, Xuan Long"'

1. ToXCL: A Unified Framework for Toxic Speech Detection and Explanation

2. Accelerating Greedy Coordinate Gradient via Probe Sampling

3. Do LLMs Work on Charts? Designing Few-Shot Prompts for Chart Question Answering and Summarization

4. Prompt Optimization via Adversarial In-Context Learning

5. ChatGPT as a Math Questioner? Evaluating ChatGPT on Generating Pre-university Math Questions

6. UniChart: A Universal Vision-language Pretrained Model for Chart Comprehension and Reasoning

7. Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation

8. Retrieving Multimodal Information for Augmented Generation: A Survey

9. xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval

10. OpenCQA: Open-ended Question Answering with Charts

11. CoHS-CQG: Context and History Selection for Conversational Question Generation

12. UniChart: A Universal Vision-language Pretrained Model for Chart Comprehension and Reasoning.

13. Retrieving Multimodal Information for Augmented Generation: A Survey.

14. ChartQA: A Benchmark for Question Answering about Charts with Visual and Logical Reasoning.

15. ChatGPT as a Math Questioner? Evaluating ChatGPT on Generating Pre-university Math Questions

16. ChOiRe: Characterizing and Predicting Human Opinions with Chain of Opinion Reasoning

17. Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation

18. OpenCQA: Open-ended Question Answering with Charts

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

18 results on '"Do, Xuan Long"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources