Author: "Ma, Wentao" / Publication Type: Reports - Searchworks@Jio Institute Digital Library Search Results

1. FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents

Author: Xiao, Ruixuan, Ma, Wentao, Wang, Ke, Wu, Yuchuan, Zhao, Junbo, Wang, Haobo, Huang, Fei, and Li, Yongbin
Subjects: Computer Science - Computation and Language
Abstract: LLM-based agents have emerged as promising tools, which are crafted to fulfill complex tasks by iterative planning and action. However, these agents are susceptible to undesired planning hallucinations when lacking specific knowledge for expertise-intensive tasks. To address this, preliminary attempts are made to enhance planning reliability by incorporating external workflow-related knowledge. Despite the promise, such infused knowledge is mostly disorganized and diverse in formats, lacking rigorous formalization and comprehensive comparisons. Motivated by this, we formalize different formats of workflow knowledge and present FlowBench, the first benchmark for workflow-guided planning. FlowBench covers 51 different scenarios from 6 domains, with knowledge presented in diverse formats. To assess different LLMs on FlowBench, we design a multi-tiered evaluation framework. We evaluate the efficacy of workflow knowledge across multiple formats, and the results indicate that current LLM agents need considerable improvements for satisfactory planning. We hope that our challenging benchmark can pave the way for future agent planning research.
Published: 2024

2. PTA: Enhancing Multimodal Sentiment Analysis through Pipelined Prediction and Translation-based Alignment

Author: Song, Shezheng, Li, Shasha, Zhao, Shan, Wang, Chengyu, Li, Xiaopeng, Yu, Jie, Wan, Qian, Ma, Jun, Yan, Tianwei, Ma, Wentao, and Mao, Xiaoguang
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Multimedia
Abstract: Multimodal aspect-based sentiment analysis (MABSA) aims to understand opinions in a granular manner, advancing human-computer interaction and other fields. Traditionally, MABSA methods use a joint prediction approach to identify aspects and sentiments simultaneously. However, we argue that joint models are not always superior. Our analysis shows that joint models struggle to align relevant text tokens with image patches, leading to misalignment and ineffective image utilization. In contrast, a pipeline framework first identifies aspects through MATE (Multimodal Aspect Term Extraction) and then aligns these aspects with image patches for sentiment classification (MASC: Multimodal Aspect-Oriented Sentiment Classification). This method is better suited for multimodal scenarios where effective image use is crucial. We present three key observations: (a) MATE and MASC have different feature requirements, with MATE focusing on token-level features and MASC on sequence-level features; (b) the aspect identified by MATE is crucial for effective image utilization; and (c) images play a trivial role in previous MABSA methods due to high noise. Based on these observations, we propose a pipeline framework that first predicts the aspect and then uses translation-based alignment (TBA) to enhance multimodal semantic consistency for better image utilization. Our method achieves state-of-the-art (SOTA) performance on widely used MABSA datasets Twitter-15 and Twitter-17. This demonstrates the effectiveness of the pipeline approach and its potential to provide valuable insights for future MABSA research. For reproducibility, the code and checkpoint will be released., Comment: Code will be released upon publication
Published: 2024

3. Self-Explanation Prompting Improves Dialogue Understanding in Large Language Models

Author: Gao, Haoyu, Lin, Ting-En, Li, Hangyu, Yang, Min, Wu, Yuchuan, Ma, Wentao, and Li, Yongbin
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Task-oriented dialogue (TOD) systems facilitate users in executing various activities via multi-turn dialogues, but Large Language Models (LLMs) often struggle to comprehend these intricate contexts. In this study, we propose a novel "Self-Explanation" prompting strategy to enhance the comprehension abilities of LLMs in multi-turn dialogues. This task-agnostic approach requires the model to analyze each dialogue utterance before task execution, thereby improving performance across various dialogue-centric tasks. Experimental results from six benchmark datasets confirm that our method consistently outperforms other zero-shot prompts and matches or exceeds the efficacy of few-shot prompts, demonstrating its potential as a powerful tool in enhancing LLMs' comprehension in complex dialogue tasks.
Published: 2023

4. UniPCM: Universal Pre-trained Conversation Model with Task-aware Automatic Prompt

Author: Cai, Yucheng, Ma, Wentao, Wu, Yuchuan, Si, Shuzheng, Shao, Yuan, Ou, Zhijian, and Li, Yongbin
Subjects: Computer Science - Computation and Language
Abstract: Recent research has shown that multi-task pre-training greatly improves the model's robustness and transfer ability, which is crucial for building a high-quality dialog system. However, most previous works on multi-task pre-training rely heavily on human-defined input format or prompt, which is not optimal in quality and quantity. In this work, we propose to use Task-based Automatic Prompt generation (TAP) to automatically generate high-quality prompts. Using the high-quality prompts generated, we scale the corpus of the pre-trained conversation model to 122 datasets from 15 dialog-related tasks, resulting in Universal Pre-trained Conversation Model (UniPCM), a powerful foundation model for various conversational tasks and different dialog systems. Extensive experiments have shown that UniPCM is robust to input prompts and capable of various dialog-related tasks. Moreover, UniPCM has strong transfer ability and excels at low resource scenarios, achieving SOTA results on 9 different datasets ranging from task-oriented dialog to open-domain conversation. Furthermore, we are amazed to find that TAP can generate prompts on par with those collected with crowdsourcing. The code is released with the paper.
Published: 2023

5. Long-Pulse Laser-Induced Cavitation: A Race Between Advection and Phase Transition

Author: Zhao, Xuning, Ma, Wentao, Chen, Junqin, Xiang, Gaoming, Zhong, Pei, and Wang, Kevin
Subjects: Physics - Fluid Dynamics
Abstract: Vapor bubbles generated by long-pulsed laser often have complex non-spherical shapes that reflect some characteristics (e.g., direction, width) of the laser beam. The transition between two commonly observed shapes - namely, a rounded pear-like shape and an elongated conical shape - is studied using a new computational model that combines compressible multiphase fluid dynamics with laser radiation and phase transition. Two laboratory experiments are simulated, in which Holmium:YAG and Thulium fiber lasers are used separately to generate bubbles of different shapes. In both cases, the bubble morphology predicted by the simulation agrees reasonably well with the experimental measurement. The simulated laser radiance, temperature, velocity, and pressure fields are analyzed to explain bubble dynamics and energy transmission. It is found that due to the lasting energy input (i.e. long-pulsed laser), the vapor bubble's dynamics is driven not only by advection, but also by the continuation of vaporization. Notably, vaporization lasts less than 1 microsecond in the case of the pear-shaped bubble, versus more than 50 microseconds for the elongated bubble. It is hypothesized that the bubble's shape is the result of a competition. When the speed of advection is higher than that of vaporization, the bubble tends to grow spherically. Otherwise, it elongates along the laser beam direction. To clarify and test this hypothesis, the two speeds are defined analytically using a simplified model, then estimated for the experiments using simulation results. The results support the hypothesis. They also suggest that a higher laser absorption coefficient and a narrower beam facilitate bubble elongation.
Published: 2023

6. Data-driven Approximation of Distributionally Robust Chance Constraints using Bayesian Credible Intervals

Author: Chen, Zhiping, Ma, Wentao, and Ji, Bingbing
Subjects: Mathematics - Optimization and Control
Abstract: The non-convexity and intractability of distributionally robust chance constraints make them challenging to cope with. From a data-driven perspective, we propose formulating it as a robust optimization problem to ensure that the distributionally robust chance constraint is satisfied with high probability. To incorporate available data and prior distribution knowledge, we construct ambiguity sets for the distributionally robust chance constraint using Bayesian credible intervals. We establish the congruent relationship between the ambiguity set in Bayesian distributionally robust chance constraints and the uncertainty set in a specific robust optimization. In contrast to most existent uncertainty set construction methods which are only applicable for particular settings, our approach provides a unified framework for constructing uncertainty sets under different marginal distribution assumptions, thus making it more flexible and widely applicable. Additionally, under the concavity assumption, our method provides strong finite sample probability guarantees for optimal solutions. The practicality and effectiveness of our approach are illustrated with numerical experiments on portfolio management and queuing system problems. Overall, our approach offers a promising solution to distributionally robust chance constrained problems and has potential applications in other fields.
Published: 2023

7. SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue Agents

Author: Si, Shuzheng, Ma, Wentao, Gao, Haoyu, Wu, Yuchuan, Lin, Ting-En, Dai, Yinpei, Li, Hangyu, Yan, Rui, Huang, Fei, and Li, Yongbin
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Task-oriented dialogue (TOD) models have made significant progress in recent years. However, previous studies primarily focus on datasets written by annotators, which has resulted in a gap between academic research and real-world spoken conversation scenarios. While several small-scale spoken TOD datasets are proposed to address robustness issues such as ASR errors, they ignore the unique challenges in spoken conversation. To tackle the limitations, we introduce SpokenWOZ, a large-scale speech-text dataset for spoken TOD, containing 8 domains, 203k turns, 5.7k dialogues and 249 hours of audios from human-to-human spoken conversations. SpokenWOZ further incorporates common spoken characteristics such as word-by-word processing and reasoning in spoken language. Based on these characteristics, we present cross-turn slot and reasoning slot detection as new challenges. We conduct experiments on various baselines, including text-modal models, newly proposed dual-modal models, and LLMs, e.g., ChatGPT. The results show that the current models still have substantial room for improvement in spoken conversation, where the most advanced dialogue state tracker only achieves 25.65% in joint goal accuracy and the SOTA end-to-end model only correctly completes the user request in 52.1% of dialogues. The dataset, code, and leaderboard are available: https://spokenwoz.github.io/., Comment: NeurIPS 2023
Published: 2023

8. Speech-Text Dialog Pre-training for Spoken Dialog Understanding with Explicit Cross-Modal Alignment

Author: Yu, Tianshu, Gao, Haoyu, Lin, Ting-En, Yang, Min, Wu, Yuchuan, Ma, Wentao, Wang, Chao, Huang, Fei, and Li, Yongbin
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Recently, speech-text pre-training methods have shown remarkable success in many speech and natural language processing tasks. However, most previous pre-trained models are usually tailored for one or two specific tasks, but fail to conquer a wide range of speech-text tasks. In addition, existing speech-text pre-training methods fail to explore the contextual information within a dialogue to enrich utterance representations. In this paper, we propose Speech-text dialog Pre-training for spoken dialog understanding with ExpliCiT cRoss-Modal Alignment (SPECTRA), which is the first-ever speech-text dialog pre-training model. Concretely, to consider the temporality of speech modality, we design a novel temporal position prediction task to capture the speech-text alignment. This pre-training task aims to predict the start and end time of each textual word in the corresponding speech waveform. In addition, to learn the characteristics of spoken dialogs, we generalize a response selection task from textual dialog pre-training to speech-text dialog pre-training scenarios. Experimental results on four different downstream speech-text tasks demonstrate the superiority of SPECTRA in learning speech-text alignment and multi-turn dialog context., Comment: Accepted at ACL 2023 main conference
Published: 2023

9. Gate Recurrent Unit Network based on Hilbert-Schmidt Independence Criterion for State-of-Health Estimation

Author: Huang, Ziyue, Dang, Lujuan, Xie, Yuqing, Ma, Wentao, and Chen, Badong
Subjects: Computer Science - Machine Learning, Computer Science - Human-Computer Interaction, Computer Science - Neural and Evolutionary Computing
Abstract: State-of-health (SOH) estimation is a key step in ensuring the safe and reliable operation of batteries. Due to issues such as varying data distribution and sequence length in different cycles, most existing methods require health feature extraction technique, which can be time-consuming and labor-intensive. GRU can well solve this problem due to the simple structure and superior performance, receiving widespread attentions. However, redundant information still exists within the network and impacts the accuracy of SOH estimation. To address this issue, a new GRU network based on Hilbert-Schmidt Independence Criterion (GRU-HSIC) is proposed. First, a zero masking network is used to transform all battery data measured with varying lengths every cycle into sequences of the same length, while still retaining information about the original data size in each cycle. Second, the Hilbert-Schmidt Independence Criterion (HSIC) bottleneck, which evolved from Information Bottleneck (IB) theory, is extended to GRU to compress the information from hidden layers. To evaluate the proposed method, we conducted experiments on datasets from the Center for Advanced Life Cycle Engineering (CALCE) of the University of Maryland and NASA Ames Prognostics Center of Excellence. Experimental results demonstrate that our model achieves higher accuracy than other recurrent models.
Published: 2023

10. Fluid-Solid Coupled Simulation of Hypervelocity Impact and Plasma Formation

Author: Islam, Shafquat T., Ma, Wentao, Michopoulos, John G., and Wang, Kevin
Subjects: Physics - Plasma Physics
Abstract: The generation of plasma from hypervelocity impacts is an active research topic due to its important science and engineering ramifications in various applications. Previous studies have mainly focused on the ionization of the solid materials that constitute the projectile and the target. In this letter, we consider impact events that occur in a fluid (e.g.,~gas) medium, and present a multiphysics computational modeling approach and associated analysis to predict the behavior of the dynamic fluid-solid interaction that causes the surrounding fluid to ionize. The proposed computational framework is applied to a specific case involving a system of three interacting domains: a copper rod projectile impacting onto a soda lime glass target in a neon gas environment. The impact velocity is varied between 3 km/s and 6 km/s in different simulations. The computational model couples the compressible inviscid Navier-Stokes equations with the Saha ionization equations. The three material interfaces formed among the projectile, the target, and the ambient gas are tracked implicitly by solving two level set equations that share the same velocity field. The mass, momentum, and energy fluxes across the interfaces are computed using the FInite Volume method with Exact two-material Riemann problems (FIVER). The simulation result reveals a region of neon gas with high velocity, temperature, pressure, and mass density, formed in the early stage of the impact mainly due to the hypersonic compression of the fluid between the projectile and the target. For impact velocities higher than 4 km/s, ionization is predicted in this region.
Published: 2023

11. Efficient Solution of Bimaterial Riemann Problems for Compressible Multi-Material Flow Simulations

Author: Ma, Wentao, Zhao, Xuning, Islam, Shafquat, Narkhede, Aditya, and Wang, Kevin
Subjects: Physics - Computational Physics, Mathematics - Numerical Analysis, Physics - Fluid Dynamics
Abstract: When solving compressible multi-material flow problems, an unresolved challenge is the computation of advective fluxes across material interfaces that separate drastically different thermodynamic states and relations. A popular idea in this regard is to locally construct bimaterial Riemann problems, and to apply their exact solutions in flux computation. For general equations of state, however, finding the exact solution of a Riemann problem is expensive as it requires nested loops. Multiplied by the large number of Riemann problems constructed during a simulation, the computational cost often becomes prohibitive. The work presented in this paper aims to accelerate the solution of bimaterial Riemann problems without introducing approximations or offline precomputation tasks. The basic idea is to exploit some special properties of the Riemann problem equations, and to recycle previous solutions as much as possible. Following this idea, four acceleration methods are developed, including (1) a change of integration variable through rarefaction fans, (2) storing and reusing integration trajectory data, (3) step size adaptation, and (4) constructing an R-tree on the fly to generate initial guesses. The performance of these acceleration methods are assessed using four example problems in underwater explosion, laser-induced cavitation, and hypervelocity impact. These problems exhibit strong shock waves, large interface deformation, contact of multiple (>2) interfaces, and interaction between gases and condensed matters. In these challenging cases, the solution of bimaterial Riemann problems is accelerated by 37 to 87 times. As a result, the total cost of advective flux computation, which includes the exact Riemann problem solution at material interfaces and the numerical flux calculation over the entire computational domain, is accelerated by 18 to 81 times.
Published: 2023
Full Text: View/download PDF

12. Computational Analysis of Bubble-Structure Interactions in Near-Field Underwater Explosion

Author: Ma, Wentao, Zhao, Xuning, Gilbert, Christine, and Wang, Kevin
Subjects: Physics - Fluid Dynamics, Physics - Applied Physics, Physics - Computational Physics
Abstract: The response of underwater structures to a near-field explosion is coupled with the dynamics of the explosion bubble and the surrounding water. This multiphase fluid-structure interaction process is investigated using a model problem that features the yielding and collapse of a thin-walled aluminum cylinder. A recently developed computational framework that couples a compressible fluid dynamics solver with a structural dynamics solver is employed. The fluid-structure and liquid-gas interfaces are tracked using embedded boundary and level set methods. The conservation law across the interfaces is enforced by solving one-dimensional bimaterial Riemann problems. The initial pressure inside the explosion bubble is varied by two orders of magnitude in different test cases. Three different modes of collapse are discovered, including an horizontal collapse (i.e. with one lobe extending towards the explosive charge) that appears counterintuitive, yet has been observed in previous laboratory experiments. Because of the transition of modes, the time it takes for the structure to reach self-contact does not decrease monotonically as the explosion magnitude increases. The flow fields, the bubble dynamics, and the transient structural deformation are visualized to elucidate the cause of each collapse mode and the mode transitions. The result suggests that the pressure pulse resulting from the contraction of the explosion bubble has significant effect on the structure's collapse. The phase difference between the structural vibration and bubble oscillation influences the structure's mode of collapse. Furthermore, the transient structural deformation has clear effect on the bubble dynamics, leading to a two-way interaction. A liquid jet that points away from the structure is observed. Compared to the liquid jets produced by bubbles collapsing near a rigid wall, this jet is in the opposite direction.
Published: 2022
Full Text: View/download PDF

13. Bilingual Alignment Pre-Training for Zero-Shot Cross-Lingual Transfer

Author: Yang, Ziqing, Ma, Wentao, Cui, Yiming, Ye, Jiani, Che, Wanxiang, and Wang, Shijin
Subjects: Computer Science - Computation and Language
Abstract: Multilingual pre-trained models have achieved remarkable performance on cross-lingual transfer learning. Some multilingual models such as mBERT, have been pre-trained on unlabeled corpora, therefore the embeddings of different languages in the models may not be aligned very well. In this paper, we aim to improve the zero-shot cross-lingual transfer performance by proposing a pre-training task named Word-Exchange Aligning Model (WEAM), which uses the statistical alignment information as the prior knowledge to guide cross-lingual word prediction. We evaluate our model on multilingual machine reading comprehension task MLQA and natural language interface task XNLI. The results show that WEAM can significantly improve the zero-shot performance., Comment: 5 pages; accepted to MRQA 2021 @ EMNLP 2021
Published: 2021
Full Text: View/download PDF

14. CharBERT: Character-aware Pre-trained Language Model

Author: Ma, Wentao, Cui, Yiming, Si, Chenglei, Liu, Ting, Wang, Shijin, and Hu, Guoping
Subjects: Computer Science - Computation and Language
Abstract: Most pre-trained language models (PLMs) construct word representations at subword level with Byte-Pair Encoding (BPE) or its variations, by which OOV (out-of-vocab) words are almost avoidable. However, those methods split a word into subword units and make the representation incomplete and fragile. In this paper, we propose a character-aware pre-trained language model named CharBERT improving on the previous methods (such as BERT, RoBERTa) to tackle these problems. We first construct the contextual word embedding for each token from the sequential character representations, then fuse the representations of characters and the subword representations by a novel heterogeneous interaction module. We also propose a new pre-training task named NLM (Noisy LM) for unsupervised character representation learning. We evaluate our method on question answering, sequence labeling, and text classification tasks, both on the original datasets and adversarial misspelling test sets. The experimental results show that our method can significantly improve the performance and robustness of PLMs simultaneously. Pretrained models, evaluation sets, and code are available at https://github.com/wtma/CharBERT, Comment: 12 pages, to appear at COLING 2020
Published: 2020
Full Text: View/download PDF

15. Benchmarking Robustness of Machine Reading Comprehension Models

Author: Si, Chenglei, Yang, Ziqing, Cui, Yiming, Ma, Wentao, Liu, Ting, and Wang, Shijin
Subjects: Computer Science - Computation and Language
Abstract: Machine Reading Comprehension (MRC) is an important testbed for evaluating models' natural language understanding (NLU) ability. There has been rapid progress in this area, with new models achieving impressive performance on various benchmarks. However, existing benchmarks only evaluate models on in-domain test sets without considering their robustness under test-time perturbations or adversarial attacks. To fill this important gap, we construct AdvRACE (Adversarial RACE), a new model-agnostic benchmark for evaluating the robustness of MRC models under four different types of adversarial attacks, including our novel distractor extraction and generation attacks. We show that state-of-the-art (SOTA) models are vulnerable to all of these attacks. We conclude that there is substantial room for building more robust MRC models and our benchmark can help motivate and measure progress in this area. We release our data and code at https://github.com/NoviScl/AdvRACE ., Comment: ACL 2021 (Findings)
Published: 2020

16. Conversational Word Embedding for Retrieval-Based Dialog System

Author: Ma, Wentao, Cui, Yiming, Liu, Ting, Wang, Dong, Wang, Shijin, and Hu, Guoping
Subjects: Computer Science - Computation and Language
Abstract: Human conversations contain many types of information, e.g., knowledge, common sense, and language habits. In this paper, we propose a conversational word embedding method named PR-Embedding, which utilizes the conversation pairs $ \left\langle{post, reply} \right\rangle$ to learn word embedding. Different from previous works, PR-Embedding uses the vectors from two different semantic spaces to represent the words in post and reply. To catch the information among the pair, we first introduce the word alignment model from statistical machine translation to generate the cross-sentence window, then train the embedding on word-level and sentence-level. We evaluate the method on single-turn and multi-turn response selection tasks for retrieval-based dialog systems. The experiment results show that PR-Embedding can improve the quality of the selected response. PR-Embedding source code is available at https://github.com/wtma/PR-Embedding, Comment: To appear at ACL 2020
Published: 2020
Full Text: View/download PDF

17. A Sentence Cloze Dataset for Chinese Machine Reading Comprehension

Author: Cui, Yiming, Liu, Ting, Yang, Ziqing, Chen, Zhipeng, Ma, Wentao, Che, Wanxiang, Wang, Shijin, and Hu, Guoping
Subjects: Computer Science - Computation and Language
Abstract: Owing to the continuous efforts by the Chinese NLP community, more and more Chinese machine reading comprehension datasets become available. To add diversity in this area, in this paper, we propose a new task called Sentence Cloze-style Machine Reading Comprehension (SC-MRC). The proposed task aims to fill the right candidate sentence into the passage that has several blanks. We built a Chinese dataset called CMRC 2019 to evaluate the difficulty of the SC-MRC task. Moreover, to add more difficulties, we also made fake candidates that are similar to the correct ones, which requires the machine to judge their correctness in the context. The proposed dataset contains over 100K blanks (questions) within over 10K passages, which was originated from Chinese narrative stories. To evaluate the dataset, we implement several baseline systems based on the pre-trained models, and the results show that the state-of-the-art model still underperforms human performance by a large margin. We release the dataset and baseline system to further facilitate our community. Resources available through https://github.com/ymcui/cmrc2019, Comment: 7 pages, to appear at COLING 2020
Published: 2020
Full Text: View/download PDF

18. CJRC: A Reliable Human-Annotated Benchmark DataSet for Chinese Judicial Reading Comprehension

Author: Duan, Xingyi, Wang, Baoxin, Wang, Ziyue, Ma, Wentao, Cui, Yiming, Wu, Dayong, Wang, Shijin, Liu, Ting, Huo, Tianxiang, Hu, Zhen, Wang, Heng, and Liu, Zhiyuan
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: We present a Chinese judicial reading comprehension (CJRC) dataset which contains approximately 10K documents and almost 50K questions with answers. The documents come from judgment documents and the questions are annotated by law experts. The CJRC dataset can help researchers extract elements by reading comprehension technology. Element extraction is an important task in the legal field. However, it is difficult to predefine the element types completely due to the diversity of document types and causes of action. By contrast, machine reading comprehension technology can quickly extract elements by answering various questions from the long document. We build two strong baseline models based on BERT and BiDAF. The experimental results show that there is enough space for improvement compared to human annotators.
Published: 2019
Full Text: View/download PDF

19. TripleNet: Triple Attention Network for Multi-Turn Response Selection in Retrieval-based Chatbots

Author: Ma, Wentao, Cui, Yiming, Shao, Nan, He, Su, Zhang, Wei-Nan, Liu, Ting, Wang, Shijin, and Hu, Guoping
Subjects: Computer Science - Computation and Language, Computer Science - Information Retrieval
Abstract: We consider the importance of different utterances in the context for selecting the response usually depends on the current query. In this paper, we propose the model TripleNet to fully model the task with the triple instead of in previous works. The heart of TripleNet is a novel attention mechanism named triple attention to model the relationships within the triple at four levels. The new mechanism updates the representation for each element based on the attention with the other two concurrently and symmetrically. We match the triple centered on the response from char to context level for prediction. Experimental results on two large-scale multi-turn response selection datasets show that the proposed model can significantly outperform the state-of-the-art methods. TripleNet source code is available at https://github.com/wtma/TripleNet, Comment: 10 pages, accepted as a conference paper at CoNLL 2019
Published: 2019
Full Text: View/download PDF

20. Convolutional Spatial Attention Model for Reading Comprehension with Multiple-Choice Questions

Author: Chen, Zhipeng, Cui, Yiming, Ma, Wentao, Wang, Shijin, and Hu, Guoping
Subjects: Computer Science - Computation and Language
Abstract: Machine Reading Comprehension (MRC) with multiple-choice questions requires the machine to read given passage and select the correct answer among several candidates. In this paper, we propose a novel approach called Convolutional Spatial Attention (CSA) model which can better handle the MRC with multiple-choice questions. The proposed model could fully extract the mutual information among the passage, question, and the candidates, to form the enriched representations. Furthermore, to merge various attention results, we propose to use convolutional operation to dynamically summarize the attention values within the different size of regions. Experimental results show that the proposed model could give substantial improvements over various state-of-the-art systems on both RACE and SemEval-2018 Task11 datasets., Comment: 8 pages. Accepted as a conference paper at AAAI-19 Technical Track
Published: 2018
Full Text: View/download PDF

21. A Span-Extraction Dataset for Chinese Machine Reading Comprehension

Author: Cui, Yiming, Liu, Ting, Che, Wanxiang, Xiao, Li, Chen, Zhipeng, Ma, Wentao, Wang, Shijin, and Hu, Guoping
Subjects: Computer Science - Computation and Language
Abstract: Machine Reading Comprehension (MRC) has become enormously popular recently and has attracted a lot of attention. However, the existing reading comprehension datasets are mostly in English. In this paper, we introduce a Span-Extraction dataset for Chinese machine reading comprehension to add language diversities in this area. The dataset is composed by near 20,000 real questions annotated on Wikipedia paragraphs by human experts. We also annotated a challenge set which contains the questions that need comprehensive understanding and multi-sentence inference throughout the context. We present several baseline systems as well as anonymous submissions for demonstrating the difficulties in this dataset. With the release of the dataset, we hosted the Second Evaluation Workshop on Chinese Machine Reading Comprehension (CMRC 2018). We hope the release of the dataset could further accelerate the Chinese machine reading comprehension research. Resources are available: https://github.com/ymcui/cmrc2018, Comment: 6 pages, accepted as a conference paper at EMNLP-IJCNLP 2019 (short paper)
Published: 2018
Full Text: View/download PDF

22. HFL-RC System at SemEval-2018 Task 11: Hybrid Multi-Aspects Model for Commonsense Reading Comprehension

Author: Chen, Zhipeng, Cui, Yiming, Ma, Wentao, Wang, Shijin, Liu, Ting, and Hu, Guoping
Subjects: Computer Science - Computation and Language
Abstract: This paper describes the system which got the state-of-the-art results at SemEval-2018 Task 11: Machine Comprehension using Commonsense Knowledge. In this paper, we present a neural network called Hybrid Multi-Aspects (HMA) model, which mimic the human's intuitions on dealing with the multiple-choice reading comprehension. In this model, we aim to produce the predictions in multiple aspects by calculating attention among the text, question and choices, and combine these results for final predictions. Experimental results show that our HMA model could give substantial improvements over the baseline system and got the first place on the final test set leaderboard with the accuracy of 84.13%., Comment: 6 pages
Published: 2018

23. Bias-Compensated Normalized Maximum Correntropy Criterion Algorithm for System Identification with Noisy Input

Author: Ma, Wentao, Zheng, Dongqiao, Li, Yuanhao, Zhang, Zhiyu, and Chen, Badong
Subjects: Statistics - Machine Learning, Electrical Engineering and Systems Science - Signal Processing
Abstract: This paper proposed a bias-compensated normalized maximum correntropy criterion (BCNMCC) algorithm charactered by its low steady-state misalignment for system identification with noisy input in an impulsive output noise environment. The normalized maximum correntropy criterion (NMCC) is derived from a correntropy based cost function, which is rather robust with respect to impulsive noises. To deal with the noisy input, we introduce a bias-compensated vector (BCV) to the NMCC algorithm, and then an unbiasedness criterion and some reasonable assumptions are used to compute the BCV. Taking advantage of the BCV, the bias caused by the input noise can be effectively suppressed. System identification simulation results demonstrate that the proposed BCNMCC algorithm can outperform other related algorithms with noisy input especially in an impulsive output noise environment., Comment: 14 pages, 4 figures
Published: 2017

24. Dataset for the First Evaluation on Chinese Machine Reading Comprehension

Author: Cui, Yiming, Liu, Ting, Chen, Zhipeng, Ma, Wentao, Wang, Shijin, and Hu, Guoping
Subjects: Computer Science - Computation and Language
Abstract: Machine Reading Comprehension (MRC) has become enormously popular recently and has attracted a lot of attention. However, existing reading comprehension datasets are mostly in English. To add diversity in reading comprehension datasets, in this paper we propose a new Chinese reading comprehension dataset for accelerating related research in the community. The proposed dataset contains two different types: cloze-style reading comprehension and user query reading comprehension, associated with large-scale training data as well as human-annotated validation and hidden test set. Along with this dataset, we also hosted the first Evaluation on Chinese Machine Reading Comprehension (CMRC-2017) and successfully attracted tens of participants, which suggest the potential impact of this dataset., Comment: 5 pages, published at LREC 2018
Published: 2017

25. Diffusion Maximum Correntropy Criterion Algorithms for Robust Distributed Estimation

Author: Ma, Wentao, Chen, Badong, Duan, Jiandong, and Zhao, Haiquan
Subjects: Statistics - Machine Learning, Computer Science - Learning
Abstract: Robust diffusion adaptive estimation algorithms based on the maximum correntropy criterion (MCC), including adaptation to combination MCC and combination to adaptation MCC, are developed to deal with the distributed estimation over network in impulsive (long-tailed) noise environments. The cost functions used in distributed estimation are in general based on the mean square error (MSE) criterion, which is desirable when the measurement noise is Gaussian. In non-Gaussian situations, such as the impulsive-noise case, MCC based methods may achieve much better performance than the MSE methods as they take into account higher order statistics of error distribution. The proposed methods can also outperform the robust diffusion least mean p-power(DLMP) and diffusion minimum error entropy (DMEE) algorithms. The mean and mean square convergence analysis of the new algorithms are also carried out., Comment: 17 pages,10 figures
Published: 2015

26. Sparsity Aware Normalized Least Mean p-power Algorithms with Correntropy Induced Metric Penalty

Author: Ma, Wentao, Qu, Hua, Zhao, Jihong, Chen, Badong, and Gui, Guan
Subjects: Computer Science - Information Theory
Abstract: For identifying the non-Gaussian impulsive noise systems, normalized LMP (NLMP) has been proposed to combat impulsive-inducing instability. However, the standard algorithm is without considering the inherent sparse structure distribution of unknown system. To exploit sparsity as well as to mitigate the impulsive noise, this paper proposes a sparse NLMP algorithm, i.e., Correntropy Induced Metric (CIM) constraint based NLMP (CIMNLMP). Based on the first proposed algorithm, moreover, we propose an improved CIM constraint variable regularized NLMP(CIMVRNLMP) algorithm by utilizing variable regularized parameter(VRP) selection method which can further adjust convergence speed and steady-state error. Numerical simulations are given to confirm the proposed algorithms., Comment: 5 pages, 4 figures, submitted for DSP2015
Published: 2015

27. Maximum correntropy criterion based sparse adaptive filtering algorithms for robust channel estimation under non-Gaussian environments

Author: Ma, Wentao, Qua, Hua, Gui, Guan, Xu, Li, Zhaoa, Jihong, and Chen, Badong
Subjects: Computer Science - Information Theory
Abstract: Sparse adaptive channel estimation problem is one of the most important topics in broadband wireless communications systems due to its simplicity and robustness. So far many sparsity-aware channel estimation algorithms have been developed based on the well-known minimum mean square error (MMSE) criterion, such as the zero-attracting least mean square (ZALMS), which are robust under Gaussian assumption. In non-Gaussian environments, however, these methods are often no longer robust especially when systems are disturbed by random impulsive noises. To address this problem, we propose in this work a robust sparse adaptive filtering algorithm using correntropy induced metric (CIM) penalized maximum correntropy criterion (MCC) rather than conventional MMSE criterion for robust channel estimation. Specifically, MCC is utilized to mitigate the impulsive noise while CIM is adopted to exploit the channel sparsity efficiently. Both theoretical analysis and computer simulations are provided to corroborate the proposed methods., Comment: 29 pages, 12 figures, accepted by Journal of the Franklin Institute
Published: 2015

28. Robust Adaptive Sparse Channel Estimation in the Presence of Impulsive Noises

Author: Gui, Guan, Xu, Li, Ma, Wentao, and Chen, Badong
Subjects: Computer Science - Information Theory, Computer Science - Systems and Control
Abstract: Broadband wireless channels usually have the sparse nature. Based on the assumption of Gaussian noise model, adaptive filtering algorithms for reconstruction sparse channels were proposed to take advantage of channel sparsity. However, impulsive noises are often existed in many advance broadband communications systems. These conventional algorithms are vulnerable to deteriorate due to interference of impulsive noise. In this paper, sign least mean square algorithm (SLMS) based robust sparse adaptive filtering algorithms are proposed for estimating channels as well as for mitigating impulsive noise. By using different sparsity-inducing penalty functions, i.e., zero-attracting (ZA), reweighted ZA (RZA), reweighted L1-norm (RL1) and Lp-norm (LP), the proposed SLMS algorithms are termed as SLMS-ZA, SLMS-RZA, LSMS-RL1 and SLMS-LP. Simulation results are given to validate the proposed algorithms., Comment: 5 pages, 4 figures, submitted for DSP2015 conference paper
Published: 2015

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

28 results on '"Ma, Wentao"'

1. FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents

2. PTA: Enhancing Multimodal Sentiment Analysis through Pipelined Prediction and Translation-based Alignment

3. Self-Explanation Prompting Improves Dialogue Understanding in Large Language Models

4. UniPCM: Universal Pre-trained Conversation Model with Task-aware Automatic Prompt

5. Long-Pulse Laser-Induced Cavitation: A Race Between Advection and Phase Transition

6. Data-driven Approximation of Distributionally Robust Chance Constraints using Bayesian Credible Intervals

7. SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue Agents

8. Speech-Text Dialog Pre-training for Spoken Dialog Understanding with Explicit Cross-Modal Alignment

9. Gate Recurrent Unit Network based on Hilbert-Schmidt Independence Criterion for State-of-Health Estimation

10. Fluid-Solid Coupled Simulation of Hypervelocity Impact and Plasma Formation

11. Efficient Solution of Bimaterial Riemann Problems for Compressible Multi-Material Flow Simulations

12. Computational Analysis of Bubble-Structure Interactions in Near-Field Underwater Explosion

13. Bilingual Alignment Pre-Training for Zero-Shot Cross-Lingual Transfer

14. CharBERT: Character-aware Pre-trained Language Model

15. Benchmarking Robustness of Machine Reading Comprehension Models

16. Conversational Word Embedding for Retrieval-Based Dialog System

17. A Sentence Cloze Dataset for Chinese Machine Reading Comprehension

18. CJRC: A Reliable Human-Annotated Benchmark DataSet for Chinese Judicial Reading Comprehension

19. TripleNet: Triple Attention Network for Multi-Turn Response Selection in Retrieval-based Chatbots

20. Convolutional Spatial Attention Model for Reading Comprehension with Multiple-Choice Questions

21. A Span-Extraction Dataset for Chinese Machine Reading Comprehension

22. HFL-RC System at SemEval-2018 Task 11: Hybrid Multi-Aspects Model for Commonsense Reading Comprehension

23. Bias-Compensated Normalized Maximum Correntropy Criterion Algorithm for System Identification with Noisy Input

24. Dataset for the First Evaluation on Chinese Machine Reading Comprehension

25. Diffusion Maximum Correntropy Criterion Algorithms for Robust Distributed Estimation

26. Sparsity Aware Normalized Least Mean p-power Algorithms with Correntropy Induced Metric Penalty

27. Maximum correntropy criterion based sparse adaptive filtering algorithms for robust channel estimation under non-Gaussian environments

28. Robust Adaptive Sparse Channel Estimation in the Presence of Impulsive Noises

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Publication Type

Database

28 results on '"Ma, Wentao"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources