Author: "Xu, Yumo" / Database: OAIster - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Xu, Yumo"' showing total 10 results

Start Over Author "Xu, Yumo" Database OAIster

10 results on '"Xu, Yumo"'

1. Fine-Grained Natural Language Inference Based Faithfulness Evaluation for Diverse Summarisation Tasks

Author: Zhang, Huajian, Xu, Yumo, Perez-Beltrachini, Laura, Zhang, Huajian, Xu, Yumo, and Perez-Beltrachini, Laura
Abstract: We study existing approaches to leverage off-the-shelf Natural Language Inference (NLI) models for the evaluation of summary faithfulness and argue that these are sub-optimal due to the granularity level considered for premises and hypotheses. That is, the smaller content unit considered as hypothesis is a sentence and premises are made up of a fixed number of document sentences. We propose a novel approach, namely InFusE, that uses a variable premise size and simplifies summary sentences into shorter hypotheses. Departing from previous studies which focus on single short document summarisation, we analyse NLI based faithfulness evaluation for diverse summarisation tasks. We introduce DiverSumm, a new benchmark comprising long form summarisation (long documents and summaries) and diverse summarisation tasks (e.g., meeting and multi-document summarisation). In experiments, InFusE obtains superior performance across the different summarisation tasks. Our code and data are available at https://github.com/HJZnlp/infuse., Comment: EACL 2024
Published: 2024

2. QTSumm: Query-Focused Summarization over Tabular Data

Author: Zhao, Yilun, Qi, Zhenting, Nan, Linyong, Mi, Boyu, Liu, Yixin, Zou, Weijin, Han, Simeng, Chen, Ruizhe, Tang, Xiangru, Xu, Yumo, Radev, Dragomir, Cohan, Arman, Zhao, Yilun, Qi, Zhenting, Nan, Linyong, Mi, Boyu, Liu, Yixin, Zou, Weijin, Han, Simeng, Chen, Ruizhe, Tang, Xiangru, Xu, Yumo, Radev, Dragomir, and Cohan, Arman
Abstract: People primarily consult tables to conduct data analysis or answer specific questions. Text generation systems that can provide accurate table summaries tailored to users' information needs can facilitate more efficient access to relevant data insights. Motivated by this, we define a new query-focused table summarization task, where text generation models have to perform human-like reasoning and analysis over the given table to generate a tailored summary. We introduce a new benchmark named QTSumm for this task, which contains 7,111 human-annotated query-summary pairs over 2,934 tables covering diverse topics. We investigate a set of strong baselines on QTSumm, including text generation, table-to-text generation, and large language models. Experimental results and manual analysis reveal that the new task presents significant challenges in table-to-text generation for future research. Moreover, we propose a new approach named ReFactor, to retrieve and reason over query-relevant information from tabular data to generate several natural language facts. Experimental results demonstrate that ReFactor can bring improvements to baselines by concatenating the generated facts to the model input. Our data and code are publicly available at https://github.com/yale-nlp/QTSumm., Comment: Accepted at EMNLP 2023
Published: 2023

3. Text Summarization with Oracle Expectation

Author: Xu, Yumo, Lapata, Mirella, Xu, Yumo, and Lapata, Mirella
Abstract: Extractive summarization produces summaries by identifying and concatenating the most important sentences in a document. Since most summarization datasets do not come with gold labels indicating whether document sentences are summary-worthy, different labeling algorithms have been proposed to extrapolate oracle extracts for model training. In this work, we identify two flaws with the widely used greedy labeling approach: it delivers suboptimal and deterministic oracles. To alleviate both issues, we propose a simple yet effective labeling algorithm that creates soft, expectation-based sentence labels. We define a new learning objective for extractive summarization which incorporates learning signals from multiple oracle summaries and prove it is equivalent to estimating the oracle expectation for each document sentence. Without any architectural modifications, the proposed labeling scheme achieves superior performance on a variety of summarization benchmarks across domains and languages, in both supervised and zero-shot settings., Comment: 18 pages, 5 figures
Published: 2022

4. Tackling Query-Focused Summarization as A Knowledge-Intensive Task: A Pilot Study

Author: Zhang, Weijia, Vakulenko, Svitlana, Rajapakse, Thilina, Xu, Yumo, Kanoulas, Evangelos, Zhang, Weijia, Vakulenko, Svitlana, Rajapakse, Thilina, Xu, Yumo, and Kanoulas, Evangelos
Abstract: Query-focused summarization (QFS) requires generating a summary given a query using a set of relevant documents. However, such relevant documents should be annotated manually and thus are not readily available in realistic scenarios. To address this limitation, we tackle the QFS task as a knowledge-intensive (KI) task without access to any relevant documents. Instead, we assume that these documents are present in a large-scale knowledge corpus and should be retrieved first. To explore this new setting, we build a new dataset (KI-QFS) by adapting existing QFS datasets. In this dataset, answering the query requires document retrieval from a knowledge corpus. We construct three different knowledge corpora, and we further provide relevance annotations to enable retrieval evaluation. Finally, we benchmark the dataset with state-of-the-art QFS models and retrieval-enhanced models. The experimental results demonstrate that QFS models perform significantly worse on KI-QFS compared to the original QFS task, indicating that the knowledge-intensive setting is much more challenging and offers substantial room for improvement. We believe that our investigation will inspire further research into addressing QFS in more realistic scenarios., Comment: Accepted by Gen-IR@SIGIR 2023 workshop
Published: 2021

5. Text Summarization with Latent Queries

Author: Xu, Yumo, Lapata, Mirella, Xu, Yumo, and Lapata, Mirella
Abstract: The availability of large-scale datasets has driven the development of neural models that create summaries from single documents, for generic purposes. When using a summarization system, users often have specific intents with various language realizations, which, depending on the information need, can range from a single keyword to a long narrative composed of multiple questions. Existing summarization systems, however, often either fail to support or act robustly on this query focused summarization task. We introduce LaQSum, the first unified text summarization system that learns Latent Queries from documents for abstractive summarization with any existing query forms. Under a deep generative framework, our system jointly optimizes a latent query model and a conditional language model, allowing users to plug-and-play queries of any type at test time. Despite learning from only generic summarization data and requiring no further optimization for downstream summarization tasks, our system robustly outperforms strong comparison systems across summarization benchmarks with different query types, document settings, and target domains., Comment: 12 pages
Published: 2021

6. Meta Dialogue Policy Learning

Author: Xu, Yumo, Zhu, Chenguang, Peng, Baolin, Zeng, Michael, Xu, Yumo, Zhu, Chenguang, Peng, Baolin, and Zeng, Michael
Abstract: Dialog policy determines the next-step actions for agents and hence is central to a dialogue system. However, when migrated to novel domains with little data, a policy model can fail to adapt due to insufficient interactions with the new environment. We propose Deep Transferable Q-Network (DTQN) to utilize shareable low-level signals between domains, such as dialogue acts and slots. We decompose the state and action representation space into feature subspaces corresponding to these low-level components to facilitate cross-domain knowledge transfer. Furthermore, we embed DTQN in a meta-learning framework and introduce Meta-DTQN with a dual-replay mechanism to enable effective off-policy training and adaptation. In experiments, our model outperforms baseline models in terms of both success rate and dialogue efficiency on the multi-domain dialogue dataset MultiWOZ 2.0., Comment: 10 pages, 3 figures
Published: 2020

7. Bootstrapping a Crosslingual Semantic Parser

Author: Sherborne, Tom, Xu, Yumo, Lapata, Mirella, Sherborne, Tom, Xu, Yumo, and Lapata, Mirella
Abstract: Recent progress in semantic parsing scarcely considers languages other than English but professional translation can be prohibitively expensive. We adapt a semantic parser trained on a single language, such as English, to new languages and multiple domains with minimal annotation. We query if machine translation is an adequate substitute for training data, and extend this to investigate bootstrapping using joint training with English, paraphrasing, and multilingual pre-trained models. We develop a Transformer-based parser combining paraphrases by ensembling attention over multiple encoders and present new versions of ATIS and Overnight in German and Chinese for evaluation. Experimental results indicate that MT can approximate training data in a new language for accurate parsing when augmented with paraphrasing through multiple MT engines. Considering when MT is inadequate, we also find that using our approach achieves parsing accuracy within 2% of complete translation using only 50% of training data., Comment: Camera Ready for EMNLP2020 Findings
Published: 2020

8. Query Focused Multi-Document Summarization with Distant Supervision

Author: Xu, Yumo, Lapata, Mirella, Xu, Yumo, and Lapata, Mirella
Abstract: We consider the problem of better modeling query-cluster interactions to facilitate query focused multi-document summarization (QFS). Due to the lack of training data, existing work relies heavily on retrieval-style methods for estimating the relevance between queries and text segments. In this work, we leverage distant supervision from question answering where various resources are available to more explicitly capture the relationship between queries and documents. We propose a coarse-to-fine modeling framework which introduces separate modules for estimating whether segments are relevant to the query, likely to contain an answer, and central. Under this framework, a trained evidence estimator further discerns which retrieved segments might answer the query for final selection in the summary. We demonstrate that our framework outperforms strong comparison systems on standard QFS benchmarks., Comment: 11 pages, 3 figures
Published: 2020

9. Generating Query Focused Summaries from Query-Free Resources

Author: Xu, Yumo, Lapata, Mirella, Xu, Yumo, and Lapata, Mirella
Abstract: The availability of large-scale datasets has driven the development of neural models that create generic summaries from single or multiple documents. In this work we consider query focused summarization (QFS), a task for which training data in the form of queries, documents, and summaries is not readily available. We propose to decompose QFS into (1) query modeling (i.e., finding supportive evidence within a set of documents for a query) and (2) conditional language modeling (i.e., summary generation). We introduce MaRGE, a Masked ROUGE Regression framework for evidence estimation and ranking which relies on a unified representation for summaries and queries, so that summaries in generic data can be converted into proxy queries for learning a query model. Experiments across QFS benchmarks and query types show that our model achieves state-of-the-art performance despite learning from weak supervision., Comment: ACL 2021
Published: 2020

10. Weakly Supervised Domain Detection

Author: Xu, Yumo, Lapata, Mirella, Xu, Yumo, and Lapata, Mirella
Abstract: In this paper we introduce domain detection as a new natural language processing task. We argue that the ability to detect textual segments which are domain-heavy, i.e., sentences or phrases which are representative of and provide evidence for a given domain could enhance the robustness and portability of various text classification applications. We propose an encoder-detector framework for domain detection and bootstrap classifiers with multiple instance learning (MIL). The model is hierarchically organized and suited to multilabel classification. We demonstrate that despite learning with minimal supervision, our model can be applied to text spans of different granularities, languages, and genres. We also showcase the potential of domain detection for text summarization., Comment: To appear in Transactions of the Association for Computational Linguistics (TACL); 16 pages
Published: 2019

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

10 results on '"Xu, Yumo"'

1. Fine-Grained Natural Language Inference Based Faithfulness Evaluation for Diverse Summarisation Tasks

2. QTSumm: Query-Focused Summarization over Tabular Data

3. Text Summarization with Oracle Expectation

4. Tackling Query-Focused Summarization as A Knowledge-Intensive Task: A Pilot Study

5. Text Summarization with Latent Queries

6. Meta Dialogue Policy Learning

7. Bootstrapping a Crosslingual Semantic Parser

8. Query Focused Multi-Document Summarization with Distant Supervision

9. Generating Query Focused Summaries from Query-Free Resources

10. Weakly Supervised Domain Detection

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Publication Year Range

Publication Type

Database

10 results on '"Xu, Yumo"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources