Author: "Budzianowski, Paweł" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Budzianowski, Paweł"' showing total 93 results

1. Wait, that's not an option: LLMs Robustness with Incorrect Multiple-Choice Options

Author: Góral, Gracjan, Wiśnios, Emilia, Sankowski, Piotr, and Budzianowski, Paweł
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Decision-making under full alignment requires balancing between reasoning and faithfulness - a challenge for large language models (LLMs). This study explores whether LLMs prioritize following instructions over reasoning and truth when given "misleading" instructions, such as "Respond solely with A or B", even when neither option is correct. We introduce a new metric called "reflective judgment", which sheds new light on the relationship between the pre-training and post-training alignment schemes. In tasks ranging from basic arithmetic to domain-specific assessments, models like GPT-4o, o1-mini, or Claude 3 Opus adhered to instructions correctly but failed to reflect on the validity of the provided options. In contrast, models from the Llama 3.1 family (8B, 70B, 405B) or base Qwen2.5 (7B, 14B, 32B) families exhibit improved refusal rates with size, indicating a scaling effect. We also observed that alignment techniques, though intended to enhance reasoning, sometimes weakened the models' ability to reject incorrect instructions, leading them to follow flawed prompts uncritically. Finally, we have also conducted a parallel human study revealing similar patterns in human behavior and annotations. We highlight how popular RLHF datasets might disrupt either training or evaluation due to annotations exhibiting poor reflective judgement.
Comment: Accepted for NeurIPS 2024 FM-EduAssess Workshop
Published: 2024

2. Pheme: Efficient and Conversational Speech Generation

Author: Budzianowski, Paweł, Sereda, Taras, Cichy, Tomasz, and Vulić, Ivan
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: In recent years, speech generation has seen remarkable progress, now achieving one-shot generation capability that is often virtually indistinguishable from real human voice. Integrating such advancements in speech generation with large language models might revolutionize a wide range of applications. However, certain applications, such as assistive conversational systems, require natural and conversational speech generation tools that also operate efficiently in real time. Current state-of-the-art models like VALL-E and SoundStorm, powered by hierarchical neural audio codecs, require large neural components and extensive training data to work well. In contrast, MQTTS aims to build more compact conversational TTS models while capitalizing on smaller-scale real-life conversational speech data. However, its autoregressive nature yields high inference latency and thus limits its real-time usage. In order to mitigate the current limitations of the state-of-the-art TTS models while capitalizing on their strengths, in this work we introduce the Pheme model series that 1) offers compact yet high-performing models, 2) allows for parallel speech generation of 3) natural conversational speech, and 4) it can be trained efficiently on smaller-scale conversational data, cutting data demands by more than 10x but still matching the quality of the autoregressive TTS models. We also show that through simple teacher-student distillation we can meet significant improvements in voice quality for single-speaker setups on top of pretrained Pheme checkpoints, relying solely on synthetic speech generated by much larger teacher models. Audio samples and pretrained models are available online.
Published: 2024

3. Dial BeInfo for Faithfulness: Improving Factuality of Information-Seeking Dialogue via Behavioural Fine-Tuning

Author: Razumovskaia, Evgeniia, Vulić, Ivan, Marković, Pavle, Cichy, Tomasz, Zheng, Qian, Wen, Tsung-Hsien, and Budzianowski, Paweł
Subjects: Computer Science - Computation and Language
Abstract: Factuality is a crucial requirement in information seeking dialogue: the system should respond to the user's queries so that the responses are meaningful and aligned with the knowledge provided to the system. However, most modern large language models suffer from hallucinations, that is, they generate responses not supported by or contradicting the knowledge source. To mitigate the issue and increase faithfulness of information-seeking dialogue systems, we introduce BeInfo, a simple yet effective method that applies behavioural tuning to aid information-seeking dialogue. Relying on three standard datasets, we show that models tuned with BeInfo become considerably more faithful to the knowledge source both for datasets and domains seen during BeInfo-tuning, as well as on unseen domains, when applied in a zero-shot manner. In addition, we show that the models with 3B parameters (e.g., Flan-T5) tuned with BeInfo demonstrate strong performance on data from real 'production' conversations and outperform GPT4 when tuned on a limited amount of such realistic in-domain dialogues.
Published: 2023

4. Knowledge-Aware Audio-Grounded Generative Slot Filling for Limited Annotated Data

Author: Sun, Guangzhi, Zhang, Chao, Vulić, Ivan, Budzianowski, Paweł, and Woodland, Philip C.
Subjects: Computer Science - Computation and Language
Abstract: Manually annotating fine-grained slot-value labels for task-oriented dialogue (ToD) systems is an expensive and time-consuming endeavour. This motivates research into slot-filling methods that operate with limited amounts of labelled data. Moreover, the majority of current work on ToD is based solely on text as the input modality, neglecting the additional challenges of imperfect automatic speech recognition (ASR) when working with spoken language. In this work, we propose a Knowledge-Aware Audio-Grounded generative slot-filling framework, termed KA2G, that focuses on few-shot and zero-shot slot filling for ToD with speech input. KA2G achieves robust and data-efficient slot filling for speech-based ToD by 1) framing it as a text generation task, 2) grounding text generation additionally in the audio modality, and 3) conditioning on available external knowledge (e.g. a predefined list of possible slot values). We show that combining both modalities within the KA2G framework improves the robustness against ASR errors. Further, the knowledge-aware slot-value generator in KA2G, implemented via a pointer generator mechanism, particularly benefits few-shot and zero-shot learning. Experiments, conducted on the standard speech-based single-turn SLURP dataset and a multi-turn dataset extracted from a commercial ToD system, display strong and consistent gains over prior work, especially in few-shot and zero-shot setups.
Comment: to submit to CS&L
Published: 2023

5. EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and Identification

Author: Spithourakis, Georgios P., Vulić, Ivan, Lis, Michał, Casanueva, Iñigo, and Budzianowski, Paweł
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Knowledge-based authentication is crucial for task-oriented spoken dialogue systems that offer personalised and privacy-focused services. Such systems should be able to enrol (E), verify (V), and identify (I) new and recurring users based on their personal information, e.g. postcode, name, and date of birth. In this work, we formalise the three authentication tasks and their evaluation protocols, and we present EVI, a challenging spoken multilingual dataset with 5,506 dialogues in English, Polish, and French. Our proposed models set the first competitive benchmarks, explore the challenges of multilingual natural language processing of spoken dialogue, and set directions for future research.
Comment: 13 pages, 7 figures, 7 tables. Accepted in NAACL 2022 (Findings)
Published: 2022

6. NLU++: A Multi-Label, Slot-Rich, Generalisable Dataset for Natural Language Understanding in Task-Oriented Dialogue

Author: Casanueva, Iñigo, Vulić, Ivan, Spithourakis, Georgios P., and Budzianowski, Paweł
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: We present NLU++, a novel dataset for natural language understanding (NLU) in task-oriented dialogue (ToD) systems, with the aim to provide a much more challenging evaluation environment for dialogue NLU models, up to date with the current application and industry requirements. NLU++ is divided into two domains (BANKING and HOTELS) and brings several crucial improvements over current commonly used NLU datasets. 1) NLU++ provides fine-grained domain ontologies with a large set of challenging multi-intent sentences, introducing and validating the idea of intent modules that can be combined into complex intents that convey complex user goals, combined with finer-grained and thus more challenging slot sets. 2) The ontology is divided into domain-specific and generic (i.e., domain-universal) intent modules that overlap across domains, promoting cross-domain reusability of annotated examples. 3) The dataset design has been inspired by the problems observed in industrial ToD systems, and 4) it has been collected, filtered and carefully annotated by dialogue NLU experts, yielding high-quality annotated data. Finally, we benchmark a series of current state-of-the-art NLU models on NLU++; the results demonstrate the challenging nature of the dataset, especially in low-data regimes, the validity of 'intent modularisation', and call for further research on ToD NLU.
Comment: 16 pages, 1 figure, 10 tables. Accepted in NAACL 2022 (Findings)
Published: 2022

7. Improved and Efficient Conversational Slot Labeling through Question Answering

Author: Fuisz, Gabor, Vulić, Ivan, Gibbons, Samuel, Casanueva, Inigo, and Budzianowski, Paweł
Subjects: Computer Science - Computation and Language
Abstract: Transformer-based pretrained language models (PLMs) offer unmatched performance across the majority of natural language understanding (NLU) tasks, including a body of question answering (QA) tasks. We hypothesize that improvements in QA methodology can also be directly exploited in dialog NLU; however, dialog tasks must be reformatted into QA tasks. In particular, we focus on modeling and studying slot labeling (SL), a crucial component of NLU for dialog, through the QA optics, aiming to improve both its performance and efficiency, and make it more effective and resilient to working with limited task data. To this end, we make a series of contributions: 1) We demonstrate how QA-tuned PLMs can be applied to the SL task, reaching new state-of-the-art performance, with large gains especially pronounced in such low-data regimes. 2) We propose to leverage contextual information, required to tackle ambiguous values, simply through natural language. 3) Efficiency and compactness of QA-oriented fine-tuning are boosted through the use of lightweight yet effective adapter modules. 4) Trading-off some of the quality of QA datasets for their size, we experiment with larger automatically generated QA datasets for QA-tuning, arriving at even higher performance. Finally, our analysis suggests that our novel QA-based slot labeling models, supported by the PLMs, reach a performance ceiling in high-data regimes, calling for more challenging and more nuanced benchmarks in future work.
Published: 2022

8. Knowledge-aware audio-grounded generative slot filling for limited annotated data

Author: Sun, Guangzhi, Zhang, Chao, Vulić, Ivan, Budzianowski, Paweł, and Woodland, Philip C.
Published: 2025

9. ConvFiT: Conversational Fine-Tuning of Pretrained Language Models

Author: Vulić, Ivan, Su, Pei-Hao, Coope, Sam, Gerz, Daniela, Budzianowski, Paweł, Casanueva, Iñigo, Mrkšić, Nikola, and Wen, Tsung-Hsien
Subjects: Computer Science - Computation and Language
Abstract: Transformer-based language models (LMs) pretrained on large text collections are proven to store a wealth of semantic knowledge. However, 1) they are not effective as sentence encoders when used off-the-shelf, and 2) thus typically lag behind conversationally pretrained (e.g., via response selection) encoders on conversational tasks such as intent detection (ID). In this work, we propose ConvFiT, a simple and efficient two-stage procedure which turns any pretrained LM into a universal conversational encoder (after Stage 1 ConvFiT-ing) and task-specialised sentence encoder (after Stage 2). We demonstrate that 1) full-blown conversational pretraining is not required, and that LMs can be quickly transformed into effective conversational encoders with much smaller amounts of unannotated data; 2) pretrained LMs can be fine-tuned into task-specialised sentence encoders, optimised for the fine-grained semantics of a particular task. Consequently, such specialised sentence encoders allow for treating ID as a simple semantic similarity task based on interpretable nearest neighbours retrieval. We validate the robustness and versatility of the ConvFiT framework with such similarity-based inference on the standard ID evaluation sets: ConvFiT-ed LMs achieve state-of-the-art ID performance across the board, with particular gains in the most challenging, few-shot setups.
Comment: EMNLP 2021 (long paper)
Published: 2021

10. Semi-supervised Bootstrapping of Dialogue State Trackers for Task Oriented Modelling

Author: Tseng, Bo-Hsiang, Rei, Marek, Budzianowski, Paweł, Turner, Richard E., Byrne, Bill, and Korhonen, Anna
Subjects: Computer Science - Computation and Language
Abstract: Dialogue systems benefit greatly from optimizing on detailed annotations, such as transcribed utterances, internal dialogue state representations and dialogue act labels. However, collecting these annotations is expensive and time-consuming, holding back development in the area of dialogue modelling. In this paper, we investigate semi-supervised learning methods that are able to reduce the amount of required intermediate labelling. We find that by leveraging un-annotated data instead, the amount of turn-level annotations of dialogue state can be significantly reduced when building a neural dialogue system. Our analysis on the MultiWOZ corpus, covering a range of domains and topics, finds that annotations can be reduced by up to 30% while maintaining equivalent system performance. We also describe and evaluate the first end-to-end dialogue model created for the MultiWOZ corpus.
Comment: This article is published at EMNLP-IJCNLP 2019
Published: 2019

11. Tree-Structured Semantic Encoder with Knowledge Sharing for Domain Adaptation in Natural Language Generation

Author: Tseng, Bo-Hsiang, Budzianowski, Paweł, Wu, Yen-Chen, and Gašić, Milica
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Domain adaptation in natural language generation (NLG) remains challenging because of the high complexity of input semantics across domains and limited data of a target domain. This is particularly the case for dialogue systems, where we want to be able to seamlessly include new domains into the conversation. Therefore, it is crucial for generation models to share knowledge across domains for the effective adaptation from one domain to another. In this study, we exploit a tree-structured semantic encoder to capture the internal structure of complex semantic representations required for multi-domain dialogues in order to facilitate knowledge sharing across domains. In addition, a layer-wise attention mechanism between the tree encoder and the decoder is adopted to further improve the model's capability. The automatic evaluation results show that our model outperforms previous methods in terms of the BLEU score and the slot error rate, in particular when the adaptation data is limited. In subjective evaluation, human judges tend to prefer the sentences generated by our model, rating them more highly on informativeness and naturalness than other systems.
Comment: Published in SIGDIAL2019
Published: 2019

12. Domain Transfer in Dialogue Systems without Turn-Level Supervision

Author: Bingel, Joachim, Hansen, Victor Petrén Bach, Gonzalez, Ana Valeria, Budzianowski, Paweł, Augenstein, Isabelle, and Søgaard, Anders
Subjects: Computer Science - Computation and Language
Abstract: Task oriented dialogue systems rely heavily on specialized dialogue state tracking (DST) modules for dynamically predicting user intent throughout the conversation. State-of-the-art DST models are typically trained in a supervised manner from manual annotations at the turn level. However, these annotations are costly to obtain, which makes it difficult to create accurate dialogue systems for new domains. To address these limitations, we propose a method, based on reinforcement learning, for transferring DST models to new domains without turn-level supervision. Across several domains, our experiments show that this method quickly adapts off-the-shelf models to new domains and performs on par with models trained with turn-level supervision. We also show our method can improve models trained using turn-level supervision by subsequent fine-tuning optimization toward dialog-level rewards.
Published: 2019

13. PolyResponse: A Rank-based Approach to Task-Oriented Dialogue with Application in Restaurant Search and Booking

Author: Henderson, Matthew, Vulić, Ivan, Casanueva, Iñigo, Budzianowski, Paweł, Gerz, Daniela, Coope, Sam, Spithourakis, Georgios, Wen, Tsung-Hsien, Mrkšić, Nikola, and Su, Pei-Hao
Subjects: Computer Science - Computation and Language
Abstract: We present PolyResponse, a conversational search engine that supports task-oriented dialogue. It is a retrieval-based approach that bypasses the complex multi-component design of traditional task-oriented dialogue systems and the use of explicit semantics in the form of task-specific ontologies. The PolyResponse engine is trained on hundreds of millions of examples extracted from real conversations: it learns what responses are appropriate in different conversational contexts. It then ranks a large index of text and visual responses according to their similarity to the given context, and narrows down the list of relevant entities during the multi-turn conversation. We introduce a restaurant search and booking system powered by the PolyResponse engine, currently available in 8 different languages.
Comment: EMNLP 2019 (Demo paper)
Published: 2019

14. Hello, It's GPT-2 -- How Can I Help You? Towards the Use of Pretrained Language Models for Task-Oriented Dialogue Systems

Author: Budzianowski, Paweł and Vulić, Ivan
Subjects: Computer Science - Computation and Language
Abstract: Data scarcity is a long-standing and crucial challenge that hinders quick development of task-oriented dialogue systems across multiple domains: task-oriented dialogue models are expected to learn grammar, syntax, dialogue reasoning, decision making, and language generation from absurdly small amounts of task-specific data. In this paper, we demonstrate that recent progress in language modeling pre-training and transfer learning shows promise to overcome this problem. We propose a task-oriented dialogue model that operates solely on text input: it effectively bypasses explicit policy and language generation modules. Building on top of the TransferTransfo framework (Wolf et al., 2019) and generative model pre-training (Radford et al., 2019), we validate the approach on complex multi-domain task-oriented dialogues from the MultiWOZ dataset. Our automatic and human evaluations show that the proposed model is on par with a strong task-specific neural baseline. In the long run, our approach holds promise to mitigate the data scarcity problem, and to support the construction of more engaging and more eloquent task-oriented conversational agents.
Published: 2019

15. Training Neural Response Selection for Task-Oriented Dialogue Systems

Author: Henderson, Matthew, Vulić, Ivan, Gerz, Daniela, Casanueva, Iñigo, Budzianowski, Paweł, Coope, Sam, Spithourakis, Georgios, Wen, Tsung-Hsien, Mrkšić, Nikola, and Su, Pei-Hao
Subjects: Computer Science - Computation and Language
Abstract: Despite their popularity in the chatbot literature, retrieval-based models have had modest impact on task-oriented dialogue systems, with the main obstacle to their application being the low-data regime of most task-oriented dialogue tasks. Inspired by the recent success of pretraining in language modelling, we propose an effective method for deploying response selection in task-oriented dialogue. To train response selection models for task-oriented dialogue tasks, we propose a novel method which: 1) pretrains the response selection model on large general-domain conversational corpora; and then 2) fine-tunes the pretrained model for the target dialogue domain, relying only on the small in-domain dataset to capture the nuances of the given dialogue domain. Our evaluation on six diverse application domains, ranging from e-commerce to banking, demonstrates the effectiveness of the proposed training method.
Comment: ACL 2019 long paper
Published: 2019

16. A Repository of Conversational Datasets

Author: Henderson, Matthew, Budzianowski, Paweł, Casanueva, Iñigo, Coope, Sam, Gerz, Daniela, Kumar, Girish, Mrkšić, Nikola, Spithourakis, Georgios, Su, Pei-Hao, Vulić, Ivan, and Wen, Tsung-Hsien
Subjects: Computer Science - Computation and Language
Abstract: Progress in Machine Learning is often driven by the availability of large datasets, and consistent evaluation metrics for comparing modeling approaches. To this end, we present a repository of conversational datasets consisting of hundreds of millions of examples, and a standardised evaluation procedure for conversational response selection models using '1-of-100 accuracy'. The repository contains scripts that allow researchers to reproduce the standard datasets, or to adapt the pre-processing and data filtering steps to their needs. We introduce and evaluate several competitive baselines for conversational response selection, whose implementations are shared in the repository, as well as a neural encoder model that is trained on the entire training set.
Published: 2019

17. Addressing Objects and Their Relations: The Conversational Entity Dialogue Model

Author: Ultes, Stefan, Budzianowski, Paweł, Casanueva, Iñigo, Rojas-Barahona, Lina, Tseng, Bo-Hsiang, Wu, Yen-Chen, Young, Steve, and Gašić, Milica
Subjects: Computer Science - Computation and Language
Abstract: Statistical spoken dialogue systems usually rely on a single- or multi-domain dialogue model that is restricted in its capabilities of modelling complex dialogue structures, e.g., relations. In this work, we propose a novel dialogue model that is centred around entities and is able to model relations as well as multiple entities of the same type. We demonstrate in a prototype implementation benefits of relation modelling on the dialogue level and show that a trained policy using these relations outperforms the multi-domain baseline. Furthermore, we show that by modelling the relations on the dialogue level, the system is capable of processing relations present in the user input and even learns to address them in the system response.
Comment: Accepted at SIGDial 2018
Published: 2019

18. Variational Cross-domain Natural Language Generation for Spoken Dialogue Systems

Author: Tseng, Bo-Hsiang, Kreyssig, Florian, Budzianowski, Pawel, Casanueva, Inigo, Wu, Yen-Chen, Ultes, Stefan, and Gasic, Milica
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Cross-domain natural language generation (NLG) is still a difficult task within spoken dialogue modelling. Given a semantic representation provided by the dialogue manager, the language generator should generate sentences that convey desired information. Traditional template-based generators can produce sentences with all necessary information, but these sentences are not sufficiently diverse. With RNN-based models, the diversity of the generated sentences can be high, however, in the process some information is lost. In this work, we improve an RNN-based generator by considering latent information at the sentence level during generation using the conditional variational autoencoder architecture. We demonstrate that our model outperforms the original RNN-based generator, while yielding highly diverse sentences. In addition, our model performs better when the training data is limited.
Comment: Sigdial 2018
Published: 2018

19. MultiWOZ -- A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling

Author: Budzianowski, Paweł, Wen, Tsung-Hsien, Tseng, Bo-Hsiang, Casanueva, Iñigo, Ultes, Stefan, Ramadan, Osman, and Gašić, Milica
Subjects: Computer Science - Computation and Language
Abstract: Even though machine learning has become the major scene in dialogue research community, the real breakthrough has been blocked by the scale of data available. To address this fundamental obstacle, we introduce the Multi-Domain Wizard-of-Oz dataset (MultiWOZ), a fully-labeled collection of human-human written conversations spanning over multiple domains and topics. At a size of 10k dialogues, it is at least one order of magnitude larger than all previous annotated task-oriented corpora. The contribution of this work apart from the open-sourced dataset labelled with dialogue belief states and dialogue actions is two-fold: firstly, a detailed description of the data collection procedure along with a summary of data structure and analysis is provided. The proposed data-collection pipeline is entirely based on crowd-sourcing without the need of hiring professional annotators; secondly, a set of benchmark results of belief tracking, dialogue act and response generation is reported, which shows the usability of the data and sets a baseline for future studies.
Comment: Accepted for publication at EMNLP 2018
Published: 2018

20. Large-Scale Multi-Domain Belief Tracking with Knowledge Sharing

Author: Ramadan, Osman, Budzianowski, Paweł, and Gašić, Milica
Subjects: Computer Science - Computation and Language
Abstract: Robust dialogue belief tracking is a key component in maintaining good quality dialogue systems. The tasks that dialogue systems are trying to solve are becoming increasingly complex, requiring scalability to multi domain, semantically rich dialogues. However, most current approaches have difficulty scaling up with domains because of the dependency of the model parameters on the dialogue ontology. In this paper, a novel approach is introduced that fully utilizes semantic similarity between dialogue utterances and the ontology terms, allowing the information to be shared across domains. The evaluation is performed on a recently collected multi-domain dialogues dataset, one order of magnitude larger than currently available corpora. Our model demonstrates great capability in handling multi-domain dialogues, simultaneously outperforming existing state-of-the-art models in single-domain dialogue tracking tasks.
Comment: 10 pages, 1 figure and 2 tables. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL)
Published: 2018

21. Nearly Zero-Shot Learning for Semantic Decoding in Spoken Dialogue Systems

Author: Rojas-Barahona, Lina M., Ultes, Stefan, Budzianowski, Pawel, Casanueva, Iñigo, Gasic, Milica, Tseng, Bo-Hsiang, and Young, Steve
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: This paper presents two ways of dealing with scarce data in semantic decoding using N-Best speech recognition hypotheses. First, we learn features by using a deep learning architecture in which the weights for the unknown and known categories are jointly optimised. Second, an unsupervised method is used for further tuning the weights. Sharing weights injects prior knowledge to unknown categories. The unsupervised tuning (i.e. the risk minimisation) improves the F-Measure when recognising nearly zero-shot data on the DSTC3 corpus. This unsupervised method can be applied subject to two assumptions: the rank of the class marginal is assumed to be known and the class-conditional scores of the classifier are assumed to follow a Gaussian distribution.
Published: 2018

22. Neural User Simulation for Corpus-based Policy Optimisation for Spoken Dialogue Systems

Author: Kreyssig, Florian, Casanueva, Inigo, Budzianowski, Pawel, and Gasic, Milica
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Statistics - Machine Learning
Abstract: User Simulators are one of the major tools that enable offline training of task-oriented dialogue systems. For this task the Agenda-Based User Simulator (ABUS) is often used. The ABUS is based on hand-crafted rules and its output is in semantic form. Issues arise from both properties such as limited diversity and the inability to interface a text-level belief tracker. This paper introduces the Neural User Simulator (NUS) whose behaviour is learned from a corpus and which generates natural language, hence needing a less labelled dataset than simulators generating a semantic output. In comparison to much of the past work on this topic, which evaluates user simulators on corpus-based metrics, we use the NUS to train the policy of a reinforcement learning based Spoken Dialogue System. The NUS is compared to the ABUS by evaluating the policies that were trained using the simulators. Cross-model evaluation is performed i.e. training on one simulator and testing on the other. Furthermore, the trained policies are tested on real users. In both evaluation tasks the NUS outperformed the ABUS.
Comment: Accepted to SIGDIAL 2018
Published: 2018

23. Feudal Reinforcement Learning for Dialogue Management in Large Domains

Author: Casanueva, Iñigo, Budzianowski, Paweł, Su, Pei-Hao, Ultes, Stefan, Rojas-Barahona, Lina, Tseng, Bo-Hsiang, and Gašić, Milica
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Neural and Evolutionary Computing
Abstract: Reinforcement learning (RL) is a promising approach to solve dialogue policy optimisation. Traditional RL algorithms, however, fail to scale to large domains due to the curse of dimensionality. We propose a novel Dialogue Management architecture, based on Feudal RL, which decomposes the decision into two steps; a first step where a master policy selects a subset of primitive actions, and a second step where a primitive action is chosen from the selected subset. The structural information included in the domain ontology is used to abstract the dialogue state space, taking the decisions at each step using different parts of the abstracted state. This, combined with an information sharing mechanism between slots, increases the scalability to large domains. We show that an implementation of this approach, based on Deep-Q Networks, significantly outperforms previous state of the art in several dialogue domains and environments, without the need of any additional reward signal.
Comment: Accepted as a short paper in NAACL 2018
Published: 2018

24. Sample Efficient Deep Reinforcement Learning for Dialogue Systems with Large Action Spaces

Author: Weisz, Gellért, Budzianowski, Paweł, Su, Pei-Hao, and Gašić, Milica
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Learning, Statistics - Machine Learning
Abstract: In spoken dialogue systems, we aim to deploy artificial intelligence to build automated dialogue agents that can converse with humans. A part of this effort is the policy optimisation task, which attempts to find a policy describing how to respond to humans, in the form of a function taking the current state of the dialogue and returning the response of the system. In this paper, we investigate deep reinforcement learning approaches to solve this problem. Particular attention is given to actor-critic methods, off-policy reinforcement learning with experience replay, and various methods aimed at reducing the bias and variance of estimators. When combined, these methods result in the previously proposed ACER algorithm that gave competitive results in gaming environments. These environments however are fully observable and have a relatively small action set so in this paper we examine the application of ACER to dialogue policy optimisation. We show that this method beats the current state-of-the-art in deep learning approaches for spoken dialogue systems. This not only leads to a more sample efficient algorithm that can train faster, but also allows us to apply the algorithm in more difficult environments than before. We thus experiment with learning in a very large action space, which has two orders of magnitude more actions than previously considered. We find that ACER trains significantly faster than the current state-of-the-art.
Published: 2018

25. Uncertainty Estimates for Efficient Neural Network-based Dialogue Policy Optimisation

Author: Tegho, Christopher, Budzianowski, Paweł, and Gašić, Milica
Subjects: Statistics - Machine Learning, Computer Science - Computation and Language, Computer Science - Learning, Computer Science - Neural and Evolutionary Computing
Abstract: In statistical dialogue management, the dialogue manager learns a policy that maps a belief state to an action for the system to perform. Efficient exploration is key to successful policy optimisation. Current deep reinforcement learning methods are very promising but rely on epsilon-greedy exploration, thus subjecting the user to a random choice of action during learning. Alternative approaches such as Gaussian Process SARSA (GPSARSA) estimate uncertainties and are sample efficient, leading to better user experience, but at the expense of greater computational complexity. This paper examines approaches to extract uncertainty estimates from deep Q-networks (DQN) in the context of dialogue management. We perform an extensive benchmark of deep Bayesian methods to extract uncertainty estimates, namely Bayes-By-Backprop, dropout, its concrete variation, bootstrapped ensemble and alpha-divergences, combining them with the DQN algorithm., Comment: Accepted at the Bayesian Deep Learning Workshop, 31st Conference on Neural Information Processing Systems (NIPS 2017)
Published: 2017

26. A Benchmarking Environment for Reinforcement Learning Based Task Oriented Dialogue Management

Author: Casanueva, Iñigo, Budzianowski, Paweł, Su, Pei-Hao, Mrkšić, Nikola, Wen, Tsung-Hsien, Ultes, Stefan, Rojas-Barahona, Lina, Young, Steve, and Gašić, Milica
Subjects: Statistics - Machine Learning, Computer Science - Computation and Language, Computer Science - Neural and Evolutionary Computing
Abstract: Dialogue assistants are rapidly becoming an indispensable daily aid. To avoid the significant effort needed to hand-craft the required dialogue flow, the Dialogue Management (DM) module can be cast as a continuous Markov Decision Process (MDP) and trained through Reinforcement Learning (RL). Several RL models have been investigated over recent years. However, the lack of a common benchmarking framework makes it difficult to perform a fair comparison between different models and their capability to generalise to different environments. Therefore, this paper proposes a set of challenging simulated environments for dialogue model development and evaluation. To provide some baselines, we investigate a number of representative parametric algorithms, namely deep reinforcement learning algorithms - DQN, A2C and Natural Actor-Critic and compare them to a non-parametric model, GP-SARSA. Both the environments and policy models are implemented using the publicly available PyDial toolkit and released on-line, in order to establish a testbed framework for further experiments and to facilitate experimental reproducibility., Comment: Accepted at the Deep Reinforcement Learning Symposium, 31st Conference on Neural Information Processing Systems (NIPS 2017) Paper updated with minor changes
Published: 2017

27. Reward-Balancing for Statistical Spoken Dialogue Systems using Multi-objective Reinforcement Learning

Author: Ultes, Stefan, Budzianowski, Paweł, Casanueva, Iñigo, Mrkšić, Nikola, Rojas-Barahona, Lina, Su, Pei-Hao, Wen, Tsung-Hsien, Gašić, Milica, and Young, Steve
Subjects: Computer Science - Computation and Language, Statistics - Machine Learning
Abstract: Reinforcement learning is widely used for dialogue policy optimization where the reward function often consists of more than one component, e.g., the dialogue success and the dialogue length. In this work, we propose a structured method for finding a good balance between these components by searching for the optimal reward component weighting. To render this search feasible, we use multi-objective reinforcement learning to significantly reduce the number of training dialogues required. We apply our proposed method to find optimized component weights for six domains and compare them to a default baseline., Comment: Accepted at SIGDial 2017
Published: 2017

28. Sample-efficient Actor-Critic Reinforcement Learning with Supervised Data for Dialogue Management

Author: Su, Pei-Hao, Budzianowski, Pawel, Ultes, Stefan, Gasic, Milica, and Young, Steve
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Learning
Abstract: Deep reinforcement learning (RL) methods have significant potential for dialogue policy optimisation. However, they suffer from poor performance in the early stages of learning. This is especially problematic for on-line learning with real users. Two approaches are introduced to tackle this problem. Firstly, to speed up the learning process, two sample-efficient neural network algorithms are presented: trust region actor-critic with experience replay (TRACER) and episodic natural actor-critic with experience replay (eNACER). For TRACER, the trust region helps to control the learning step size and avoid catastrophic model changes. For eNACER, the natural gradient identifies the steepest ascent direction in policy space to speed up the convergence. Both models employ off-policy learning with experience replay to improve sample-efficiency. Secondly, to mitigate the cold start issue, a corpus of demonstration data is utilised to pre-train the models prior to on-line reinforcement learning. Combining these two approaches, we demonstrate a practical approach to learn deep RL-based dialogue policies and show their effectiveness in a task-oriented information seeking domain., Comment: Accepted as a long paper in SigDial 2017
Published: 2017

29. Sub-domain Modelling for Dialogue Management with Hierarchical Reinforcement Learning

Author: Budzianowski, Paweł, Ultes, Stefan, Su, Pei-Hao, Mrkšić, Nikola, Wen, Tsung-Hsien, Casanueva, Iñigo, Rojas-Barahona, Lina, and Gašić, Milica
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Human conversation is inherently complex, often spanning many different topics/domains. This makes policy learning for dialogue systems very challenging. Standard flat reinforcement learning methods do not provide an efficient framework for modelling such dialogues. In this paper, we focus on the under-explored problem of multi-domain dialogue management. First, we propose a new method for hierarchical reinforcement learning using the option framework. Next, we show that the proposed architecture learns faster and arrives at a better policy than the existing flat ones do. Moreover, we show how pretrained policies can be adapted to more complex systems with an additional set of new actions. In doing that, we show that our approach has the potential to facilitate policy optimisation for more sophisticated multi-domain dialogue systems., Comment: Update of the section 4 and the bibliography
Published: 2017

30. Predictors of atrial fibrillation early recurrence following cryoballoon ablation of pulmonary veins using statistical assessment and machine learning algorithms

Author: Budzianowski, Jan, Hiczkiewicz, Jarosław, Burchardt, Paweł, Pieszko, Konrad, Rzeźniczak, Janusz, Budzianowski, Paweł, and Korybalska, Katarzyna
Published: 2019
Full Text: View/download PDF

31. Knowledge-Aware Audio-Grounded Generative Slot Filling for Limited Annotated Data

Author: Sun, Guangzhi, primary, Zhang, Chao, additional, Vulić, Ivan, additional, Budzianowski, Paweł, additional, and Woodland, Phil, additional
Published: 2023
Full Text: View/download PDF

32. Machine-learned models using hematological inflammation markers in the prediction of short-term acute coronary syndrome outcomes

Author: Pieszko, Konrad, Hiczkiewicz, Jarosław, Budzianowski, Paweł, Rzeźniczak, Janusz, Budzianowski, Jan, Błaszczyński, Jerzy, Słowiński, Roman, and Burchardt, Paweł
Published: 2018
Full Text: View/download PDF

33. NLU++: A Multi-Label, Slot-Rich, Generalisable Dataset for Natural Language Understanding in Task-Oriented Dialogue

Author: Casanueva, Inigo, primary, Vulić, Ivan, additional, Spithourakis, Georgios, additional, and Budzianowski, Paweł, additional
Published: 2022
Full Text: View/download PDF

34. EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and Identification

Author: Spithourakis, Georgios, primary, Vulić, Ivan, additional, Lis, Michał, additional, Casanueva, Inigo, additional, and Budzianowski, Paweł, additional
Published: 2022
Full Text: View/download PDF

35. Multi-Label Intent Detection via Contrastive Task Specialization of Sentence Encoders

Author: Vulić, Ivan, primary, Casanueva, Iñigo, additional, Spithourakis, Georgios, additional, Mondal, Avishek, additional, Wen, Tsung-Hsien, additional, and Budzianowski, Paweł, additional
Published: 2022
Full Text: View/download PDF

36. ConvFiT: Conversational Fine-Tuning of Pretrained Language Models

Author: Vulić, Ivan, primary, Su, Pei-Hao, additional, Coope, Samuel, additional, Gerz, Daniela, additional, Budzianowski, Paweł, additional, Casanueva, Iñigo, additional, Mrkšić, Nikola, additional, and Wen, Tsung-Hsien, additional
Published: 2021
Full Text: View/download PDF

37. Predicting Long-Term Mortality after Acute Coronary Syndrome Using Machine Learning Techniques and Hematological Markers

Author: Pieszko, Konrad, primary, Hiczkiewicz, Jarosław, additional, Budzianowski, Paweł, additional, Budzianowski, Jan, additional, Rzeźniczak, Janusz, additional, Pieszko, Karolina, additional, and Burchardt, Paweł, additional
Published: 2019
Full Text: View/download PDF

38. Tree-Structured Semantic Encoder with Knowledge Sharing for Domain Adaptation in Natural Language Generation

Author: Tseng, Bo-Hsiang, primary, Budzianowski, Paweł, additional, Wu, Yen-chen, additional, and Gasic, Milica, additional
Published: 2019
Full Text: View/download PDF

39. Semi-Supervised Bootstrapping of Dialogue State Trackers for Task-Oriented Modelling

Author: Tseng, Bo-Hsiang, primary, Rei, Marek, additional, Budzianowski, Paweł, additional, Turner, Richard, additional, Byrne, Bill, additional, and Korhonen, Anna, additional
Published: 2019
Full Text: View/download PDF

40. A Repository of Conversational Datasets

Author: Henderson, Matthew, primary, Budzianowski, Paweł, additional, Casanueva, Iñigo, additional, Coope, Sam, additional, Gerz, Daniela, additional, Kumar, Girish, additional, Mrkšić, Nikola, additional, Spithourakis, Georgios, additional, Su, Pei-Hao, additional, Vulić, Ivan, additional, and Wen, Tsung-Hsien, additional
Published: 2019
Full Text: View/download PDF

41. PolyResponse: A Rank-based Approach to Task-Oriented Dialogue with Application in Restaurant Search and Booking

Author: Henderson, Matthew, primary, Vulić, Ivan, additional, Casanueva, Iñigo, additional, Budzianowski, Paweł, additional, Gerz, Daniela, additional, Coope, Sam, additional, Spithourakis, Georgios, additional, Wen, Tsung-Hsien, additional, Mrkšić, Nikola, additional, and Su, Pei-Hao, additional
Published: 2019
Full Text: View/download PDF

42. Hello, It’s GPT-2 - How Can I Help You? Towards the Use of Pretrained Language Models for Task-Oriented Dialogue Systems

Author: Budzianowski, Paweł, primary and Vulić, Ivan, additional
Published: 2019
Full Text: View/download PDF

43. Training Neural Response Selection for Task-Oriented Dialogue Systems

Author: Henderson, Matthew, primary, Vulić, Ivan, additional, Gerz, Daniela, additional, Casanueva, Iñigo, additional, Budzianowski, Paweł, additional, Coope, Sam, additional, Spithourakis, Georgios, additional, Wen, Tsung-Hsien, additional, Mrkšić, Nikola, additional, and Su, Pei-Hao, additional
Published: 2019
Full Text: View/download PDF

44. Predictors of atrial fibrillation early recurrence following cryoballoon ablation of pulmonary veins using statistical assessment and machine learning algorithms

Author: Budzianowski, Jan, primary, Hiczkiewicz, Jarosław, additional, Burchardt, Paweł, additional, Pieszko, Konrad, additional, Rzeźniczak, Janusz, additional, Budzianowski, Paweł, additional, and Korybalska, Katarzyna, additional
Published: 2018
Full Text: View/download PDF

45. MultiWOZ - A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling

Author: Budzianowski, Paweł, primary, Wen, Tsung-Hsien, additional, Tseng, Bo-Hsiang, additional, Casanueva, Iñigo, additional, Ultes, Stefan, additional, Ramadan, Osman, additional, and Gašić, Milica, additional
Published: 2018
Full Text: View/download PDF

46. Feudal Dialogue Management with Jointly Learned Feature Extractors

Author: Casanueva, Iñigo, primary, Budzianowski, Paweł, additional, Ultes, Stefan, additional, Kreyssig, Florian, additional, Tseng, Bo-Hsiang, additional, Wu, Yen-chen, additional, and Gašić, Milica, additional
Published: 2018
Full Text: View/download PDF

47. Large-Scale Multi-Domain Belief Tracking with Knowledge Sharing

Author: Ramadan, Osman, primary, Budzianowski, Paweł, additional, and Gašić, Milica, additional
Published: 2018
Full Text: View/download PDF

48. Addressing Objects and Their Relations: The Conversational Entity Dialogue Model

Author: Ultes, Stefan, primary, Budzianowski, Paweł, additional, Casanueva, Iñigo, additional, Rojas-Barahona, Lina M., additional, Tseng, Bo-Hsiang, additional, Wu, Yen-Chen, additional, Young, Steve, additional, and Gašić, Milica, additional
Published: 2018
Full Text: View/download PDF

49. Feudal Reinforcement Learning for Dialogue Management in Large Domains

Author: Casanueva, Iñigo, primary, Budzianowski, Paweł, additional, Su, Pei-Hao, additional, Ultes, Stefan, additional, Rojas Barahona, Lina M., additional, Tseng, Bo-Hsiang, additional, and Gasic, Milica, additional
Published: 2018
Full Text: View/download PDF

50. Neural User Simulation for Corpus-based Policy Optimisation of Spoken Dialogue Systems

Author: Kreyssig, Florian, primary, Casanueva, Iñigo, additional, Budzianowski, Paweł, additional, and Gašić, Milica, additional
Published: 2018
Full Text: View/download PDF
