Skip to main content

Showing 1–27 of 27 results for author: Budzianowski, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.02839  [pdf, ps, other

    eess.AS cs.AI cs.CL

    Pheme: Efficient and Conversational Speech Generation

    Authors: Paweł Budzianowski, Taras Sereda, Tomasz Cichy, Ivan Vulić

    Abstract: In recent years, speech generation has seen remarkable progress, now achieving one-shot generation capability that is often virtually indistinguishable from real human voice. Integrating such advancements in speech generation with large language models might revolutionize a wide range of applications. However, certain applications, such as assistive conversational systems, require natural and conv… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

  2. arXiv:2311.09800  [pdf, other

    cs.CL

    $\textit{Dial BeInfo for Faithfulness}$: Improving Factuality of Information-Seeking Dialogue via Behavioural Fine-Tuning

    Authors: Evgeniia Razumovskaia, Ivan Vulić, Pavle Marković, Tomasz Cichy, Qian Zheng, Tsung-Hsien Wen, Paweł Budzianowski

    Abstract: Factuality is a crucial requirement in information seeking dialogue: the system should respond to the user's queries so that the responses are meaningful and aligned with the knowledge provided to the system. However, most modern large language models suffer from hallucinations, that is, they generate responses not supported by or contradicting the knowledge source. To mitigate the issue and incre… ▽ More

    Submitted 4 March, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  3. arXiv:2307.01764  [pdf, other

    cs.CL

    Knowledge-Aware Audio-Grounded Generative Slot Filling for Limited Annotated Data

    Authors: Guangzhi Sun, Chao Zhang, Ivan Vulić, Paweł Budzianowski, Philip C. Woodland

    Abstract: Manually annotating fine-grained slot-value labels for task-oriented dialogue (ToD) systems is an expensive and time-consuming endeavour. This motivates research into slot-filling methods that operate with limited amounts of labelled data. Moreover, the majority of current work on ToD is based solely on text as the input modality, neglecting the additional challenges of imperfect automatic speech… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

    Comments: to submit to CS&L

  4. arXiv:2204.13496  [pdf, other

    cs.CL cs.LG

    EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and Identification

    Authors: Georgios P. Spithourakis, Ivan Vulić, Michał Lis, Iñigo Casanueva, Paweł Budzianowski

    Abstract: Knowledge-based authentication is crucial for task-oriented spoken dialogue systems that offer personalised and privacy-focused services. Such systems should be able to enrol (E), verify (V), and identify (I) new and recurring users based on their personal information, e.g. postcode, name, and date of birth. In this work, we formalise the three authentication tasks and their evaluation protocols,… ▽ More

    Submitted 28 April, 2022; originally announced April 2022.

    Comments: 13 pages, 7 figures, 7 tables. Accepted in NAACL 2022 (Findings)

  5. arXiv:2204.13021  [pdf, other

    cs.CL cs.LG

    NLU++: A Multi-Label, Slot-Rich, Generalisable Dataset for Natural Language Understanding in Task-Oriented Dialogue

    Authors: Iñigo Casanueva, Ivan Vulić, Georgios P. Spithourakis, Paweł Budzianowski

    Abstract: We present NLU++, a novel dataset for natural language understanding (NLU) in task-oriented dialogue (ToD) systems, with the aim to provide a much more challenging evaluation environment for dialogue NLU models, up to date with the current application and industry requirements. NLU++ is divided into two domains (BANKING and HOTELS) and brings several crucial improvements over current commonly used… ▽ More

    Submitted 5 May, 2022; v1 submitted 27 April, 2022; originally announced April 2022.

    Comments: 16 pages, 1 figure, 10 tables. Accepted in NAACL 2022 (Findings)

  6. arXiv:2204.02123  [pdf, other

    cs.CL

    Improved and Efficient Conversational Slot Labeling through Question Answering

    Authors: Gabor Fuisz, Ivan Vulić, Samuel Gibbons, Inigo Casanueva, Paweł Budzianowski

    Abstract: Transformer-based pretrained language models (PLMs) offer unmatched performance across the majority of natural language understanding (NLU) tasks, including a body of question answering (QA) tasks. We hypothesize that improvements in QA methodology can also be directly exploited in dialog NLU; however, dialog tasks must be \textit{reformatted} into QA tasks. In particular, we focus on modeling and… ▽ More

    Submitted 5 April, 2022; originally announced April 2022.

  7. arXiv:2109.10126  [pdf, other

    cs.CL

    ConvFiT: Conversational Fine-Tuning of Pretrained Language Models

    Authors: Ivan Vulić, Pei-Hao Su, Sam Coope, Daniela Gerz, Paweł Budzianowski, Iñigo Casanueva, Nikola Mrkšić, Tsung-Hsien Wen

    Abstract: Transformer-based language models (LMs) pretrained on large text collections are proven to store a wealth of semantic knowledge. However, 1) they are not effective as sentence encoders when used off-the-shelf, and 2) thus typically lag behind conversationally pretrained (e.g., via response selection) encoders on conversational tasks such as intent detection (ID). In this work, we propose ConvFiT,… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021 (long paper)

  8. arXiv:1911.11672  [pdf, other

    cs.CL

    Semi-supervised Bootstrap** of Dialogue State Trackers for Task Oriented Modelling

    Authors: Bo-Hsiang Tseng, Marek Rei, Paweł Budzianowski, Richard E. Turner, Bill Byrne, Anna Korhonen

    Abstract: Dialogue systems benefit greatly from optimizing on detailed annotations, such as transcribed utterances, internal dialogue state representations and dialogue act labels. However, collecting these annotations is expensive and time-consuming, holding back development in the area of dialogue modelling. In this paper, we investigate semi-supervised learning methods that are able to reduce the amount… ▽ More

    Submitted 26 November, 2019; originally announced November 2019.

    Comments: This article is published at EMNLP-IJCNLP 2019

  9. arXiv:1910.06719  [pdf, other

    cs.CL cs.LG

    Tree-Structured Semantic Encoder with Knowledge Sharing for Domain Adaptation in Natural Language Generation

    Authors: Bo-Hsiang Tseng, Paweł Budzianowski, Yen-Chen Wu, Milica Gašić

    Abstract: Domain adaptation in natural language generation (NLG) remains challenging because of the high complexity of input semantics across domains and limited data of a target domain. This is particularly the case for dialogue systems, where we want to be able to seamlessly include new domains into the conversation. Therefore, it is crucial for generation models to share knowledge across domains for the… ▽ More

    Submitted 2 October, 2019; originally announced October 2019.

    Comments: Published in SIGDIAL2019

  10. arXiv:1909.07101  [pdf, other

    cs.CL

    Domain Transfer in Dialogue Systems without Turn-Level Supervision

    Authors: Joachim Bingel, Victor Petrén Bach Hansen, Ana Valeria Gonzalez, Paweł Budzianowski, Isabelle Augenstein, Anders Søgaard

    Abstract: Task oriented dialogue systems rely heavily on specialized dialogue state tracking (DST) modules for dynamically predicting user intent throughout the conversation. State-of-the-art DST models are typically trained in a supervised manner from manual annotations at the turn level. However, these annotations are costly to obtain, which makes it difficult to create accurate dialogue systems for new d… ▽ More

    Submitted 16 September, 2019; originally announced September 2019.

  11. arXiv:1909.01296  [pdf, other

    cs.CL

    PolyResponse: A Rank-based Approach to Task-Oriented Dialogue with Application in Restaurant Search and Booking

    Authors: Matthew Henderson, Ivan Vulić, Iñigo Casanueva, Paweł Budzianowski, Daniela Gerz, Sam Coope, Georgios Spithourakis, Tsung-Hsien Wen, Nikola Mrkšić, Pei-Hao Su

    Abstract: We present PolyResponse, a conversational search engine that supports task-oriented dialogue. It is a retrieval-based approach that bypasses the complex multi-component design of traditional task-oriented dialogue systems and the use of explicit semantics in the form of task-specific ontologies. The PolyResponse engine is trained on hundreds of millions of examples extracted from real conversation… ▽ More

    Submitted 3 September, 2019; originally announced September 2019.

    Comments: EMNLP 2019 (Demo paper)

  12. arXiv:1907.05774  [pdf, other

    cs.CL

    Hello, It's GPT-2 -- How Can I Help You? Towards the Use of Pretrained Language Models for Task-Oriented Dialogue Systems

    Authors: Paweł Budzianowski, Ivan Vulić

    Abstract: Data scarcity is a long-standing and crucial challenge that hinders quick development of task-oriented dialogue systems across multiple domains: task-oriented dialogue models are expected to learn grammar, syntax, dialogue reasoning, decision making, and language generation from absurdly small amounts of task-specific data. In this paper, we demonstrate that recent progress in language modeling pr… ▽ More

    Submitted 4 August, 2019; v1 submitted 12 July, 2019; originally announced July 2019.

  13. arXiv:1906.01543  [pdf, other

    cs.CL

    Training Neural Response Selection for Task-Oriented Dialogue Systems

    Authors: Matthew Henderson, Ivan Vulić, Daniela Gerz, Iñigo Casanueva, Paweł Budzianowski, Sam Coope, Georgios Spithourakis, Tsung-Hsien Wen, Nikola Mrkšić, Pei-Hao Su

    Abstract: Despite their popularity in the chatbot literature, retrieval-based models have had modest impact on task-oriented dialogue systems, with the main obstacle to their application being the low-data regime of most task-oriented dialogue tasks. Inspired by the recent success of pretraining in language modelling, we propose an effective method for deploying response selection in task-oriented dialogue.… ▽ More

    Submitted 7 June, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: ACL 2019 long paper

  14. arXiv:1904.06472  [pdf, other

    cs.CL

    A Repository of Conversational Datasets

    Authors: Matthew Henderson, Paweł Budzianowski, Iñigo Casanueva, Sam Coope, Daniela Gerz, Girish Kumar, Nikola Mrkšić, Georgios Spithourakis, Pei-Hao Su, Ivan Vulić, Tsung-Hsien Wen

    Abstract: Progress in Machine Learning is often driven by the availability of large datasets, and consistent evaluation metrics for comparing modeling approaches. To this end, we present a repository of conversational datasets consisting of hundreds of millions of examples, and a standardised evaluation procedure for conversational response selection models using '1-of-100 accuracy'. The repository contains… ▽ More

    Submitted 28 May, 2019; v1 submitted 12 April, 2019; originally announced April 2019.

    Journal ref: Proceedings of the Workshop on NLP for Conversational AI (2019)

  15. arXiv:1901.01466  [pdf, other

    cs.CL

    Addressing Objects and Their Relations: The Conversational Entity Dialogue Model

    Authors: Stefan Ultes, Paweł\ Budzianowski, Iñigo Casanueva, Lina Rojas-Barahona, Bo-Hsiang Tseng, Yen-Chen Wu, Steve Young, Milica Gašić

    Abstract: Statistical spoken dialogue systems usually rely on a single- or multi-domain dialogue model that is restricted in its capabilities of modelling complex dialogue structures, e.g., relations. In this work, we propose a novel dialogue model that is centred around entities and is able to model relations as well as multiple entities of the same type. We demonstrate in a prototype implementation benefi… ▽ More

    Submitted 5 January, 2019; originally announced January 2019.

    Comments: Accepted at SIGDial 2018

  16. arXiv:1812.08879  [pdf, other

    cs.CL cs.AI

    Variational Cross-domain Natural Language Generation for Spoken Dialogue Systems

    Authors: Bo-Hsiang Tseng, Florian Kreyssig, Pawel Budzianowski, Inigo Casanueva, Yen-Chen Wu, Stefan Ultes, Milica Gasic

    Abstract: Cross-domain natural language generation (NLG) is still a difficult task within spoken dialogue modelling. Given a semantic representation provided by the dialogue manager, the language generator should generate sentences that convey desired information. Traditional template-based generators can produce sentences with all necessary information, but these sentences are not sufficiently diverse. Wit… ▽ More

    Submitted 20 December, 2018; originally announced December 2018.

    Comments: Sigdial 2018

  17. arXiv:1810.00278  [pdf, other

    cs.CL

    MultiWOZ -- A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling

    Authors: Paweł Budzianowski, Tsung-Hsien Wen, Bo-Hsiang Tseng, Iñigo Casanueva, Stefan Ultes, Osman Ramadan, Milica Gašić

    Abstract: Even though machine learning has become the major scene in dialogue research community, the real breakthrough has been blocked by the scale of data available. To address this fundamental obstacle, we introduce the Multi-Domain Wizard-of-Oz dataset (MultiWOZ), a fully-labeled collection of human-human written conversations spanning over multiple domains and topics. At a size of $10$k dialogues, it… ▽ More

    Submitted 20 April, 2020; v1 submitted 29 September, 2018; originally announced October 2018.

    Comments: Accepted for publication at EMNLP 2018

  18. arXiv:1807.06517  [pdf, other

    cs.CL

    Large-Scale Multi-Domain Belief Tracking with Knowledge Sharing

    Authors: Osman Ramadan, Paweł Budzianowski, Milica Gašić

    Abstract: Robust dialogue belief tracking is a key component in maintaining good quality dialogue systems. The tasks that dialogue systems are trying to solve are becoming increasingly complex, requiring scalability to multi domain, semantically rich dialogues. However, most current approaches have difficulty scaling up with domains because of the dependency of the model parameters on the dialogue ontology.… ▽ More

    Submitted 17 July, 2018; originally announced July 2018.

    Comments: 10 pages, 1 figure and 2 tables. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL)

  19. arXiv:1806.05484  [pdf, other

    cs.CL cs.AI

    Nearly Zero-Shot Learning for Semantic Decoding in Spoken Dialogue Systems

    Authors: Lina M. Rojas-Barahona, Stefan Ultes, Pawel Budzianowski, Iñigo Casanueva, Milica Gasic, Bo-Hsiang Tseng, Steve Young

    Abstract: This paper presents two ways of dealing with scarce data in semantic decoding using N-Best speech recognition hypotheses. First, we learn features by using a deep learning architecture in which the weights for the unknown and known categories are jointly optimised. Second, an unsupervised method is used for further tuning the weights. Sharing weights injects prior knowledge to unknown categories.… ▽ More

    Submitted 21 June, 2018; v1 submitted 14 June, 2018; originally announced June 2018.

  20. arXiv:1805.06966  [pdf, other

    cs.CL cs.AI stat.ML

    Neural User Simulation for Corpus-based Policy Optimisation for Spoken Dialogue Systems

    Authors: Florian Kreyssig, Inigo Casanueva, Pawel Budzianowski, Milica Gasic

    Abstract: User Simulators are one of the major tools that enable offline training of task-oriented dialogue systems. For this task the Agenda-Based User Simulator (ABUS) is often used. The ABUS is based on hand-crafted rules and its output is in semantic form. Issues arise from both properties such as limited diversity and the inability to interface a text-level belief tracker. This paper introduces the Neu… ▽ More

    Submitted 17 May, 2018; originally announced May 2018.

    Comments: Accepted to SIGDIAL 2018

  21. arXiv:1803.03232  [pdf, other

    cs.CL cs.AI cs.NE

    Feudal Reinforcement Learning for Dialogue Management in Large Domains

    Authors: Iñigo Casanueva, Paweł Budzianowski, Pei-Hao Su, Stefan Ultes, Lina Rojas-Barahona, Bo-Hsiang Tseng, Milica Gašić

    Abstract: Reinforcement learning (RL) is a promising approach to solve dialogue policy optimisation. Traditional RL algorithms, however, fail to scale to large domains due to the curse of dimensionality. We propose a novel Dialogue Management architecture, based on Feudal RL, which decomposes the decision into two steps; a first step where a master policy selects a subset of primitive actions, and a second… ▽ More

    Submitted 8 March, 2018; originally announced March 2018.

    Comments: Accepted as a short paper in NAACL 2018

  22. arXiv:1802.03753  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    Sample Efficient Deep Reinforcement Learning for Dialogue Systems with Large Action Spaces

    Authors: Gellért Weisz, Paweł Budzianowski, Pei-Hao Su, Milica Gašić

    Abstract: In spoken dialogue systems, we aim to deploy artificial intelligence to build automated dialogue agents that can converse with humans. A part of this effort is the policy optimisation task, which attempts to find a policy describing how to respond to humans, in the form of a function taking the current state of the dialogue and returning the response of the system. In this paper, we investigate de… ▽ More

    Submitted 11 February, 2018; originally announced February 2018.

  23. arXiv:1711.11486  [pdf, other

    stat.ML cs.CL cs.LG cs.NE

    Uncertainty Estimates for Efficient Neural Network-based Dialogue Policy Optimisation

    Authors: Christopher Tegho, Paweł Budzianowski, Milica Gašić

    Abstract: In statistical dialogue management, the dialogue manager learns a policy that maps a belief state to an action for the system to perform. Efficient exploration is key to successful policy optimisation. Current deep reinforcement learning methods are very promising but rely on epsilon-greedy exploration, thus subjecting the user to a random choice of action during learning. Alternative approaches s… ▽ More

    Submitted 30 November, 2017; originally announced November 2017.

    Comments: Accepted at the Bayesian Deep Learning Workshop, 31st Conference on Neural Information Processing Systems (NIPS 2017)

  24. arXiv:1711.11023  [pdf, other

    stat.ML cs.CL cs.NE

    A Benchmarking Environment for Reinforcement Learning Based Task Oriented Dialogue Management

    Authors: Iñigo Casanueva, Paweł Budzianowski, Pei-Hao Su, Nikola Mrkšić, Tsung-Hsien Wen, Stefan Ultes, Lina Rojas-Barahona, Steve Young, Milica Gašić

    Abstract: Dialogue assistants are rapidly becoming an indispensable daily aid. To avoid the significant effort needed to hand-craft the required dialogue flow, the Dialogue Management (DM) module can be cast as a continuous Markov Decision Process (MDP) and trained through Reinforcement Learning (RL). Several RL models have been investigated over recent years. However, the lack of a common benchmarking fram… ▽ More

    Submitted 6 April, 2018; v1 submitted 29 November, 2017; originally announced November 2017.

    Comments: Accepted at the Deep Reinforcement Learning Symposium, 31st Conference on Neural Information Processing Systems (NIPS 2017) Paper updated with minor changes

  25. arXiv:1707.06299  [pdf, other

    cs.CL stat.ML

    Reward-Balancing for Statistical Spoken Dialogue Systems using Multi-objective Reinforcement Learning

    Authors: Stefan Ultes, Paweł Budzianowski, Iñigo Casanueva, Nikola Mrkšić, Lina Rojas-Barahona, Pei-Hao Su, Tsung-Hsien Wen, Milica Gašić, Steve Young

    Abstract: Reinforcement learning is widely used for dialogue policy optimization where the reward function often consists of more than one component, e.g., the dialogue success and the dialogue length. In this work, we propose a structured method for finding a good balance between these components by searching for the optimal reward component weighting. To render this search feasible, we use multi-objective… ▽ More

    Submitted 19 July, 2017; originally announced July 2017.

    Comments: Accepted at SIGDial 2017

  26. arXiv:1707.00130  [pdf, other

    cs.CL cs.AI cs.LG

    Sample-efficient Actor-Critic Reinforcement Learning with Supervised Data for Dialogue Management

    Authors: Pei-Hao Su, Pawel Budzianowski, Stefan Ultes, Milica Gasic, Steve Young

    Abstract: Deep reinforcement learning (RL) methods have significant potential for dialogue policy optimisation. However, they suffer from a poor performance in the early stages of learning. This is especially problematic for on-line learning with real users. Two approaches are introduced to tackle this problem. Firstly, to speed up the learning process, two sample-efficient neural networks algorithms: trust… ▽ More

    Submitted 5 July, 2017; v1 submitted 1 July, 2017; originally announced July 2017.

    Comments: Accepted as a long paper in SigDial 2017

  27. arXiv:1706.06210  [pdf, other

    cs.CL cs.AI

    Sub-domain Modelling for Dialogue Management with Hierarchical Reinforcement Learning

    Authors: Paweł Budzianowski, Stefan Ultes, Pei-Hao Su, Nikola Mrkšić, Tsung-Hsien Wen, Iñigo Casanueva, Lina Rojas-Barahona, Milica Gašić

    Abstract: Human conversation is inherently complex, often spanning many different topics/domains. This makes policy learning for dialogue systems very challenging. Standard flat reinforcement learning methods do not provide an efficient framework for modelling such dialogues. In this paper, we focus on the under-explored problem of multi-domain dialogue management. First, we propose a new method for hierarc… ▽ More

    Submitted 17 July, 2017; v1 submitted 19 June, 2017; originally announced June 2017.

    Comments: Update of the section 4 and the bibliography