Skip to main content

Showing 1–50 of 52 results for author: Gašić, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.07418  [pdf, other

    cs.CL cs.SD eess.AS

    Speech-based Slot Filling using Large Language Models

    Authors: Guangzhi Sun, Shutong Feng, Dongcheng Jiang, Chao Zhang, Milica Gašić, Philip C. Woodland

    Abstract: Recently, advancements in large language models (LLMs) have shown an unprecedented ability across various language tasks. This paper investigates the potential application of LLMs to slot filling with noisy ASR transcriptions, via both in-context learning and task-specific fine-tuning. Dedicated prompt designs and fine-tuning approaches are proposed to improve the robustness of LLMs for slot filli… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  2. arXiv:2310.08944  [pdf, other

    cs.CL cs.LG

    CAMELL: Confidence-based Acquisition Model for Efficient Self-supervised Active Learning with Label Validation

    Authors: Carel van Niekerk, Christian Geishauser, Michael Heck, Shutong Feng, Hsien-chin Lin, Nurul Lubis, Benjamin Ruppik, Renato Vukovic, Milica Gašić

    Abstract: Supervised neural approaches are hindered by their dependence on large, meticulously annotated datasets, a requirement that is particularly cumbersome for sequential tasks. The quality of annotations tends to deteriorate with the transition from expert-based to crowd-sourced labelling. To address these challenges, we present \textbf{CAMELL} (Confidence-based Acquisition Model for Efficient self-su… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  3. arXiv:2309.12881  [pdf, other

    cs.CL

    Affect Recognition in Conversations Using Large Language Models

    Authors: Shutong Feng, Guangzhi Sun, Nurul Lubis, Chao Zhang, Milica Gašić

    Abstract: Affect recognition, encompassing emotions, moods, and feelings, plays a pivotal role in human communication. In the realm of conversational artificial intelligence (AI), the ability to discern and respond to human affective cues is a critical factor for creating engaging and empathetic interactions. This study delves into the capacity of large language models (LLMs) to recognise human affect in co… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

  4. arXiv:2308.12648  [pdf, other

    cs.CL

    From Chatter to Matter: Addressing Critical Steps of Emotion Recognition Learning in Task-oriented Dialogue

    Authors: Shutong Feng, Nurul Lubis, Benjamin Ruppik, Christian Geishauser, Michael Heck, Hsien-chin Lin, Carel van Niekerk, Renato Vukovic, Milica Gašić

    Abstract: Emotion recognition in conversations (ERC) is a crucial task for building human-like conversational agents. While substantial efforts have been devoted to ERC for chit-chat dialogues, the task-oriented counterpart is largely left unattended. Directly applying chit-chat ERC models to task-oriented dialogues (ToDs) results in suboptimal performance as these models overlook key features such as the c… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: Accepted by SIGDIAL 2023

  5. EmoUS: Simulating User Emotions in Task-Oriented Dialogues

    Authors: Hsien-Chin Lin, Shutong Feng, Christian Geishauser, Nurul Lubis, Carel van Niekerk, Michael Heck, Benjamin Ruppik, Renato Vukovic, Milica Gašić

    Abstract: Existing user simulators (USs) for task-oriented dialogue systems only model user behaviour on semantic and natural language levels without considering the user persona and emotions. Optimising dialogue systems with generic user policies, which cannot model diverse user behaviour driven by different emotional states, may result in a high drop-off rate when deployed in the real world. Thus, we pres… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: accepted by SIGIR2023

  6. arXiv:2306.01386  [pdf, other

    cs.CL cs.AI

    ChatGPT for Zero-shot Dialogue State Tracking: A Solution or an Opportunity?

    Authors: Michael Heck, Nurul Lubis, Benjamin Ruppik, Renato Vukovic, Shutong Feng, Christian Geishauser, Hsien-Chin Lin, Carel van Niekerk, Milica Gašić

    Abstract: Recent research on dialogue state tracking (DST) focuses on methods that allow few- and zero-shot transfer to new domains or schemas. However, performance gains heavily depend on aggressive data augmentation and fine-tuning of ever larger language model based architectures. In contrast, general purpose language models, trained on large amounts of diverse data, hold the promise of solving any kind… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: 13 pages, 3 figures, accepted at ACL 2023

  7. arXiv:2211.17148  [pdf, other

    cs.CL cs.AI

    ConvLab-3: A Flexible Dialogue System Toolkit Based on a Unified Data Format

    Authors: Qi Zhu, Christian Geishauser, Hsien-chin Lin, Carel van Niekerk, Baolin Peng, Zheng Zhang, Michael Heck, Nurul Lubis, Dazhen Wan, Xiaochen Zhu, Jianfeng Gao, Milica Gašić, Minlie Huang

    Abstract: Task-oriented dialogue (TOD) systems function as digital assistants, guiding users through various tasks such as booking flights or finding restaurants. Existing toolkits for building TOD systems often fall short of in delivering comprehensive arrays of data, models, and experimental environments with a user-friendly experience. We introduce ConvLab-3: a multifaceted dialogue system toolkit crafte… ▽ More

    Submitted 17 October, 2023; v1 submitted 30 November, 2022; originally announced November 2022.

  8. arXiv:2209.00876  [pdf, other

    cs.CL

    Dialogue Evaluation with Offline Reinforcement Learning

    Authors: Nurul Lubis, Christian Geishauser, Hsien-Chin Lin, Carel van Niekerk, Michael Heck, Shutong Feng, Milica Gašić

    Abstract: Task-oriented dialogue systems aim to fulfill user goals through natural language interactions. They are ideally evaluated with human users, which however is unattainable to do at every iteration of the development phase. Simulated users could be an alternative, however their development is nontrivial. Therefore, researchers resort to offline metrics on existing human-human corpora, which are more… ▽ More

    Submitted 2 September, 2022; originally announced September 2022.

    Comments: Accepted as long paper at SIGDIAL 2022

  9. arXiv:2208.10817  [pdf, other

    cs.CL

    GenTUS: Simulating User Behaviour and Language in Task-oriented Dialogues with Generative Transformers

    Authors: Hsien-Chin Lin, Christian Geishauser, Shutong Feng, Nurul Lubis, Carel van Niekerk, Michael Heck, Milica Gašić

    Abstract: User simulators (USs) are commonly used to train task-oriented dialogue systems (DSs) via reinforcement learning. The interactions often take place on semantic level for efficiency, but there is still a gap from semantic actions to natural language, which causes a mismatch between training and deployment environment. Incorporating a natural language generation (NLG) module with USs during training… ▽ More

    Submitted 23 August, 2022; originally announced August 2022.

    Comments: Accepted as a long paper to SIGDial 2022

  10. Dialogue Term Extraction using Transfer Learning and Topological Data Analysis

    Authors: Renato Vukovic, Michael Heck, Benjamin Matthias Ruppik, Carel van Niekerk, Marcus Zibrowius, Milica Gašić

    Abstract: Goal oriented dialogue systems were originally designed as a natural language interface to a fixed data-set of entities that users might inquire about, further described by domain, slots, and values. As we move towards adaptable dialogue systems where knowledge about domains, slots, and values may change, there is an increasing need to automatically extract these terms from raw dialogues or relate… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: Accepted as a long paper to SIGDIAL 2022 (Edinburgh)

  11. arXiv:2204.05928  [pdf, other

    cs.CL cs.LG

    Dynamic Dialogue Policy for Continual Reinforcement Learning

    Authors: Christian Geishauser, Carel van Niekerk, Nurul Lubis, Michael Heck, Hsien-Chin Lin, Shutong Feng, Milica Gašić

    Abstract: Continual learning is one of the key components of human learning and a necessary requirement of artificial intelligence. As dialogue can potentially span infinitely many topics and tasks, a task-oriented dialogue system must have the capability to continually learn, dynamically adapting to new challenges while preserving the knowledge it already acquired. Despite the importance, continual reinfor… ▽ More

    Submitted 10 October, 2022; v1 submitted 12 April, 2022; originally announced April 2022.

  12. arXiv:2203.10012  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Report from the NSF Future Directions Workshop on Automatic Evaluation of Dialog: Research Directions and Challenges

    Authors: Shikib Mehri, **ho Choi, Luis Fernando D'Haro, Jan Deriu, Maxine Eskenazi, Milica Gasic, Kallirroi Georgila, Dilek Hakkani-Tur, Zekang Li, Verena Rieser, Samira Shaikh, David Traum, Yi-Ting Yeh, Zhou Yu, Yizhe Zhang, Chen Zhang

    Abstract: This is a report on the NSF Future Directions Workshop on Automatic Evaluation of Dialog. The workshop explored the current state of the art along with its limitations and suggested promising directions for future work in this important and very rapidly changing area of research.

    Submitted 18 March, 2022; originally announced March 2022.

    Comments: Report from the NSF AED Workshop (http://dialrc.org/AED/)

  13. arXiv:2202.03354  [pdf, other

    cs.CL

    Robust Dialogue State Tracking with Weak Supervision and Sparse Data

    Authors: Michael Heck, Nurul Lubis, Carel van Niekerk, Shutong Feng, Christian Geishauser, Hsien-Chin Lin, Milica Gašić

    Abstract: Generalising dialogue state tracking (DST) to new data is especially challenging due to the strong reliance on abundant and fine-grained supervision during training. Sample sparsity, distributional shift and the occurrence of new concepts and topics frequently lead to severe performance degradation during inference. In this paper we propose a training strategy to build extractive DST models withou… ▽ More

    Submitted 9 August, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: 12 pages, 6 figures, pre-MIT Press publication version (author's final version), accepted for publication in TACL

  14. arXiv:2109.07129  [pdf, other

    cs.LG cs.CL

    What Does The User Want? Information Gain for Hierarchical Dialogue Policy Optimisation

    Authors: Christian Geishauser, Songbo Hu, Hsien-chin Lin, Nurul Lubis, Michael Heck, Shutong Feng, Carel van Niekerk, Milica Gašić

    Abstract: The dialogue management component of a task-oriented dialogue system is typically optimised via reinforcement learning (RL). Optimisation via RL is highly susceptible to sample inefficiency and instability. The hierarchical approach called Feudal Dialogue Management takes a step towards more efficient learning by decomposing the action space. However, it still suffers from instability due to the r… ▽ More

    Submitted 15 September, 2021; originally announced September 2021.

  15. arXiv:2109.04919  [pdf, other

    cs.CL

    EmoWOZ: A Large-Scale Corpus and Labelling Scheme for Emotion Recognition in Task-Oriented Dialogue Systems

    Authors: Shutong Feng, Nurul Lubis, Christian Geishauser, Hsien-chin Lin, Michael Heck, Carel van Niekerk, Milica Gašić

    Abstract: The ability to recognise emotions lends a conversational artificial intelligence a human touch. While emotions in chit-chat dialogues have received substantial attention, emotions in task-oriented dialogues remain largely unaddressed. This is despite emotions and dialogue success having equally important roles in a natural system. Existing emotion-annotated task-oriented corpora are limited in siz… ▽ More

    Submitted 2 May, 2022; v1 submitted 10 September, 2021; originally announced September 2021.

    Comments: Accepted for publication at LREC 2022

  16. arXiv:2109.04349  [pdf, other

    cs.CL

    Uncertainty Measures in Neural Belief Tracking and the Effects on Dialogue Policy Performance

    Authors: Carel van Niekerk, Andrey Malinin, Christian Geishauser, Michael Heck, Hsien-chin Lin, Nurul Lubis, Shutong Feng, Milica Gašić

    Abstract: The ability to identify and resolve uncertainty is crucial for the robustness of a dialogue system. Indeed, this has been confirmed empirically on systems that utilise Bayesian approaches to dialogue belief tracking. However, such systems consider only confidence estimates and have difficulty scaling to more complex settings. Neural dialogue systems, on the other hand, rarely take uncertainties in… ▽ More

    Submitted 9 September, 2021; originally announced September 2021.

    Comments: 14 pages, 2 figures, accepted at EMNLP 2021 Main conference, Code at: https://gitlab.cs.uni-duesseldorf.de/general/dsml/setsumbt-public

  17. arXiv:2106.08838  [pdf, other

    cs.CL

    Domain-independent User Simulation with Transformers for Task-oriented Dialogue Systems

    Authors: Hsien-chin Lin, Nurul Lubis, Songbo Hu, Carel van Niekerk, Christian Geishauser, Michael Heck, Shutong Feng, Milica Gašić

    Abstract: Dialogue policy optimisation via reinforcement learning requires a large number of training interactions, which makes learning with real users time consuming and expensive. Many set-ups therefore rely on a user simulator instead of humans. These user simulators have their own problems. While hand-coded, rule-based user simulators have been shown to be sufficient in small, simple domains, for compl… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

  18. arXiv:2011.09413  [pdf, other

    cs.CL

    Topology of Word Embeddings: Singularities Reflect Polysemy

    Authors: Alexander Jakubowski, Milica Gašić, Marcus Zibrowius

    Abstract: The manifold hypothesis suggests that word vectors live on a submanifold within their ambient vector space. We argue that we should, more accurately, expect them to live on a pinched manifold: a singular quotient of a manifold obtained by identifying some of its points. The identified, singular points correspond to polysemous words, i.e. words with multiple meanings. Our point of view suggests tha… ▽ More

    Submitted 18 November, 2020; originally announced November 2020.

    Comments: Accepted at the 9th Joint Conference on Lexical and Computational Semantics (*SEM 2020)

  19. arXiv:2011.09379  [pdf, other

    cs.CL

    Out-of-Task Training for Dialog State Tracking Models

    Authors: Michael Heck, Carel van Niekerk, Nurul Lubis, Christian Geishauser, Hsien-Chin Lin, Marco Moresi, Milica Gašić

    Abstract: Dialog state tracking (DST) suffers from severe data sparsity. While many natural language processing (NLP) tasks benefit from transfer learning and multi-task learning, in dialog these methods are limited by the amount of available data and by the specificity of dialog applications. In this work, we successfully utilize non-dialog data from unrelated NLP tasks to train dialog state trackers. This… ▽ More

    Submitted 18 November, 2020; originally announced November 2020.

    Comments: 8 pages, 2 figures, to be published in Proceedings of the 28th International Conference on Computational Linguistics, Code at https://gitlab.cs.uni-duesseldorf.de/general/dsml/trippy-public

  20. arXiv:2011.09378  [pdf, other

    cs.CL

    LAVA: Latent Action Spaces via Variational Auto-encoding for Dialogue Policy Optimization

    Authors: Nurul Lubis, Christian Geishauser, Michael Heck, Hsien-chin Lin, Marco Moresi, Carel van Niekerk, Milica Gašić

    Abstract: Reinforcement learning (RL) can enable task-oriented dialogue systems to steer the conversation towards successful task completion. In an end-to-end setting, a response can be constructed in a word-level sequential decision making process with the entire system vocabulary as action space. Policies trained in such a fashion do not require expert-defined action spaces, but they have to deal with lar… ▽ More

    Submitted 18 November, 2020; originally announced November 2020.

    Comments: 15 pages. To be published as long paper in Proceedings of The 28th International Conference on Computational Linguistics (COLING 2020). Code can be accessed at https://gitlab.cs.uni-duesseldorf.de/general/dsml/lava-public

  21. arXiv:2010.02586  [pdf, other

    cs.CL cs.AI

    Knowing What You Know: Calibrating Dialogue Belief State Distributions via Ensembles

    Authors: Carel van Niekerk, Michael Heck, Christian Geishauser, Hsien-Chin Lin, Nurul Lubis, Marco Moresi, Milica Gašić

    Abstract: The ability to accurately track what happens during a conversation is essential for the performance of a dialogue system. Current state-of-the-art multi-domain dialogue state trackers achieve just over 55% accuracy on the current go-to benchmark, which means that in almost every second dialogue turn they place full confidence in an incorrect dialogue state. Belief trackers, on the other hand, main… ▽ More

    Submitted 5 November, 2020; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: 7 pages, 9 figures, to be published in Findings of EMNLP 2020, code available at: https://gitlab.cs.uni-duesseldorf.de/general/dsml/calibrating-dialogue-belief-state-distributions

    Journal ref: Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, Pages 3096-3102; Association for Computational Linguistics

  22. arXiv:2005.02877  [pdf, other

    cs.CL

    TripPy: A Triple Copy Strategy for Value Independent Neural Dialog State Tracking

    Authors: Michael Heck, Carel van Niekerk, Nurul Lubis, Christian Geishauser, Hsien-Chin Lin, Marco Moresi, Milica Gašić

    Abstract: Task-oriented dialog systems rely on dialog state tracking (DST) to monitor the user's goal during the course of an interaction. Multi-domain and open-vocabulary settings complicate the task considerably and demand scalable solutions. In this paper we present a new approach to DST which makes use of various copy mechanisms to fill slots with values. Our model has no need to maintain a list of cand… ▽ More

    Submitted 25 September, 2020; v1 submitted 6 May, 2020; originally announced May 2020.

    Comments: 10 pages, 6 figures, published in Proceedings of the 21st Annual SIGdial Meeting on Discourse and Dialogue, Code at: https://gitlab.cs.uni-duesseldorf.de/general/dsml/trippy-public

    Journal ref: Proceedings of the 21th Annual Meeting of the Special Interest Group on Discourse and Dialogue (July 2020), Pages 35-44; Association for Computational Linguistics

  23. arXiv:1910.06719  [pdf, other

    cs.CL cs.LG

    Tree-Structured Semantic Encoder with Knowledge Sharing for Domain Adaptation in Natural Language Generation

    Authors: Bo-Hsiang Tseng, Paweł Budzianowski, Yen-Chen Wu, Milica Gašić

    Abstract: Domain adaptation in natural language generation (NLG) remains challenging because of the high complexity of input semantics across domains and limited data of a target domain. This is particularly the case for dialogue systems, where we want to be able to seamlessly include new domains into the conversation. Therefore, it is crucial for generation models to share knowledge across domains for the… ▽ More

    Submitted 2 October, 2019; originally announced October 2019.

    Comments: Published in SIGDIAL2019

  24. arXiv:1905.11259  [pdf, other

    cs.CL cs.AI cs.LG eess.AS

    AgentGraph: Towards Universal Dialogue Management with Structured Deep Reinforcement Learning

    Authors: Lu Chen, Zhi Chen, Bowen Tan, Sishan Long, Milica Gasic, Kai Yu

    Abstract: Dialogue policy plays an important role in task-oriented spoken dialogue systems. It determines how to respond to users. The recently proposed deep reinforcement learning (DRL) approaches have been used for policy optimization. However, these deep models are still challenging for two reasons: 1) Many DRL-based policies are not sample-efficient. 2) Most models don't have the capability of policy tr… ▽ More

    Submitted 27 May, 2019; originally announced May 2019.

    Comments: 14 pages, 8 figures; Accepted by IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING

  25. arXiv:1901.01466  [pdf, other

    cs.CL

    Addressing Objects and Their Relations: The Conversational Entity Dialogue Model

    Authors: Stefan Ultes, Paweł\ Budzianowski, Iñigo Casanueva, Lina Rojas-Barahona, Bo-Hsiang Tseng, Yen-Chen Wu, Steve Young, Milica Gašić

    Abstract: Statistical spoken dialogue systems usually rely on a single- or multi-domain dialogue model that is restricted in its capabilities of modelling complex dialogue structures, e.g., relations. In this work, we propose a novel dialogue model that is centred around entities and is able to model relations as well as multiple entities of the same type. We demonstrate in a prototype implementation benefi… ▽ More

    Submitted 5 January, 2019; originally announced January 2019.

    Comments: Accepted at SIGDial 2018

  26. arXiv:1812.08879  [pdf, other

    cs.CL cs.AI

    Variational Cross-domain Natural Language Generation for Spoken Dialogue Systems

    Authors: Bo-Hsiang Tseng, Florian Kreyssig, Pawel Budzianowski, Inigo Casanueva, Yen-Chen Wu, Stefan Ultes, Milica Gasic

    Abstract: Cross-domain natural language generation (NLG) is still a difficult task within spoken dialogue modelling. Given a semantic representation provided by the dialogue manager, the language generator should generate sentences that convey desired information. Traditional template-based generators can produce sentences with all necessary information, but these sentences are not sufficiently diverse. Wit… ▽ More

    Submitted 20 December, 2018; originally announced December 2018.

    Comments: Sigdial 2018

  27. arXiv:1810.00278  [pdf, other

    cs.CL

    MultiWOZ -- A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling

    Authors: Paweł Budzianowski, Tsung-Hsien Wen, Bo-Hsiang Tseng, Iñigo Casanueva, Stefan Ultes, Osman Ramadan, Milica Gašić

    Abstract: Even though machine learning has become the major scene in dialogue research community, the real breakthrough has been blocked by the scale of data available. To address this fundamental obstacle, we introduce the Multi-Domain Wizard-of-Oz dataset (MultiWOZ), a fully-labeled collection of human-human written conversations spanning over multiple domains and topics. At a size of $10$k dialogues, it… ▽ More

    Submitted 20 April, 2020; v1 submitted 29 September, 2018; originally announced October 2018.

    Comments: Accepted for publication at EMNLP 2018

  28. arXiv:1809.00640  [pdf, other

    cs.CL

    Deep learning for language understanding of mental health concepts derived from Cognitive Behavioural Therapy

    Authors: Lina Rojas-Barahona, Bo-Hsiang Tseng, Yinpei Dai, Clare Mansfield, Osman Ramadan, Stefan Ultes, Michael Crawford, Milica Gasic

    Abstract: In recent years, we have seen deep learning and distributed representations of words and sentences make impact on a number of natural language processing tasks, such as similarity, entailment and sentiment analysis. Here we introduce a new task: understanding of mental health concepts derived from Cognitive Behavioural Therapy (CBT). We define a mental health ontology based on the CBT principles,… ▽ More

    Submitted 3 September, 2018; originally announced September 2018.

    Comments: Accepted for publication at LOUHI 2018: The Ninth International Workshop on Health Text Mining and Information Analysis

  29. arXiv:1807.06517  [pdf, other

    cs.CL

    Large-Scale Multi-Domain Belief Tracking with Knowledge Sharing

    Authors: Osman Ramadan, Paweł Budzianowski, Milica Gašić

    Abstract: Robust dialogue belief tracking is a key component in maintaining good quality dialogue systems. The tasks that dialogue systems are trying to solve are becoming increasingly complex, requiring scalability to multi domain, semantically rich dialogues. However, most current approaches have difficulty scaling up with domains because of the dependency of the model parameters on the dialogue ontology.… ▽ More

    Submitted 17 July, 2018; originally announced July 2018.

    Comments: 10 pages, 1 figure and 2 tables. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL)

  30. arXiv:1806.05484  [pdf, other

    cs.CL cs.AI

    Nearly Zero-Shot Learning for Semantic Decoding in Spoken Dialogue Systems

    Authors: Lina M. Rojas-Barahona, Stefan Ultes, Pawel Budzianowski, Iñigo Casanueva, Milica Gasic, Bo-Hsiang Tseng, Steve Young

    Abstract: This paper presents two ways of dealing with scarce data in semantic decoding using N-Best speech recognition hypotheses. First, we learn features by using a deep learning architecture in which the weights for the unknown and known categories are jointly optimised. Second, an unsupervised method is used for further tuning the weights. Sharing weights injects prior knowledge to unknown categories.… ▽ More

    Submitted 21 June, 2018; v1 submitted 14 June, 2018; originally announced June 2018.

  31. arXiv:1805.06966  [pdf, other

    cs.CL cs.AI stat.ML

    Neural User Simulation for Corpus-based Policy Optimisation for Spoken Dialogue Systems

    Authors: Florian Kreyssig, Inigo Casanueva, Pawel Budzianowski, Milica Gasic

    Abstract: User Simulators are one of the major tools that enable offline training of task-oriented dialogue systems. For this task the Agenda-Based User Simulator (ABUS) is often used. The ABUS is based on hand-crafted rules and its output is in semantic form. Issues arise from both properties such as limited diversity and the inability to interface a text-level belief tracker. This paper introduces the Neu… ▽ More

    Submitted 17 May, 2018; originally announced May 2018.

    Comments: Accepted to SIGDIAL 2018

  32. arXiv:1803.03232  [pdf, other

    cs.CL cs.AI cs.NE

    Feudal Reinforcement Learning for Dialogue Management in Large Domains

    Authors: Iñigo Casanueva, Paweł Budzianowski, Pei-Hao Su, Stefan Ultes, Lina Rojas-Barahona, Bo-Hsiang Tseng, Milica Gašić

    Abstract: Reinforcement learning (RL) is a promising approach to solve dialogue policy optimisation. Traditional RL algorithms, however, fail to scale to large domains due to the curse of dimensionality. We propose a novel Dialogue Management architecture, based on Feudal RL, which decomposes the decision into two steps; a first step where a master policy selects a subset of primitive actions, and a second… ▽ More

    Submitted 8 March, 2018; originally announced March 2018.

    Comments: Accepted as a short paper in NAACL 2018

  33. arXiv:1802.03753  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    Sample Efficient Deep Reinforcement Learning for Dialogue Systems with Large Action Spaces

    Authors: Gellért Weisz, Paweł Budzianowski, Pei-Hao Su, Milica Gašić

    Abstract: In spoken dialogue systems, we aim to deploy artificial intelligence to build automated dialogue agents that can converse with humans. A part of this effort is the policy optimisation task, which attempts to find a policy describing how to respond to humans, in the form of a function taking the current state of the dialogue and returning the response of the system. In this paper, we investigate de… ▽ More

    Submitted 11 February, 2018; originally announced February 2018.

  34. arXiv:1711.11486  [pdf, other

    stat.ML cs.CL cs.LG cs.NE

    Uncertainty Estimates for Efficient Neural Network-based Dialogue Policy Optimisation

    Authors: Christopher Tegho, Paweł Budzianowski, Milica Gašić

    Abstract: In statistical dialogue management, the dialogue manager learns a policy that maps a belief state to an action for the system to perform. Efficient exploration is key to successful policy optimisation. Current deep reinforcement learning methods are very promising but rely on epsilon-greedy exploration, thus subjecting the user to a random choice of action during learning. Alternative approaches s… ▽ More

    Submitted 30 November, 2017; originally announced November 2017.

    Comments: Accepted at the Bayesian Deep Learning Workshop, 31st Conference on Neural Information Processing Systems (NIPS 2017)

  35. arXiv:1711.11023  [pdf, other

    stat.ML cs.CL cs.NE

    A Benchmarking Environment for Reinforcement Learning Based Task Oriented Dialogue Management

    Authors: Iñigo Casanueva, Paweł Budzianowski, Pei-Hao Su, Nikola Mrkšić, Tsung-Hsien Wen, Stefan Ultes, Lina Rojas-Barahona, Steve Young, Milica Gašić

    Abstract: Dialogue assistants are rapidly becoming an indispensable daily aid. To avoid the significant effort needed to hand-craft the required dialogue flow, the Dialogue Management (DM) module can be cast as a continuous Markov Decision Process (MDP) and trained through Reinforcement Learning (RL). Several RL models have been investigated over recent years. However, the lack of a common benchmarking fram… ▽ More

    Submitted 6 April, 2018; v1 submitted 29 November, 2017; originally announced November 2017.

    Comments: Accepted at the Deep Reinforcement Learning Symposium, 31st Conference on Neural Information Processing Systems (NIPS 2017) Paper updated with minor changes

  36. arXiv:1707.06299  [pdf, other

    cs.CL stat.ML

    Reward-Balancing for Statistical Spoken Dialogue Systems using Multi-objective Reinforcement Learning

    Authors: Stefan Ultes, Paweł Budzianowski, Iñigo Casanueva, Nikola Mrkšić, Lina Rojas-Barahona, Pei-Hao Su, Tsung-Hsien Wen, Milica Gašić, Steve Young

    Abstract: Reinforcement learning is widely used for dialogue policy optimization where the reward function often consists of more than one component, e.g., the dialogue success and the dialogue length. In this work, we propose a structured method for finding a good balance between these components by searching for the optimal reward component weighting. To render this search feasible, we use multi-objective… ▽ More

    Submitted 19 July, 2017; originally announced July 2017.

    Comments: Accepted at SIGDial 2017

  37. arXiv:1707.00130  [pdf, other

    cs.CL cs.AI cs.LG

    Sample-efficient Actor-Critic Reinforcement Learning with Supervised Data for Dialogue Management

    Authors: Pei-Hao Su, Pawel Budzianowski, Stefan Ultes, Milica Gasic, Steve Young

    Abstract: Deep reinforcement learning (RL) methods have significant potential for dialogue policy optimisation. However, they suffer from a poor performance in the early stages of learning. This is especially problematic for on-line learning with real users. Two approaches are introduced to tackle this problem. Firstly, to speed up the learning process, two sample-efficient neural networks algorithms: trust… ▽ More

    Submitted 5 July, 2017; v1 submitted 1 July, 2017; originally announced July 2017.

    Comments: Accepted as a long paper in SigDial 2017

  38. arXiv:1706.06210  [pdf, other

    cs.CL cs.AI

    Sub-domain Modelling for Dialogue Management with Hierarchical Reinforcement Learning

    Authors: Paweł Budzianowski, Stefan Ultes, Pei-Hao Su, Nikola Mrkšić, Tsung-Hsien Wen, Iñigo Casanueva, Lina Rojas-Barahona, Milica Gašić

    Abstract: Human conversation is inherently complex, often spanning many different topics/domains. This makes policy learning for dialogue systems very challenging. Standard flat reinforcement learning methods do not provide an efficient framework for modelling such dialogues. In this paper, we focus on the under-explored problem of multi-domain dialogue management. First, we propose a new method for hierarc… ▽ More

    Submitted 17 July, 2017; v1 submitted 19 June, 2017; originally announced June 2017.

    Comments: Update of the section 4 and the bibliography

  39. arXiv:1706.00374  [pdf, other

    cs.CL cs.AI cs.LG

    Semantic Specialisation of Distributional Word Vector Spaces using Monolingual and Cross-Lingual Constraints

    Authors: Nikola Mrkšić, Ivan Vulić, Diarmuid Ó Séaghdha, Ira Leviant, Roi Reichart, Milica Gašić, Anna Korhonen, Steve Young

    Abstract: We present Attract-Repel, an algorithm for improving the semantic quality of word vectors by injecting constraints extracted from lexical resources. Attract-Repel facilitates the use of constraints from mono- and cross-lingual resources, yielding semantically specialised cross-lingual vector spaces. Our evaluation shows that the method can make use of existing cross-lingual lexicons to construct h… ▽ More

    Submitted 1 June, 2017; originally announced June 2017.

    Comments: Accepted for publication at TACL (to be presented at EMNLP 2017)

  40. arXiv:1610.04120  [pdf, other

    cs.AI cs.CL cs.NE

    Exploiting Sentence and Context Representations in Deep Neural Models for Spoken Language Understanding

    Authors: Lina M. Rojas Barahona, Milica Gasic, Nikola Mrkšić, Pei-Hao Su, Stefan Ultes, Tsung-Hsien Wen, Steve Young

    Abstract: This paper presents a deep learning architecture for the semantic decoder component of a Statistical Spoken Dialogue System. In a slot-filling dialogue, the semantic decoder predicts the dialogue act and a set of slot-value pairs from a set of n-best hypotheses returned by the Automatic Speech Recognition. Most current models for spoken language understanding assume (i) word-aligned semantic annot… ▽ More

    Submitted 13 October, 2016; originally announced October 2016.

  41. arXiv:1609.02846  [pdf, other

    cs.CL

    Dialogue manager domain adaptation using Gaussian process reinforcement learning

    Authors: Milica Gasic, Nikola Mrksic, Lina M. Rojas-Barahona, Pei-Hao Su, Stefan Ultes, David Vandyke, Tsung-Hsien Wen, Steve Young

    Abstract: Spoken dialogue systems allow humans to interact with machines using natural speech. As such, they have many benefits. By using speech as the primary communication medium, a computer interface can facilitate swift, human-like acquisition of information. In recent years, speech interfaces have become ever more popular, as is evident from the rise of personal assistants such as Siri, Google Now, Cor… ▽ More

    Submitted 9 September, 2016; originally announced September 2016.

    Comments: accepted for publication in Computer Speech and Language

  42. arXiv:1606.03352  [pdf, other

    cs.CL cs.NE stat.ML

    Conditional Generation and Snapshot Learning in Neural Dialogue Systems

    Authors: Tsung-Hsien Wen, Milica Gasic, Nikola Mrksic, Lina M. Rojas-Barahona, Pei-Hao Su, Stefan Ultes, David Vandyke, Steve Young

    Abstract: Recently a variety of LSTM-based conditional language models (LM) have been applied across a range of language generation tasks. In this work we study various model architectures and different ways to represent and aggregate the source information in an end-to-end neural dialogue system framework. A method called snapshot learning is also proposed to facilitate learning from supervised sequential… ▽ More

    Submitted 10 June, 2016; originally announced June 2016.

  43. arXiv:1606.02689  [pdf, other

    cs.CL cs.LG

    Continuously Learning Neural Dialogue Management

    Authors: Pei-Hao Su, Milica Gasic, Nikola Mrksic, Lina Rojas-Barahona, Stefan Ultes, David Vandyke, Tsung-Hsien Wen, Steve Young

    Abstract: We describe a two-step approach for dialogue management in task-oriented spoken dialogue systems. A unified neural network framework is proposed to enable the system to first learn by supervision from a set of dialogue data and then continuously improve its behaviour via reinforcement learning, all using gradient-based algorithms on one single model. The experiments demonstrate the supervised mode… ▽ More

    Submitted 8 June, 2016; originally announced June 2016.

  44. arXiv:1605.07669  [pdf, other

    cs.CL cs.LG

    On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems

    Authors: Pei-Hao Su, Milica Gasic, Nikola Mrksic, Lina Rojas-Barahona, Stefan Ultes, David Vandyke, Tsung-Hsien Wen, Steve Young

    Abstract: The ability to compute an accurate reward function is essential for optimising a dialogue policy via reinforcement learning. In real-world applications, using explicit user feedback as the reward signal is often unreliable and costly to collect. This problem can be mitigated if the user's intent is known in advance or data is available to pre-train a task success predictor off-line. In practice ne… ▽ More

    Submitted 2 June, 2016; v1 submitted 24 May, 2016; originally announced May 2016.

    Comments: Accepted as a long paper in ACL 2016

  45. arXiv:1604.04562  [pdf, other

    cs.CL cs.AI cs.NE stat.ML

    A Network-based End-to-End Trainable Task-oriented Dialogue System

    Authors: Tsung-Hsien Wen, David Vandyke, Nikola Mrksic, Milica Gasic, Lina M. Rojas-Barahona, Pei-Hao Su, Stefan Ultes, Steve Young

    Abstract: Teaching machines to accomplish tasks by conversing naturally with humans is challenging. Currently, develo** task-oriented dialogue systems requires creating multiple components and typically this involves either a large amount of handcrafting, or acquiring costly labelled datasets to solve a statistical learning problem for each component. In this work we introduce a neural network-based text-… ▽ More

    Submitted 24 April, 2017; v1 submitted 15 April, 2016; originally announced April 2016.

    Comments: published at EACL 2017

  46. arXiv:1603.01232  [pdf, other

    cs.CL

    Multi-domain Neural Network Language Generation for Spoken Dialogue Systems

    Authors: Tsung-Hsien Wen, Milica Gasic, Nikola Mrksic, Lina M. Rojas-Barahona, Pei-Hao Su, David Vandyke, Steve Young

    Abstract: Moving from limited-domain natural language generation (NLG) to open domain is difficult because the number of semantic input combinations grows exponentially with the number of domains. Therefore, it is important to leverage existing resources and exploit similarities between domains to facilitate domain adaptation. In this paper, we propose a procedure to train multi-domain, Recurrent Neural Net… ▽ More

    Submitted 3 March, 2016; originally announced March 2016.

    Comments: Accepted as a long paper in NAACL-HLT 2016

  47. arXiv:1603.00892  [pdf, other

    cs.CL cs.LG

    Counter-fitting Word Vectors to Linguistic Constraints

    Authors: Nikola Mrkšić, Diarmuid Ó Séaghdha, Blaise Thomson, Milica Gašić, Lina Rojas-Barahona, Pei-Hao Su, David Vandyke, Tsung-Hsien Wen, Steve Young

    Abstract: In this work, we present a novel counter-fitting method which injects antonymy and synonymy constraints into vector space representations in order to improve the vectors' capability for judging semantic similarity. Applying this method to publicly available pre-trained word vectors leads to a new state of the art performance on the SimLex-999 dataset. We also show how the method can be used to tai… ▽ More

    Submitted 2 March, 2016; originally announced March 2016.

    Comments: Paper accepted for the 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2016)

  48. arXiv:1508.03391  [pdf, other

    cs.LG cs.CL

    Reward Sha** with Recurrent Neural Networks for Speeding up On-Line Policy Learning in Spoken Dialogue Systems

    Authors: Pei-Hao Su, David Vandyke, Milica Gasic, Nikola Mrksic, Tsung-Hsien Wen, Steve Young

    Abstract: Statistical spoken dialogue systems have the attractive property of being able to be optimised from data via interactions with real users. However in the reinforcement learning paradigm the dialogue manager (agent) often requires significant time to explore the state-action space to learn to behave in a desirable manner. This is a critical issue when the system is trained on-line with real users w… ▽ More

    Submitted 18 August, 2015; v1 submitted 13 August, 2015; originally announced August 2015.

    Comments: Accepted for publication in SigDial 2015

  49. arXiv:1508.03386  [pdf, other

    cs.LG cs.CL

    Learning from Real Users: Rating Dialogue Success with Neural Networks for Reinforcement Learning in Spoken Dialogue Systems

    Authors: Pei-Hao Su, David Vandyke, Milica Gasic, Dongho Kim, Nikola Mrksic, Tsung-Hsien Wen, Steve Young

    Abstract: To train a statistical spoken dialogue system (SDS) it is essential that an accurate method for measuring task success is available. To date training has relied on presenting a task to either simulated or paid users and inferring the dialogue's success by observing whether this presented task was achieved or not. Our aim however is to be able to learn from real users acting under their own volitio… ▽ More

    Submitted 13 August, 2015; originally announced August 2015.

    Comments: Accepted for publication in INTERSPEECH 2015

  50. arXiv:1508.01755  [pdf, other

    cs.CL

    Stochastic Language Generation in Dialogue using Recurrent Neural Networks with Convolutional Sentence Reranking

    Authors: Tsung-Hsien Wen, Milica Gasic, Dongho Kim, Nikola Mrksic, Pei-Hao Su, David Vandyke, Steve Young

    Abstract: The natural language generation (NLG) component of a spoken dialogue system (SDS) usually needs a substantial amount of handcrafting or a well-labeled dataset to be trained on. These limitations add significantly to development costs and make cross-domain, multi-lingual dialogue systems intractable. Moreover, human languages are context-aware. The most natural response should be directly learned f… ▽ More

    Submitted 7 August, 2015; originally announced August 2015.

    Comments: To be appear in SigDial 2015