Skip to main content

Showing 1–18 of 18 results for author: van Niekerk, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.08944  [pdf, other

    cs.CL cs.LG

    CAMELL: Confidence-based Acquisition Model for Efficient Self-supervised Active Learning with Label Validation

    Authors: Carel van Niekerk, Christian Geishauser, Michael Heck, Shutong Feng, Hsien-chin Lin, Nurul Lubis, Benjamin Ruppik, Renato Vukovic, Milica Gašić

    Abstract: Supervised neural approaches are hindered by their dependence on large, meticulously annotated datasets, a requirement that is particularly cumbersome for sequential tasks. The quality of annotations tends to deteriorate with the transition from expert-based to crowd-sourced labelling. To address these challenges, we present \textbf{CAMELL} (Confidence-based Acquisition Model for Efficient self-su… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  2. arXiv:2308.12648  [pdf, other

    cs.CL

    From Chatter to Matter: Addressing Critical Steps of Emotion Recognition Learning in Task-oriented Dialogue

    Authors: Shutong Feng, Nurul Lubis, Benjamin Ruppik, Christian Geishauser, Michael Heck, Hsien-chin Lin, Carel van Niekerk, Renato Vukovic, Milica Gašić

    Abstract: Emotion recognition in conversations (ERC) is a crucial task for building human-like conversational agents. While substantial efforts have been devoted to ERC for chit-chat dialogues, the task-oriented counterpart is largely left unattended. Directly applying chit-chat ERC models to task-oriented dialogues (ToDs) results in suboptimal performance as these models overlook key features such as the c… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: Accepted by SIGDIAL 2023

  3. EmoUS: Simulating User Emotions in Task-Oriented Dialogues

    Authors: Hsien-Chin Lin, Shutong Feng, Christian Geishauser, Nurul Lubis, Carel van Niekerk, Michael Heck, Benjamin Ruppik, Renato Vukovic, Milica Gašić

    Abstract: Existing user simulators (USs) for task-oriented dialogue systems only model user behaviour on semantic and natural language levels without considering the user persona and emotions. Optimising dialogue systems with generic user policies, which cannot model diverse user behaviour driven by different emotional states, may result in a high drop-off rate when deployed in the real world. Thus, we pres… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: accepted by SIGIR2023

  4. arXiv:2306.01386  [pdf, other

    cs.CL cs.AI

    ChatGPT for Zero-shot Dialogue State Tracking: A Solution or an Opportunity?

    Authors: Michael Heck, Nurul Lubis, Benjamin Ruppik, Renato Vukovic, Shutong Feng, Christian Geishauser, Hsien-Chin Lin, Carel van Niekerk, Milica Gašić

    Abstract: Recent research on dialogue state tracking (DST) focuses on methods that allow few- and zero-shot transfer to new domains or schemas. However, performance gains heavily depend on aggressive data augmentation and fine-tuning of ever larger language model based architectures. In contrast, general purpose language models, trained on large amounts of diverse data, hold the promise of solving any kind… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: 13 pages, 3 figures, accepted at ACL 2023

  5. arXiv:2211.17148  [pdf, other

    cs.CL cs.AI

    ConvLab-3: A Flexible Dialogue System Toolkit Based on a Unified Data Format

    Authors: Qi Zhu, Christian Geishauser, Hsien-chin Lin, Carel van Niekerk, Baolin Peng, Zheng Zhang, Michael Heck, Nurul Lubis, Dazhen Wan, Xiaochen Zhu, Jianfeng Gao, Milica Gašić, Minlie Huang

    Abstract: Task-oriented dialogue (TOD) systems function as digital assistants, guiding users through various tasks such as booking flights or finding restaurants. Existing toolkits for building TOD systems often fall short of in delivering comprehensive arrays of data, models, and experimental environments with a user-friendly experience. We introduce ConvLab-3: a multifaceted dialogue system toolkit crafte… ▽ More

    Submitted 17 October, 2023; v1 submitted 30 November, 2022; originally announced November 2022.

  6. arXiv:2209.00876  [pdf, other

    cs.CL

    Dialogue Evaluation with Offline Reinforcement Learning

    Authors: Nurul Lubis, Christian Geishauser, Hsien-Chin Lin, Carel van Niekerk, Michael Heck, Shutong Feng, Milica Gašić

    Abstract: Task-oriented dialogue systems aim to fulfill user goals through natural language interactions. They are ideally evaluated with human users, which however is unattainable to do at every iteration of the development phase. Simulated users could be an alternative, however their development is nontrivial. Therefore, researchers resort to offline metrics on existing human-human corpora, which are more… ▽ More

    Submitted 2 September, 2022; originally announced September 2022.

    Comments: Accepted as long paper at SIGDIAL 2022

  7. arXiv:2208.10817  [pdf, other

    cs.CL

    GenTUS: Simulating User Behaviour and Language in Task-oriented Dialogues with Generative Transformers

    Authors: Hsien-Chin Lin, Christian Geishauser, Shutong Feng, Nurul Lubis, Carel van Niekerk, Michael Heck, Milica Gašić

    Abstract: User simulators (USs) are commonly used to train task-oriented dialogue systems (DSs) via reinforcement learning. The interactions often take place on semantic level for efficiency, but there is still a gap from semantic actions to natural language, which causes a mismatch between training and deployment environment. Incorporating a natural language generation (NLG) module with USs during training… ▽ More

    Submitted 23 August, 2022; originally announced August 2022.

    Comments: Accepted as a long paper to SIGDial 2022

  8. Dialogue Term Extraction using Transfer Learning and Topological Data Analysis

    Authors: Renato Vukovic, Michael Heck, Benjamin Matthias Ruppik, Carel van Niekerk, Marcus Zibrowius, Milica Gašić

    Abstract: Goal oriented dialogue systems were originally designed as a natural language interface to a fixed data-set of entities that users might inquire about, further described by domain, slots, and values. As we move towards adaptable dialogue systems where knowledge about domains, slots, and values may change, there is an increasing need to automatically extract these terms from raw dialogues or relate… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: Accepted as a long paper to SIGDIAL 2022 (Edinburgh)

  9. arXiv:2204.05928  [pdf, other

    cs.CL cs.LG

    Dynamic Dialogue Policy for Continual Reinforcement Learning

    Authors: Christian Geishauser, Carel van Niekerk, Nurul Lubis, Michael Heck, Hsien-Chin Lin, Shutong Feng, Milica Gašić

    Abstract: Continual learning is one of the key components of human learning and a necessary requirement of artificial intelligence. As dialogue can potentially span infinitely many topics and tasks, a task-oriented dialogue system must have the capability to continually learn, dynamically adapting to new challenges while preserving the knowledge it already acquired. Despite the importance, continual reinfor… ▽ More

    Submitted 10 October, 2022; v1 submitted 12 April, 2022; originally announced April 2022.

  10. arXiv:2202.03354  [pdf, other

    cs.CL

    Robust Dialogue State Tracking with Weak Supervision and Sparse Data

    Authors: Michael Heck, Nurul Lubis, Carel van Niekerk, Shutong Feng, Christian Geishauser, Hsien-Chin Lin, Milica Gašić

    Abstract: Generalising dialogue state tracking (DST) to new data is especially challenging due to the strong reliance on abundant and fine-grained supervision during training. Sample sparsity, distributional shift and the occurrence of new concepts and topics frequently lead to severe performance degradation during inference. In this paper we propose a training strategy to build extractive DST models withou… ▽ More

    Submitted 9 August, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: 12 pages, 6 figures, pre-MIT Press publication version (author's final version), accepted for publication in TACL

  11. arXiv:2109.07129  [pdf, other

    cs.LG cs.CL

    What Does The User Want? Information Gain for Hierarchical Dialogue Policy Optimisation

    Authors: Christian Geishauser, Songbo Hu, Hsien-chin Lin, Nurul Lubis, Michael Heck, Shutong Feng, Carel van Niekerk, Milica Gašić

    Abstract: The dialogue management component of a task-oriented dialogue system is typically optimised via reinforcement learning (RL). Optimisation via RL is highly susceptible to sample inefficiency and instability. The hierarchical approach called Feudal Dialogue Management takes a step towards more efficient learning by decomposing the action space. However, it still suffers from instability due to the r… ▽ More

    Submitted 15 September, 2021; originally announced September 2021.

  12. arXiv:2109.04919  [pdf, other

    cs.CL

    EmoWOZ: A Large-Scale Corpus and Labelling Scheme for Emotion Recognition in Task-Oriented Dialogue Systems

    Authors: Shutong Feng, Nurul Lubis, Christian Geishauser, Hsien-chin Lin, Michael Heck, Carel van Niekerk, Milica Gašić

    Abstract: The ability to recognise emotions lends a conversational artificial intelligence a human touch. While emotions in chit-chat dialogues have received substantial attention, emotions in task-oriented dialogues remain largely unaddressed. This is despite emotions and dialogue success having equally important roles in a natural system. Existing emotion-annotated task-oriented corpora are limited in siz… ▽ More

    Submitted 2 May, 2022; v1 submitted 10 September, 2021; originally announced September 2021.

    Comments: Accepted for publication at LREC 2022

  13. arXiv:2109.04349  [pdf, other

    cs.CL

    Uncertainty Measures in Neural Belief Tracking and the Effects on Dialogue Policy Performance

    Authors: Carel van Niekerk, Andrey Malinin, Christian Geishauser, Michael Heck, Hsien-chin Lin, Nurul Lubis, Shutong Feng, Milica Gašić

    Abstract: The ability to identify and resolve uncertainty is crucial for the robustness of a dialogue system. Indeed, this has been confirmed empirically on systems that utilise Bayesian approaches to dialogue belief tracking. However, such systems consider only confidence estimates and have difficulty scaling to more complex settings. Neural dialogue systems, on the other hand, rarely take uncertainties in… ▽ More

    Submitted 9 September, 2021; originally announced September 2021.

    Comments: 14 pages, 2 figures, accepted at EMNLP 2021 Main conference, Code at: https://gitlab.cs.uni-duesseldorf.de/general/dsml/setsumbt-public

  14. arXiv:2106.08838  [pdf, other

    cs.CL

    Domain-independent User Simulation with Transformers for Task-oriented Dialogue Systems

    Authors: Hsien-chin Lin, Nurul Lubis, Songbo Hu, Carel van Niekerk, Christian Geishauser, Michael Heck, Shutong Feng, Milica Gašić

    Abstract: Dialogue policy optimisation via reinforcement learning requires a large number of training interactions, which makes learning with real users time consuming and expensive. Many set-ups therefore rely on a user simulator instead of humans. These user simulators have their own problems. While hand-coded, rule-based user simulators have been shown to be sufficient in small, simple domains, for compl… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

  15. arXiv:2011.09379  [pdf, other

    cs.CL

    Out-of-Task Training for Dialog State Tracking Models

    Authors: Michael Heck, Carel van Niekerk, Nurul Lubis, Christian Geishauser, Hsien-Chin Lin, Marco Moresi, Milica Gašić

    Abstract: Dialog state tracking (DST) suffers from severe data sparsity. While many natural language processing (NLP) tasks benefit from transfer learning and multi-task learning, in dialog these methods are limited by the amount of available data and by the specificity of dialog applications. In this work, we successfully utilize non-dialog data from unrelated NLP tasks to train dialog state trackers. This… ▽ More

    Submitted 18 November, 2020; originally announced November 2020.

    Comments: 8 pages, 2 figures, to be published in Proceedings of the 28th International Conference on Computational Linguistics, Code at https://gitlab.cs.uni-duesseldorf.de/general/dsml/trippy-public

  16. arXiv:2011.09378  [pdf, other

    cs.CL

    LAVA: Latent Action Spaces via Variational Auto-encoding for Dialogue Policy Optimization

    Authors: Nurul Lubis, Christian Geishauser, Michael Heck, Hsien-chin Lin, Marco Moresi, Carel van Niekerk, Milica Gašić

    Abstract: Reinforcement learning (RL) can enable task-oriented dialogue systems to steer the conversation towards successful task completion. In an end-to-end setting, a response can be constructed in a word-level sequential decision making process with the entire system vocabulary as action space. Policies trained in such a fashion do not require expert-defined action spaces, but they have to deal with lar… ▽ More

    Submitted 18 November, 2020; originally announced November 2020.

    Comments: 15 pages. To be published as long paper in Proceedings of The 28th International Conference on Computational Linguistics (COLING 2020). Code can be accessed at https://gitlab.cs.uni-duesseldorf.de/general/dsml/lava-public

  17. arXiv:2010.02586  [pdf, other

    cs.CL cs.AI

    Knowing What You Know: Calibrating Dialogue Belief State Distributions via Ensembles

    Authors: Carel van Niekerk, Michael Heck, Christian Geishauser, Hsien-Chin Lin, Nurul Lubis, Marco Moresi, Milica Gašić

    Abstract: The ability to accurately track what happens during a conversation is essential for the performance of a dialogue system. Current state-of-the-art multi-domain dialogue state trackers achieve just over 55% accuracy on the current go-to benchmark, which means that in almost every second dialogue turn they place full confidence in an incorrect dialogue state. Belief trackers, on the other hand, main… ▽ More

    Submitted 5 November, 2020; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: 7 pages, 9 figures, to be published in Findings of EMNLP 2020, code available at: https://gitlab.cs.uni-duesseldorf.de/general/dsml/calibrating-dialogue-belief-state-distributions

    Journal ref: Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, Pages 3096-3102; Association for Computational Linguistics

  18. arXiv:2005.02877  [pdf, other

    cs.CL

    TripPy: A Triple Copy Strategy for Value Independent Neural Dialog State Tracking

    Authors: Michael Heck, Carel van Niekerk, Nurul Lubis, Christian Geishauser, Hsien-Chin Lin, Marco Moresi, Milica Gašić

    Abstract: Task-oriented dialog systems rely on dialog state tracking (DST) to monitor the user's goal during the course of an interaction. Multi-domain and open-vocabulary settings complicate the task considerably and demand scalable solutions. In this paper we present a new approach to DST which makes use of various copy mechanisms to fill slots with values. Our model has no need to maintain a list of cand… ▽ More

    Submitted 25 September, 2020; v1 submitted 6 May, 2020; originally announced May 2020.

    Comments: 10 pages, 6 figures, published in Proceedings of the 21st Annual SIGdial Meeting on Discourse and Dialogue, Code at: https://gitlab.cs.uni-duesseldorf.de/general/dsml/trippy-public

    Journal ref: Proceedings of the 21th Annual Meeting of the Special Interest Group on Discourse and Dialogue (July 2020), Pages 35-44; Association for Computational Linguistics