Showing 1–50 of 96 results for author: Hakkani-Tur, D

  1. arXiv:2406.11709  [pdf, other]

    cs.CL cs.MA

    Instruct, Not Assist: LLM-based Multi-Turn Planning and Hierarchical Questioning for Socratic Code Debugging

    Authors: Priyanka Kargupta, Ishika Agarwal, Dilek Hakkani-Tur, Jiawei Han

    Abstract: Socratic questioning is an effective teaching strategy, encouraging critical thinking and problem-solving. The conversational capabilities of large language models (LLMs) show great potential for providing scalable, real-time student guidance. However, current LLMs often give away solutions directly, making them ineffective instructors. We tackle this issue in the code debugging domain with TreeIn…

    Submitted 25 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2311.17376  [pdf, other]

    cs.CL

    CESAR: Automatic Induction of Compositional Instructions for Multi-turn Dialogs

    Authors: Taha Aksu, Devamanyu Hazarika, Shikib Mehri, Seokhwan Kim, Dilek Hakkani-Tür, Yang Liu, Mahdi Namazifar

    Abstract: Instruction-based multitasking has played a critical role in the success of large language models (LLMs) in multi-turn dialog applications. While publicly available LLMs have shown promising performance, when exposed to complex instructions with multiple constraints, they lag against state-of-the-art models like ChatGPT. In this work, we hypothesize that the availability of large-scale complex dem…

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023

  3. arXiv:2308.11995  [pdf, other]

    cs.CL cs.AI

    Topical-Chat: Towards Knowledge-Grounded Open-Domain Conversations

    Authors: Karthik Gopalakrishnan, Behnam Hedayatnia, Qinlang Chen, Anna Gottardi, Sanjeev Kwatra, Anu Venkatesh, Raefer Gabriel, Dilek Hakkani-Tur

    Abstract: Building socialbots that can have deep, engaging open-domain conversations with humans is one of the grand challenges of artificial intelligence (AI). To this end, bots need to be able to leverage world knowledge spanning several domains effectively when conversing with humans who have their own world knowledge. Existing knowledge-grounded conversation datasets are primarily stylized with explicit…

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: arXiving an old paper accepted at INTERSPEECH 2019

  4. arXiv:2308.05221  [pdf, other]

    cs.HC cs.AI cs.RO

    Alexa, play with robot: Introducing the First Alexa Prize SimBot Challenge on Embodied AI

    Authors: Hangjie Shi, Leslie Ball, Govind Thattai, Desheng Zhang, Lucy Hu, Qiaozi Gao, Suhaila Shakiah, Xiaofeng Gao, Aishwarya Padmakumar, Bofei Yang, Cadence Chung, Dinakar Guthy, Gaurav Sukhatme, Karthika Arumugam, Matthew Wen, Osman Ipek, Patrick Lange, Rohan Khanna, Shreyas Pansare, Vasu Sharma, Chao Zhang, Cris Flagg, Daniel Pressel, Lavina Vaz, Luke Dai, et al. (17 additional authors not shown)

    Abstract: The Alexa Prize program has empowered numerous university students to explore, experiment, and showcase their talents in building conversational agents through challenges like the SocialBot Grand Challenge and the TaskBot Challenge. As conversational agents increasingly appear in multimodal and embodied contexts, it is important to explore the affordances of conversational interaction augmented wi…

    Submitted 9 August, 2023; originally announced August 2023.

  5. arXiv:2305.12091  [pdf, other]

    cs.CL

    "What do others think?": Task-Oriented Conversational Modeling with Subjective Knowledge

    Authors: Chao Zhao, Spandana Gella, Seokhwan Kim, Di Jin, Devamanyu Hazarika, Alexandros Papangelis, Behnam Hedayatnia, Mahdi Namazifar, Yang Liu, Dilek Hakkani-Tur

    Abstract: Task-oriented Dialogue (TOD) Systems aim to build dialogue systems that assist users in accomplishing specific goals, such as booking a hotel or a restaurant. Traditional TODs rely on domain-specific APIs/DBs or external factual knowledge to generate responses, which cannot accommodate subjective user requests (e.g., "Is the WIFI reliable?" or "Does the restaurant have a good atmosphere?"). To add…

    Submitted 2 October, 2023; v1 submitted 20 May, 2023; originally announced May 2023.

    Comments: SIGDIAL 2023

  6. arXiv:2305.06485  [pdf, other]

    cs.RO cs.AI cs.CL cs.HC

    Multimodal Contextualized Plan Prediction for Embodied Task Completion

    Authors: Mert İnan, Aishwarya Padmakumar, Spandana Gella, Patrick Lange, Dilek Hakkani-Tur

    Abstract: Task planning is an important component of traditional robotics systems enabling robots to compose fine grained skills to perform more complex tasks. Recent work building systems for translating natural language to executable actions for task completion in simulated embodied agents is focused on directly predicting low level action sequences that would be expected to be directly executable by a ph…

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: NILLI at EMNLP 2022

  7. arXiv:2302.11054  [pdf, other]

    cs.CL

    Conversational Text-to-SQL: An Odyssey into State-of-the-Art and Challenges Ahead

    Authors: Sree Hari Krishnan Parthasarathi, Lu Zeng, Dilek Hakkani-Tur

    Abstract: Conversational, multi-turn, text-to-SQL (CoSQL) tasks map natural language utterances in a dialogue to SQL queries. State-of-the-art (SOTA) systems use large, pre-trained and finetuned language models, such as the T5-family, in conjunction with constrained decoding. With multi-tasking (MT) over coherent tasks with discrete prompts during training, we improve over specialized text-to-SQL T5-family…

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: Accepted for publication at ICASSP 2023

  8. arXiv:2302.09170  [pdf, other]

    cs.CL cs.AI

    KILM: Knowledge Injection into Encoder-Decoder Language Models

    Authors: Yan Xu, Mahdi Namazifar, Devamanyu Hazarika, Aishwarya Padmakumar, Yang Liu, Dilek Hakkani-Tür

    Abstract: Large pre-trained language models (PLMs) have been shown to retain implicit knowledge within their parameters. To enhance this implicit knowledge, we propose Knowledge Injection into Language Models (KILM), a novel approach that injects entity-related knowledge into encoder-decoder PLMs, via a generative knowledge infilling objective through continued pre-training. This is done without architectur…

    Submitted 17 February, 2023; originally announced February 2023.

  9. arXiv:2302.08626  [pdf, other]

    cs.NE

    Role of Bias Terms in Dot-Product Attention

    Authors: Mahdi Namazifar, Devamanyu Hazarika, Dilek Hakkani-Tur

    Abstract: Dot-product attention is a core module in the present generation of neural network models, particularly transformers, and is being leveraged across numerous areas such as natural language processing and computer vision. This attention module is comprised of three linear transformations, namely query, key, and value linear transformations, each of which has a bias term. In this work, we study the r…

    Submitted 16 February, 2023; originally announced February 2023.

  10. arXiv:2302.05096  [pdf, other]

    cs.CL cs.AI

    Selective In-Context Data Augmentation for Intent Detection using Pointwise V-Information

    Authors: Yen-Ting Lin, Alexandros Papangelis, Seokhwan Kim, Sungjin Lee, Devamanyu Hazarika, Mahdi Namazifar, Di Jin, Yang Liu, Dilek Hakkani-Tur

    Abstract: This work focuses on in-context data augmentation for intent detection. Having found that augmentation via in-context prompting of large pre-trained language models (PLMs) alone does not improve performance, we introduce a novel approach based on PLMs and pointwise V-information (PVI), a metric that can measure the usefulness of a datapoint for training a model. Our method first fine-tunes a PLM o…

    Submitted 10 February, 2023; originally announced February 2023.

    Comments: Accepted at EACL 2023

  11. arXiv:2302.03269  [pdf, other]

    cs.CL cs.AI cs.IR

    PLACES: Prompting Language Models for Social Conversation Synthesis

    Authors: Maximillian Chen, Alexandros Papangelis, Chenyang Tao, Seokhwan Kim, Andy Rosenbaum, Yang Liu, Zhou Yu, Dilek Hakkani-Tur

    Abstract: Collecting high quality conversational data can be very expensive for most applications and infeasible for others due to privacy, ethical, or similar concerns. A promising direction to tackle this problem is to generate synthetic dialogues by prompting large language models. In this work, we use a small set of expert-written conversations as in-context examples to synthesize a social conversation…

    Submitted 16 February, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: In Findings of EACL 2023. 25 pages, 4 figures, 26 tables. Code available at https://github.com/alexa/PLACES

  12. arXiv:2302.00871  [pdf, other]

    cs.CL

    Using In-Context Learning to Improve Dialogue Safety

    Authors: Nicholas Meade, Spandana Gella, Devamanyu Hazarika, Prakhar Gupta, Di Jin, Siva Reddy, Yang Liu, Dilek Hakkani-Tür

    Abstract: While large neural-based conversational models have become increasingly proficient dialogue agents, recent work has highlighted safety issues with these systems. For example, these systems can be goaded into generating toxic content, which often perpetuates social biases or stereotypes. We investigate a retrieval-based method for reducing bias and toxicity in responses from chatbots. It uses in-co…

    Submitted 22 October, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: Findings of EMNLP 2023

  13. arXiv:2212.10557  [pdf, other]

    cs.CL

    DialGuide: Aligning Dialogue Model Behavior with Developer Guidelines

    Authors: Prakhar Gupta, Yang Liu, Di Jin, Behnam Hedayatnia, Spandana Gella, Sijia Liu, Patrick Lange, Julia Hirschberg, Dilek Hakkani-Tur

    Abstract: Dialogue models are able to generate coherent and fluent responses, but they can still be challenging to control and may produce non-engaging, unsafe results. This unpredictability diminishes user trust and can hinder the use of the models in the real world. To address this, we introduce DialGuide, a novel framework for controlling dialogue model behavior using natural language rules, or guideline…

    Submitted 21 May, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

  14. arXiv:2210.14469  [pdf, other]

    cs.CL cs.LG

    Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning

    Authors: Yifan Chen, Devamanyu Hazarika, Mahdi Namazifar, Yang Liu, Di Jin, Dilek Hakkani-Tur

    Abstract: Prefix-tuning, or more generally continuous prompt tuning, has become an essential paradigm of parameter-efficient transfer learning. Using a large pre-trained language model (PLM), prefix-tuning can obtain strong performance by training only a small portion of parameters. In this paper, we propose to understand and further develop prefix-tuning through the kernel lens. Specifically, we make an an…

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: To appear in EMNLP 2022. Code is available at https://github.com/ychen-stat-ml/kernel-adapters

  15. arXiv:2210.14169  [pdf, other]

    cs.CL cs.AI cs.LG

    Weakly Supervised Data Augmentation Through Prompting for Dialogue Understanding

    Authors: Maximillian Chen, Alexandros Papangelis, Chenyang Tao, Andy Rosenbaum, Seokhwan Kim, Yang Liu, Zhou Yu, Dilek Hakkani-Tur

    Abstract: Dialogue understanding tasks often necessitate abundant annotated data to achieve good performance and that presents challenges in low-resource settings. To alleviate this barrier, we explore few-shot data augmentation for dialogue understanding by prompting large pre-trained language models and present a novel approach that iterates on augmentation quality by applying weakly-supervised filters. W…

    Submitted 2 November, 2022; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: To appear in SyntheticData4ML @ NeurIPS 2022. 16 pages, 10 figures, 3 tables

  16. arXiv:2210.10668  [pdf, other]

    cs.CL cs.AI

    N-Best Hypotheses Reranking for Text-To-SQL Systems

    Authors: Lu Zeng, Sree Hari Krishnan Parthasarathi, Dilek Hakkani-Tur

    Abstract: Text-to-SQL task maps natural language utterances to structured queries that can be issued to a database. State-of-the-art (SOTA) systems rely on finetuning large, pre-trained language models in conjunction with constrained decoding applying a SQL parser. On the well established Spider dataset, we begin with Oracle studies: specifically, choosing an Oracle hypothesis from a SOTA model's 10-best li…

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted for publication at IEEE SLT'22

  17. arXiv:2209.12953  [pdf, other]

    cs.CL cs.CV

    Dialog Acts for Task-Driven Embodied Agents

    Authors: Spandana Gella, Aishwarya Padmakumar, Patrick Lange, Dilek Hakkani-Tur

    Abstract: Embodied agents need to be able to interact in natural language understanding task descriptions and asking appropriate follow up questions to obtain necessary information to be effective at successfully accomplishing tasks for a wide range of users. In this work, we propose a set of dialog acts for modelling such dialogs and annotate the TEACh dataset that includes over 3,000 situated, task orient…

    Submitted 26 September, 2022; originally announced September 2022.

    Comments: accepted at SIGDIAL 2022

  18. arXiv:2209.06321  [pdf, other]

    cs.CL cs.AI cs.HC

    Alexa, Let's Work Together: Introducing the First Alexa Prize TaskBot Challenge on Conversational Task Assistance

    Authors: Anna Gottardi, Osman Ipek, Giuseppe Castellucci, Shui Hu, Lavina Vaz, Yao Lu, Anju Khatri, Anjali Chadha, Desheng Zhang, Sattvik Sahai, Prerna Dwivedi, Hangjie Shi, Lucy Hu, Andy Huang, Luke Dai, Bofei Yang, Varun Somani, Pankaj Rajan, Ron Rezac, Michael Johnston, Savanna Stiff, Leslie Ball, David Carmel, Yang Liu, Dilek Hakkani-Tur, et al. (5 additional authors not shown)

    Abstract: Since its inception in 2016, the Alexa Prize program has enabled hundreds of university students to explore and compete to develop conversational agents through the SocialBot Grand Challenge. The goal of the challenge is to build agents capable of conversing coherently and engagingly with humans on popular topics for 20 minutes, while achieving an average rating of at least 4.0/5.0. However, as co…

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 14 pages, Proceedings of Alexa Prize Taskbot (Alexa Prize 2021)

    ACM Class: I.2.7; J.0; H.5.1; H.5.2

  19. arXiv:2208.04379  [pdf, other]

    cs.CL

    A Systematic Evaluation of Response Selection for Open Domain Dialogue

    Authors: Behnam Hedayatnia, Di Jin, Yang Liu, Dilek Hakkani-Tur

    Abstract: Recent progress on neural approaches for language processing has triggered a resurgence of interest on building intelligent open-domain chatbots. However, even the state-of-the-art neural chatbots cannot produce satisfying responses for every turn in a dialog. A practical solution is to generate multiple response candidates for the same context, and then perform response ranking/selection to deter…

    Submitted 8 August, 2022; originally announced August 2022.

    Comments: Accepted at SIGDial 2022. 14 pages, 9 figures, 2 tables

  20. arXiv:2207.11862  [pdf, other]

    cs.CL cs.AI

    Improving Bot Response Contradiction Detection via Utterance Rewriting

    Authors: Di Jin, Sijia Liu, Yang Liu, Dilek Hakkani-Tur

    Abstract: Though chatbots based on large neural models can often produce fluent responses in open domain conversations, one salient error type is contradiction or inconsistency with the preceding conversation turns. Previous work has treated contradiction detection in bot responses as a task similar to natural language inference, e.g., detect the contradiction between a pair of bot utterances. However, utte…

    Submitted 24 July, 2022; originally announced July 2022.

    Comments: Accepted by SIGDial 2022

  21. arXiv:2207.11363  [pdf, other]

    cs.CL

    Knowledge-Grounded Conversational Data Augmentation with Generative Conversational Networks

    Authors: Yen-Ting Lin, Alexandros Papangelis, Seokhwan Kim, Dilek Hakkani-Tur

    Abstract: While rich, open-domain textual data are generally available and may include interesting phenomena (humor, sarcasm, empathy, etc.) most are designed for language processing tasks, and are usually in a non-conversational format. In this work, we take a step towards automatically generating conversational data using Generative Conversational Networks, aiming to benefit from the breadth of available…

    Submitted 22 July, 2022; originally announced July 2022.

    Comments: Accepted at SIGDial 2022

  22. arXiv:2206.07808  [pdf, other]

    cs.CL cs.AI cs.LG

    Alexa Teacher Model: Pretraining and Distilling Multi-Billion-Parameter Encoders for Natural Language Understanding Systems

    Authors: Jack FitzGerald, Shankar Ananthakrishnan, Konstantine Arkoudas, Davide Bernardi, Abhishek Bhagia, Claudio Delli Bovi, Jin Cao, Rakesh Chada, Amit Chauhan, Luoxin Chen, Anurag Dwarakanath, Satyam Dwivedi, Turan Gojayev, Karthik Gopalakrishnan, Thomas Gueudre, Dilek Hakkani-Tur, Wael Hamza, Jonathan Hueser, Kevin Martin Jose, Haidar Khan, Beiye Liu, Jianhua Lu, Alessandro Manzotti, Pradeep Natarajan, Karolina Owczarzak, et al. (16 additional authors not shown)

    Abstract: We present results from a large-scale experiment on pretraining encoders with non-embedding parameter counts ranging from 700M to 9.3B, their subsequent distillation into smaller models ranging from 17M-170M parameters, and their application to the Natural Language Understanding (NLU) component of a virtual assistant system. Though we train using 70% spoken-form data, our teacher models perform co…

    Submitted 15 June, 2022; originally announced June 2022.

    Comments: KDD 2022

    ACM Class: I.2.7

    Journal ref: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '22), August 14-18, 2022, Washington, DC, USA

  23. arXiv:2206.07296  [pdf, other]

    cs.CL

    Enhanced Knowledge Selection for Grounded Dialogues via Document Semantic Graphs

    Authors: Sha Li, Mahdi Namazifar, Di Jin, Mohit Bansal, Heng Ji, Yang Liu, Dilek Hakkani-Tur

    Abstract: Providing conversation models with background knowledge has been shown to make open-domain dialogues more informative and engaging. Existing models treat knowledge selection as a sentence ranking or classification problem where each sentence is handled individually, ignoring the internal semantic connection among sentences in the background document. In this work, we propose to automatically conve…

    Submitted 30 June, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: NAACL 2022. Please refer to https://www.amazon.science/publications/enhanced-knowledge-selection-for-grounded-dialogues-via-document-semantic-graphs for code and resources

  24. arXiv:2206.00583  [pdf, other]

    cs.LG

    Calibrate and Debias Layer-wise Sampling for Graph Convolutional Networks

    Authors: Yifan Chen, Tianning Xu, Dilek Hakkani-Tur, Di Jin, Yun Yang, Ruoqing Zhu

    Abstract: Multiple sampling-based methods have been developed for approximating and accelerating node embedding aggregation in graph convolutional networks (GCNs) training. Among them, a layer-wise approach recursively performs importance sampling to select neighbors jointly for existing nodes in each layer. This paper revisits the approach from a matrix approximation perspective, and identifies two issues…

    Submitted 15 June, 2023; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: Published at TMLR. Code is available at https://github.com/ychen-stat-ml/GCN-layer-wise-sampling

  25. arXiv:2206.00167  [pdf, other]

    cs.CL

    Understanding How People Rate Their Conversations

    Authors: Alexandros Papangelis, Nicole Chartier, Pankaj Rajan, Julia Hirschberg, Dilek Hakkani-Tur

    Abstract: User ratings play a significant role in spoken dialogue systems. Typically, such ratings tend to be averaged across all users and then utilized as feedback to improve the system or personalize its behavior. While this method can be useful to understand broad, general issues with the system and its behavior, it does not take into account differences between users that affect their ratings. In this…

    Submitted 31 May, 2022; originally announced June 2022.

    Comments: Published at IWSDS 2021

  26. arXiv:2205.09249  [pdf, other]

    cs.CL cs.AI cs.CV cs.RO

    On the Limits of Evaluating Embodied Agent Model Generalization Using Validation Sets

    Authors: Hyounghun Kim, Aishwarya Padmakumar, Di Jin, Mohit Bansal, Dilek Hakkani-Tur

    Abstract: Natural language guided embodied task completion is a challenging problem since it requires understanding natural language instructions, aligning them with egocentric visual observations, and choosing appropriate actions to execute in the environment to produce desired changes. We experiment with augmenting a transformer model for this task with modules that effectively utilize a wider field of vi…

    Submitted 18 May, 2022; originally announced May 2022.

    Comments: ACL 2022 Insights Workshop (6 pages)

  27. arXiv:2205.03720  [pdf, other]

    cs.CL cs.LG

    Empowering parameter-efficient transfer learning by recognizing the kernel structure in self-attention

    Authors: Yifan Chen, Devamanyu Hazarika, Mahdi Namazifar, Yang Liu, Di Jin, Dilek Hakkani-Tur

    Abstract: The massive amount of trainable parameters in the pre-trained language models (PLMs) makes them hard to be deployed to multiple downstream tasks. To address this issue, parameter-efficient transfer learning methods have been proposed to tune only a few parameters during fine-tuning while freezing the rest. This paper looks at existing methods along this line through the kernel lens. Motiv…

    Submitted 26 October, 2022; v1 submitted 7 May, 2022; originally announced May 2022.

    Comments: Accepted in NAACL 2022. Code is available at https://github.com/ychen-stat-ml/kernel-adapters

  28. arXiv:2203.13927  [pdf, other]

    cs.CL

    What is wrong with you?: Leveraging User Sentiment for Automatic Dialog Evaluation

    Authors: Sarik Ghazarian, Behnam Hedayatnia, Alexandros Papangelis, Yang Liu, Dilek Hakkani-Tur

    Abstract: Accurate automatic evaluation metrics for open-domain dialogs are in high demand. Existing model-based metrics for system response evaluation are trained on human annotated data, which is cumbersome to collect. In this work, we propose to use information that can be automatically extracted from the next user utterance, such as its sentiment or whether the user explicitly ends the conversation, as…

    Submitted 25 March, 2022; originally announced March 2022.

    Comments: Accepted at ACL Findings 2022. 11 pages, 8 figures, 5 tables

  29. arXiv:2203.11396  [pdf, other]

    cs.CL cs.AI cs.LG

    Towards Textual Out-of-Domain Detection without In-Domain Labels

    Authors: Di Jin, Shuyang Gao, Seokhwan Kim, Yang Liu, Dilek Hakkani-Tur

    Abstract: In many real-world settings, machine learning models need to identify user inputs that are out-of-domain (OOD) so as to avoid performing wrong actions. This work focuses on a challenging case of OOD detection, where no labels for in-domain data are accessible (e.g., no intent labels for the intent classification task). To this end, we first evaluate different language model based approaches that p…

    Submitted 21 March, 2022; originally announced March 2022.

    Comments: Accepted by IEEE/ACM Transactions on Audio Speech and Language

  30. arXiv:2203.10012  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    Report from the NSF Future Directions Workshop on Automatic Evaluation of Dialog: Research Directions and Challenges

    Authors: Shikib Mehri, Jinho Choi, Luis Fernando D'Haro, Jan Deriu, Maxine Eskenazi, Milica Gasic, Kallirroi Georgila, Dilek Hakkani-Tur, Zekang Li, Verena Rieser, Samira Shaikh, David Traum, Yi-Ting Yeh, Zhou Yu, Yizhe Zhang, Chen Zhang

    Abstract: This is a report on the NSF Future Directions Workshop on Automatic Evaluation of Dialog. The workshop explored the current state of the art along with its limitations and suggested promising directions for future work in this important and very rapidly changing area of research.

    Submitted 18 March, 2022; originally announced March 2022.

    Comments: Report from the NSF AED Workshop (http://dialrc.org/AED/)

  31. arXiv:2203.00763  [pdf, other]

    cs.CL

    Multi-Sentence Knowledge Selection in Open-Domain Dialogue

    Authors: Mihail Eric, Nicole Chartier, Behnam Hedayatnia, Karthik Gopalakrishnan, Pankaj Rajan, Yang Liu, Dilek Hakkani-Tur

    Abstract: Incorporating external knowledge sources effectively in conversations is a longstanding problem in open-domain dialogue research. The existing literature on open-domain knowledge selection is limited and makes certain brittle assumptions on knowledge sources to simplify the overall task (Dinan et al., 2019), such as the existence of a single relevant knowledge sentence per context. In this work, w…

    Submitted 4 October, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

    Comments: Accepted at INLG 2021. 11 pages, 5 tables, 8 figures

  32. arXiv:2112.08637  [pdf, other]

    cs.CL cs.AI

    Analyzing the Limits of Self-Supervision in Handling Bias in Language

    Authors: Lisa Bauer, Karthik Gopalakrishnan, Spandana Gella, Yang Liu, Mohit Bansal, Dilek Hakkani-Tur

    Abstract: Prompting inputs with natural language task descriptions has emerged as a popular mechanism to elicit reasonably accurate outputs from large-scale generative language models with little to no in-context supervision. This also helps gain insight into how well language models capture the semantics of a wide range of downstream tasks purely from self-supervised pre-training on massive corpora of unla…

    Submitted 16 August, 2023; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: Accepted at Findings of the Conference on Empirical Methods in Natural Language Processing (EMNLP) 2022

  33. arXiv:2112.05842  [pdf, other]

    cs.CL cs.LG eess.AS

    Revisiting the Boundary between ASR and NLU in the Age of Conversational Dialog Systems

    Authors: Manaal Faruqui, Dilek Hakkani-Tür

    Abstract: As more users across the world are interacting with dialog agents in their daily life, there is a need for better speech understanding that calls for renewed attention to the dynamics between research in automatic speech recognition (ASR) and natural language understanding (NLU). We briefly review these research areas and lay out the current relationship between them. In light of the observations…

    Submitted 10 December, 2021; originally announced December 2021.

    Comments: Accepted to be published at Computational Linguistics Journal 2022

  34. arXiv:2112.05359  [pdf, other]

    cs.LG cs.CL stat.ML

    Sketching as a Tool for Understanding and Accelerating Self-attention for Long Sequences

    Authors: Yifan Chen, Qi Zeng, Dilek Hakkani-Tur, Di Jin, Heng Ji, Yun Yang

    Abstract: Transformer-based models are not efficient in processing long sequences due to the quadratic space and time complexity of the self-attention modules. To address this limitation, Linformer and Informer are proposed to reduce the quadratic complexity to linear (modulo logarithmic factors) via low-dimensional projection and row selection respectively. These two models are intrinsically connected, and…

    Submitted 10 December, 2021; originally announced December 2021.

  35. arXiv:2111.08808  [pdf, other]

    cs.CL

    User Response and Sentiment Prediction for Automatic Dialogue Evaluation

    Authors: Sarik Ghazarian, Behnam Hedayatnia, Alexandros Papangelis, Yang Liu, Dilek Hakkani-Tur

    Abstract: Automatic evaluation is beneficial for open-domain dialog system development. However, standard word-overlap metrics (BLEU, ROUGE) do not correlate well with human judgements of open-domain dialog systems. In this work we propose to use the sentiment of the next user utterance for turn or dialog level evaluation. Specifically we propose three methods: one that predicts the next sentiment directly,…

    Submitted 16 February, 2022; v1 submitted 16 November, 2021; originally announced November 2021.

    Comments: Accepted at EMNLP 2021 Evaluations and Assessments of Neural Conversation Systems Workshop. 2 pages, 1 table

  36. arXiv:2110.08501  [pdf, other]

    cs.CL

    Think Before You Speak: Explicitly Generating Implicit Commonsense Knowledge for Response Generation

    Authors: Pei Zhou, Karthik Gopalakrishnan, Behnam Hedayatnia, Seokhwan Kim, Jay Pujara, Xiang Ren, Yang Liu, Dilek Hakkani-Tur

    Abstract: Implicit knowledge, such as common sense, is key to fluid human conversations. Current neural response generation (RG) models are trained to generate responses directly, omitting unstated implicit knowledge. In this paper, we present Think-Before-Speaking (TBS), a generative approach to first externalize implicit commonsense knowledge (think) and use this knowledge to generate responses (speak). W…

    Submitted 11 September, 2023; v1 submitted 16 October, 2021; originally announced October 2021.

    Comments: Accepted at ACL 2022 main conference. 16 pages, 9 figures, 9 tables

  37. arXiv:2110.08383  [pdf, other]

    cs.CL

    Training Conversational Agents with Generative Conversational Networks

    Authors: Yen-Ting Lin, Alexandros Papangelis, Seokhwan Kim, Dilek Hakkani-Tur

    Abstract: Rich, open-domain textual data available on the web resulted in great advancements for language processing. However, while that data may be suitable for language processing tasks, they are mostly non-conversational, lacking many phenomena that appear in human interactions and this is one of the reasons why we still have many unsolved challenges in conversational AI. In this work, we attempt to add…

    Submitted 15 October, 2021; originally announced October 2021.

    Comments: Accepted at WeCNLP 2021

  38. arXiv:2110.05456  [pdf, other]

    cs.CL cs.AI

    Rome was built in 1776: A Case Study on Factual Correctness in Knowledge-Grounded Response Generation

    Authors: Sashank Santhanam, Behnam Hedayatnia, Spandana Gella, Aishwarya Padmakumar, Seokhwan Kim, Yang Liu, Dilek Hakkani-Tur

    Abstract: Recently neural response generation models have leveraged large pre-trained transformer models and knowledge snippets to generate relevant and informative responses. However, this does not guarantee that generated responses are factually correct. In this paper, we examine factual correctness in knowledge-grounded neural response generation models. We present a human annotation setup to identify th…

    Submitted 4 October, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

  39. arXiv:2110.00534  [pdf, other]

    cs.CV cs.AI cs.CL cs.RO

    TEACh: Task-driven Embodied Agents that Chat

    Authors: Aishwarya Padmakumar, Jesse Thomason, Ayush Shrivastava, Patrick Lange, Anjali Narayan-Chen, Spandana Gella, Robinson Piramuthu, Gokhan Tur, Dilek Hakkani-Tur

    Abstract: Robots operating in human spaces must be able to engage in natural language interaction with people, both understanding and executing instructions, and using conversation to resolve ambiguity and recover from mistakes. To study this, we introduce TEACh, a dataset of over 3,000 human--human, interactive dialogues to complete household tasks in simulation. A Commander with access to oracle informati…

    Submitted 28 December, 2021; v1 submitted 1 October, 2021; originally announced October 2021.

    Comments: Accepted at AAAI 2022; 7 pages main, 28 pages total, 29 figures; Version 3 uses a new test set for EDH instances that restrict evaluation to state changes only on task-relevant objects

  40. arXiv:2109.13489  [pdf, other]

    cs.CL

    "How Robust r u?": Evaluating Task-Oriented Dialogue Systems on Spoken Conversations

    Authors: Seokhwan Kim, Yang Liu, Di Jin, Alexandros Papangelis, Karthik Gopalakrishnan, Behnam Hedayatnia, Dilek Hakkani-Tur

    Abstract: Most prior work in dialogue modeling has been on written conversations mostly because of existing data sets. However, written dialogues are not sufficient to fully capture the nature of spoken conversations as well as the potential speech recognition errors in practical spoken dialogue systems. This work presents a new benchmark on spoken task-oriented conversations, which is intended to study mul…

    Submitted 28 September, 2021; originally announced September 2021.

    Comments: To be presented at ASRU 2021

  41. arXiv:2109.12211  [pdf, other]

    cs.CL

    Style Control for Schema-Guided Natural Language Generation

    Authors: Alicia Y. Tsai, Shereen Oraby, Vittorio Perera, Jiun-Yu Kao, Yuheng Du, Anjali Narayan-Chen, Tagyoung Chung, Dilek Hakkani-Tur

    Abstract: Natural Language Generation (NLG) for task-oriented dialogue systems focuses on communicating specific content accurately, fluently, and coherently. While these attributes are crucial for a successful dialogue, it is also desirable to simultaneously accomplish specific stylistic goals, such as response length, point-of-view, descriptiveness, sentiment, formality, and empathy. In this work, we focu…

    Submitted 24 September, 2021; originally announced September 2021.

    Comments: Accepted at the 3rd Workshop on NLP for ConvAI at EMNLP '21

  42. arXiv:2109.08820  [pdf, other]

    cs.CL cs.AI cs.LG

    Towards Zero and Few-shot Knowledge-seeking Turn Detection in Task-orientated Dialogue Systems

    Authors: Di Jin, Shuyang Gao, Seokhwan Kim, Yang Liu, Dilek Hakkani-Tur

    Abstract: Most prior work on task-oriented dialogue systems is restricted to supporting domain APIs. However, users may have requests that are out of the scope of these APIs. This work focuses on identifying such user requests. Existing methods for this task mainly rely on fine-tuning pre-trained models on large annotated data. We propose a novel method, REDE, based on adaptive representation learning and d…

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: To appear at NLP4ConvAI workshop of EMNLP 2021

  43. arXiv:2109.06427  [pdf, ps, other]

    cs.CL

    Commonsense-Focused Dialogues for Response Generation: An Empirical Study

    Authors: Pei Zhou, Karthik Gopalakrishnan, Behnam Hedayatnia, Seokhwan Kim, Jay Pujara, Xiang Ren, Yang Liu, Dilek Hakkani-Tur

    Abstract: Smooth and effective communication requires the ability to perform latent or explicit commonsense inference. Prior commonsense reasoning benchmarks (such as SocialIQA and CommonsenseQA) mainly focus on the discriminative task of choosing the right answer from a set of candidates, and do not involve interactive language generation as in dialogue. Moreover, existing dialogue datasets do not explicit…

    Submitted 21 September, 2021; v1 submitted 14 September, 2021; originally announced September 2021.

    Comments: Accepted at SIGDIAL 2021. 12 pages, 5 tables

  44. arXiv:2106.09174  [pdf, other]

    cs.CL cs.AI cs.LG

    Can I Be of Further Assistance? Using Unstructured Knowledge Access to Improve Task-oriented Conversational Modeling

    Authors: Di Jin, Seokhwan Kim, Dilek Hakkani-Tur

    Abstract: Most prior work on task-oriented dialogue systems are restricted to limited coverage of domain APIs. However, users oftentimes have requests that are out of the scope of these APIs. This work focuses on responding to these beyond-API-coverage user turns by incorporating external, unstructured knowledge sources. Our approach works in a pipelined manner with knowledge-seeking turn detection, knowled…

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: Presented as a DIALDOC workshop paper at ACL 2021

  45. arXiv:2106.08484  [pdf, other]

    cs.CL cs.HC

    Generative Conversational Networks

    Authors: Alexandros Papangelis, Karthik Gopalakrishnan, Aishwarya Padmakumar, Seokhwan Kim, Gokhan Tur, Dilek Hakkani-Tur

    Abstract: Inspired by recent work in meta-learning and generative teaching networks, we propose a framework called Generative Conversational Networks, in which conversational agents learn to generate their own labelled training data (given some seed data) and then train themselves from that data to perform a given task. We use reinforcement learning to optimize the data generation process where the reward s…

    Submitted 16 July, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: SIGDial 2021

  46. arXiv:2106.06411  [pdf, other]

    cs.CL cs.AI

    Zero-Shot Controlled Generation with Encoder-Decoder Transformers

    Authors: Devamanyu Hazarika, Mahdi Namazifar, Dilek Hakkani-Tür

    Abstract: Controlling neural network-based models for natural language generation (NLG) has broad applications in numerous areas such as machine translation, document summarization, and dialog systems. Approaches that enable such control in a zero-shot manner would be of great importance as, among other reasons, they remove the need for additional annotated data and training. In this work, we propose novel…

    Submitted 6 April, 2022; v1 submitted 11 June, 2021; originally announced June 2021.

    Comments: Accepted at AAAI 2022

  47. arXiv:2105.11589  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG cs.RO

    VISITRON: Visual Semantics-Aligned Interactively Trained Object-Navigator

    Authors: Ayush Shrivastava, Karthik Gopalakrishnan, Yang Liu, Robinson Piramuthu, Gokhan Tür, Devi Parikh, Dilek Hakkani-Tür

    Abstract: Interactive robots navigating photo-realistic environments need to be trained to effectively leverage and handle the dynamic nature of dialogue in addition to the challenges underlying vision-and-language navigation (VLN). In this paper, we present VISITRON, a multi-modal Transformer-based navigator better suited to the interactive regime inherent to Cooperative Vision-and-Dialog Navigation (CVDN)…

    Submitted 15 March, 2022; v1 submitted 24 May, 2021; originally announced May 2021.

    Comments: Accepted at Findings of the Annual Meeting of the Association for Computational Linguistics (ACL) 2022, previous version accepted at Visually Grounded Interaction and Language (ViGIL) Workshop at NAACL 2021

    ACM Class: I.2.9

  48. arXiv:2105.05913  [pdf, other]

    cs.CL

    Go Beyond Plain Fine-tuning: Improving Pretrained Models for Social Commonsense

    Authors: Ting-Yun Chang, Yang Liu, Karthik Gopalakrishnan, Behnam Hedayatnia, Pei Zhou, Dilek Hakkani-Tur

    Abstract: Pretrained language models have demonstrated outstanding performance in many NLP tasks recently. However, their social intelligence, which requires commonsense reasoning about the current situation and mental states of others, is still developing. Towards improving language models' social intelligence, we focus on the Social IQA dataset, a task requiring social and emotional commonsense reasoning.…

    Submitted 12 May, 2021; originally announced May 2021.

    Comments: SLT 2021

  49. arXiv:2105.05457  [pdf, other]

    cs.CL

    Incorporating Commonsense Knowledge Graph in Pretrained Models for Social Commonsense Tasks

    Authors: Ting-Yun Chang, Yang Liu, Karthik Gopalakrishnan, Behnam Hedayatnia, Pei Zhou, Dilek Hakkani-Tur

    Abstract: Pretrained language models have excelled at many NLP tasks recently; however, their social intelligence is still unsatisfactory. To enable this, machines need to have a more general understanding of our complicated world and develop the ability to perform commonsense reasoning besides fitting the specific downstream tasks. External commonsense knowledge graphs (KGs), such as ConceptNet, provide ri…

    Submitted 12 May, 2021; originally announced May 2021.

    Comments: EMNLP2020 Workshop

  50. arXiv:2104.09088  [pdf, other]

    cs.CL cs.LG

    Alexa Conversations: An Extensible Data-driven Approach for Building Task-oriented Dialogue Systems

    Authors: Anish Acharya, Suranjit Adhikari, Sanchit Agarwal, Vincent Auvray, Nehal Belgamwar, Arijit Biswas, Shubhra Chandra, Tagyoung Chung, Maryam Fazel-Zarandi, Raefer Gabriel, Shuyang Gao, Rahul Goel, Dilek Hakkani-Tur, Jan Jezabek, Abhay Jha, Jiun-Yu Kao, Prakash Krishnan, Peter Ku, Anuj Goyal, Chien-Wei Lin, Qing Liu, Arindam Mandal, Angeliki Metallinou, Vishal Naik, Yi Pan, et al. (6 additional authors not shown)

    Abstract: Traditional goal-oriented dialogue systems rely on various components such as natural language understanding, dialogue state tracking, policy learning and response generation. Training each component requires annotations which are hard to obtain for every new domain, limiting scalability of such systems. Similarly, rule-based dialogue systems require extensive writing and maintenance of rules and…

    Submitted 19 April, 2021; originally announced April 2021.

    Journal ref: NAACL 2021 System Demonstrations Track