Showing 1–27 of 27 results for author: Hedayatnia, B

Searching in archive cs.
  1. arXiv:2308.11995  [pdf, other]

    cs.CL cs.AI

    Topical-Chat: Towards Knowledge-Grounded Open-Domain Conversations

    Authors: Karthik Gopalakrishnan, Behnam Hedayatnia, Qinlang Chen, Anna Gottardi, Sanjeev Kwatra, Anu Venkatesh, Raefer Gabriel, Dilek Hakkani-Tur

    Abstract: Building socialbots that can have deep, engaging open-domain conversations with humans is one of the grand challenges of artificial intelligence (AI). To this end, bots need to be able to leverage world knowledge spanning several domains effectively when conversing with humans who have their own world knowledge. Existing knowledge-grounded conversation datasets are primarily stylized with explicit…

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: arXiving an old paper accepted at INTERSPEECH 2019

  2. arXiv:2305.12091  [pdf, other]

    cs.CL

    "What do others think?": Task-Oriented Conversational Modeling with Subjective Knowledge

    Authors: Chao Zhao, Spandana Gella, Seokhwan Kim, Di Jin, Devamanyu Hazarika, Alexandros Papangelis, Behnam Hedayatnia, Mahdi Namazifar, Yang Liu, Dilek Hakkani-Tur

    Abstract: Task-oriented Dialogue (TOD) Systems aim to build dialogue systems that assist users in accomplishing specific goals, such as booking a hotel or a restaurant. Traditional TODs rely on domain-specific APIs/DBs or external factual knowledge to generate responses, which cannot accommodate subjective user requests (e.g., "Is the WIFI reliable?" or "Does the restaurant have a good atmosphere?"). To add…

    Submitted 2 October, 2023; v1 submitted 20 May, 2023; originally announced May 2023.

    Comments: SIGDIAL 2023

  3. arXiv:2212.10557  [pdf, other]

    cs.CL

    DialGuide: Aligning Dialogue Model Behavior with Developer Guidelines

    Authors: Prakhar Gupta, Yang Liu, Di Jin, Behnam Hedayatnia, Spandana Gella, Sijia Liu, Patrick Lange, Julia Hirschberg, Dilek Hakkani-Tur

    Abstract: Dialogue models are able to generate coherent and fluent responses, but they can still be challenging to control and may produce non-engaging, unsafe results. This unpredictability diminishes user trust and can hinder the use of the models in the real world. To address this, we introduce DialGuide, a novel framework for controlling dialogue model behavior using natural language rules, or guideline…

    Submitted 21 May, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

  4. arXiv:2208.04379  [pdf, other]

    cs.CL

    A Systematic Evaluation of Response Selection for Open Domain Dialogue

    Authors: Behnam Hedayatnia, Di Jin, Yang Liu, Dilek Hakkani-Tur

    Abstract: Recent progress on neural approaches for language processing has triggered a resurgence of interest on building intelligent open-domain chatbots. However, even the state-of-the-art neural chatbots cannot produce satisfying responses for every turn in a dialog. A practical solution is to generate multiple response candidates for the same context, and then perform response ranking/selection to deter…

    Submitted 8 August, 2022; originally announced August 2022.

    Comments: Accepted at SIGDial 2022. 14 pages, 9 figures, 2 tables

  5. arXiv:2206.04615  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur…

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  6. arXiv:2203.13927  [pdf, other]

    cs.CL

    What is wrong with you?: Leveraging User Sentiment for Automatic Dialog Evaluation

    Authors: Sarik Ghazarian, Behnam Hedayatnia, Alexandros Papangelis, Yang Liu, Dilek Hakkani-Tur

    Abstract: Accurate automatic evaluation metrics for open-domain dialogs are in high demand. Existing model-based metrics for system response evaluation are trained on human annotated data, which is cumbersome to collect. In this work, we propose to use information that can be automatically extracted from the next user utterance, such as its sentiment or whether the user explicitly ends the conversation, as…

    Submitted 25 March, 2022; originally announced March 2022.

    Comments: Accepted at ACL Findings 2022. 11 pages, 8 figures, 5 tables

  7. arXiv:2203.00763  [pdf, other]

    cs.CL

    Multi-Sentence Knowledge Selection in Open-Domain Dialogue

    Authors: Mihail Eric, Nicole Chartier, Behnam Hedayatnia, Karthik Gopalakrishnan, Pankaj Rajan, Yang Liu, Dilek Hakkani-Tur

    Abstract: Incorporating external knowledge sources effectively in conversations is a longstanding problem in open-domain dialogue research. The existing literature on open-domain knowledge selection is limited and makes certain brittle assumptions on knowledge sources to simplify the overall task (Dinan et al., 2019), such as the existence of a single relevant knowledge sentence per context. In this work, w…

    Submitted 4 October, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

    Comments: Accepted at INLG 2021. 11 pages, 5 tables, 8 figures

  8. arXiv:2111.08808  [pdf, other]

    cs.CL

    User Response and Sentiment Prediction for Automatic Dialogue Evaluation

    Authors: Sarik Ghazarian, Behnam Hedayatnia, Alexandros Papangelis, Yang Liu, Dilek Hakkani-Tur

    Abstract: Automatic evaluation is beneficial for open-domain dialog system development. However, standard word-overlap metrics (BLEU, ROUGE) do not correlate well with human judgements of open-domain dialog systems. In this work we propose to use the sentiment of the next user utterance for turn or dialog level evaluation. Specifically we propose three methods: one that predicts the next sentiment directly,…

    Submitted 16 February, 2022; v1 submitted 16 November, 2021; originally announced November 2021.

    Comments: Accepted at EMNLP 2021 Evaluations and Assessments of Neural Conversation Systems Workshop. 2 pages, 1 table

  9. arXiv:2110.08501  [pdf, other]

    cs.CL

    Think Before You Speak: Explicitly Generating Implicit Commonsense Knowledge for Response Generation

    Authors: Pei Zhou, Karthik Gopalakrishnan, Behnam Hedayatnia, Seokhwan Kim, Jay Pujara, Xiang Ren, Yang Liu, Dilek Hakkani-Tur

    Abstract: Implicit knowledge, such as common sense, is key to fluid human conversations. Current neural response generation (RG) models are trained to generate responses directly, omitting unstated implicit knowledge. In this paper, we present Think-Before-Speaking (TBS), a generative approach to first externalize implicit commonsense knowledge (think) and use this knowledge to generate responses (speak). W…

    Submitted 11 September, 2023; v1 submitted 16 October, 2021; originally announced October 2021.

    Comments: Accepted at ACL 2022 main conference. 16 pages, 9 figures, 9 tables

  10. arXiv:2110.05456  [pdf, other]

    cs.CL cs.AI

    Rome was built in 1776: A Case Study on Factual Correctness in Knowledge-Grounded Response Generation

    Authors: Sashank Santhanam, Behnam Hedayatnia, Spandana Gella, Aishwarya Padmakumar, Seokhwan Kim, Yang Liu, Dilek Hakkani-Tur

    Abstract: Recently neural response generation models have leveraged large pre-trained transformer models and knowledge snippets to generate relevant and informative responses. However, this does not guarantee that generated responses are factually correct. In this paper, we examine factual correctness in knowledge-grounded neural response generation models. We present a human annotation setup to identify th…

    Submitted 4 October, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

  11. arXiv:2109.13489  [pdf, other]

    cs.CL

    "How Robust r u?": Evaluating Task-Oriented Dialogue Systems on Spoken Conversations

    Authors: Seokhwan Kim, Yang Liu, Di Jin, Alexandros Papangelis, Karthik Gopalakrishnan, Behnam Hedayatnia, Dilek Hakkani-Tur

    Abstract: Most prior work in dialogue modeling has been on written conversations mostly because of existing data sets. However, written dialogues are not sufficient to fully capture the nature of spoken conversations as well as the potential speech recognition errors in practical spoken dialogue systems. This work presents a new benchmark on spoken task-oriented conversations, which is intended to study mul…

    Submitted 28 September, 2021; originally announced September 2021.

    Comments: To be presented at ASRU 2021

  12. arXiv:2109.06427  [pdf, ps, other]

    cs.CL

    Commonsense-Focused Dialogues for Response Generation: An Empirical Study

    Authors: Pei Zhou, Karthik Gopalakrishnan, Behnam Hedayatnia, Seokhwan Kim, Jay Pujara, Xiang Ren, Yang Liu, Dilek Hakkani-Tur

    Abstract: Smooth and effective communication requires the ability to perform latent or explicit commonsense inference. Prior commonsense reasoning benchmarks (such as SocialIQA and CommonsenseQA) mainly focus on the discriminative task of choosing the right answer from a set of candidates, and do not involve interactive language generation as in dialogue. Moreover, existing dialogue datasets do not explicit…

    Submitted 21 September, 2021; v1 submitted 14 September, 2021; originally announced September 2021.

    Comments: Accepted at SIGDIAL 2021. 12 pages, 5 tables

  13. arXiv:2105.05913  [pdf, other]

    cs.CL

    Go Beyond Plain Fine-tuning: Improving Pretrained Models for Social Commonsense

    Authors: Ting-Yun Chang, Yang Liu, Karthik Gopalakrishnan, Behnam Hedayatnia, Pei Zhou, Dilek Hakkani-Tur

    Abstract: Pretrained language models have demonstrated outstanding performance in many NLP tasks recently. However, their social intelligence, which requires commonsense reasoning about the current situation and mental states of others, is still developing. Towards improving language models' social intelligence, we focus on the Social IQA dataset, a task requiring social and emotional commonsense reasoning.…

    Submitted 12 May, 2021; originally announced May 2021.

    Comments: SLT 2021

  14. arXiv:2105.05457  [pdf, other]

    cs.CL

    Incorporating Commonsense Knowledge Graph in Pretrained Models for Social Commonsense Tasks

    Authors: Ting-Yun Chang, Yang Liu, Karthik Gopalakrishnan, Behnam Hedayatnia, Pei Zhou, Dilek Hakkani-Tur

    Abstract: Pretrained language models have excelled at many NLP tasks recently; however, their social intelligence is still unsatisfactory. To enable this, machines need to have a more general understanding of our complicated world and develop the ability to perform commonsense reasoning besides fitting the specific downstream tasks. External commonsense knowledge graphs (KGs), such as ConceptNet, provide ri…

    Submitted 12 May, 2021; originally announced May 2021.

    Comments: EMNLP2020 Workshop

  15. arXiv:2101.09276  [pdf, other]

    cs.CL

    Beyond Domain APIs: Task-oriented Conversational Modeling with Unstructured Knowledge Access Track in DSTC9

    Authors: Seokhwan Kim, Mihail Eric, Behnam Hedayatnia, Karthik Gopalakrishnan, Yang Liu, Chao-Wei Huang, Dilek Hakkani-Tur

    Abstract: Most prior work on task-oriented dialogue systems are restricted to a limited coverage of domain APIs, while users oftentimes have domain related requests that are not covered by the APIs. This challenge track aims to expand the coverage of task-oriented dialogue systems by incorporating external unstructured knowledge sources. We define three tasks: knowledge-seeking turn detection, knowledge sel…

    Submitted 3 February, 2021; v1 submitted 22 January, 2021; originally announced January 2021.

    Comments: To be presented at AAAI-21 DSTC9 Workshop. arXiv admin note: substantial text overlap with arXiv:2006.03533, arXiv:2011.06486

  16. arXiv:2011.06486  [pdf, ps, other]

    cs.CL

    Overview of the Ninth Dialog System Technology Challenge: DSTC9

    Authors: Chulaka Gunasekara, Seokhwan Kim, Luis Fernando D'Haro, Abhinav Rastogi, Yun-Nung Chen, Mihail Eric, Behnam Hedayatnia, Karthik Gopalakrishnan, Yang Liu, Chao-Wei Huang, Dilek Hakkani-Tür, Jinchao Li, Qi Zhu, Lingxiao Luo, Lars Liden, Kaili Huang, Shahin Shayandeh, Runze Liang, Baolin Peng, Zheng Zhang, Swadheen Shukla, Minlie Huang, Jianfeng Gao, Shikib Mehri, Yulan Feng, et al. (14 additional authors not shown)

    Abstract: This paper introduces the Ninth Dialog System Technology Challenge (DSTC-9). This edition of the DSTC focuses on applying end-to-end dialog technologies for four distinct tasks in dialog systems, namely, 1. Task-oriented dialog Modeling with unstructured knowledge access, 2. Multi-domain task-oriented dialog, 3. Interactive evaluation of dialog, and 4. Situated interactive multi-modal dialog. This…

    Submitted 12 November, 2020; originally announced November 2020.

  17. arXiv:2008.07683  [pdf, other]

    cs.CL cs.AI cs.LG

    Are Neural Open-Domain Dialog Systems Robust to Speech Recognition Errors in the Dialog History? An Empirical Study

    Authors: Karthik Gopalakrishnan, Behnam Hedayatnia, Longshaokan Wang, Yang Liu, Dilek Hakkani-Tur

    Abstract: Large end-to-end neural open-domain chatbots are becoming increasingly popular. However, research on building such chatbots has typically assumed that the user input is written in nature and it is not clear whether these chatbots would seamlessly integrate with automatic speech recognition (ASR) models to serve the speech modality. We aim to bring attention to this important question by empiricall…

    Submitted 17 August, 2020; originally announced August 2020.

    Comments: Accepted at INTERSPEECH 2020. For dataset, see https://github.com/alexa/Topical-Chat/tree/master/TopicalChatASR/

    ACM Class: I.2.7

  18. arXiv:2006.03533  [pdf, other]

    cs.CL cs.AI

    Beyond Domain APIs: Task-oriented Conversational Modeling with Unstructured Knowledge Access

    Authors: Seokhwan Kim, Mihail Eric, Karthik Gopalakrishnan, Behnam Hedayatnia, Yang Liu, Dilek Hakkani-Tur

    Abstract: Most prior work on task-oriented dialogue systems are restricted to a limited coverage of domain APIs, while users oftentimes have domain related requests that are not covered by the APIs. In this paper, we propose to expand coverage of task-oriented dialogue systems by incorporating external unstructured knowledge sources. We define three sub-tasks: knowledge-seeking turn detection, knowledge sel…

    Submitted 5 June, 2020; originally announced June 2020.

    Comments: To be presented at SIGDIAL 2020

  19. arXiv:2005.12529  [pdf, other]

    cs.AI cs.CL

    Policy-Driven Neural Response Generation for Knowledge-Grounded Dialogue Systems

    Authors: Behnam Hedayatnia, Karthik Gopalakrishnan, Seokhwan Kim, Yang Liu, Mihail Eric, Dilek Hakkani-Tur

    Abstract: Open-domain dialogue systems aim to generate relevant, informative and engaging responses. Seq2seq neural response generation approaches do not have explicit mechanisms to control the content or style of the generated response, and frequently result in uninformative utterances. In this paper, we propose using a dialogue policy to plan the content and style of target responses in the form of an act…

    Submitted 24 August, 2020; v1 submitted 26 May, 2020; originally announced May 2020.

    Comments: Link to public dataset

  20. arXiv:1904.13015  [pdf, other]

    cs.CL cs.AI cs.LG

    Towards Coherent and Engaging Spoken Dialog Response Generation Using Automatic Conversation Evaluators

    Authors: Sanghyun Yi, Rahul Goel, Chandra Khatri, Alessandra Cervone, Tagyoung Chung, Behnam Hedayatnia, Anu Venkatesh, Raefer Gabriel, Dilek Hakkani-Tur

    Abstract: Encoder-decoder based neural architectures serve as the basis of state-of-the-art approaches in end-to-end open domain dialog systems. Since most of such systems are trained with a maximum likelihood (MLE) objective they suffer from issues such as lack of generalizability and the generic response problem, i.e., a system response that can be an answer to a large number of user utterances, e.g., "Ma…

    Submitted 21 November, 2019; v1 submitted 29 April, 2019; originally announced April 2019.

  21. arXiv:1903.08097  [pdf, other]

    cs.CL

    Natural Language Generation at Scale: A Case Study for Open Domain Question Answering

    Authors: Alessandra Cervone, Chandra Khatri, Rahul Goel, Behnam Hedayatnia, Anu Venkatesh, Dilek Hakkani-Tur, Raefer Gabriel

    Abstract: Current approaches to Natural Language Generation (NLG) for dialog mainly focus on domain-specific, task-oriented applications (e.g. restaurant booking) using limited ontologies (up to 20 slot types), usually without considering the previous conversation context. Furthermore, these approaches require large amounts of data for each domain, and do not benefit from examples that may be available for…

    Submitted 23 September, 2019; v1 submitted 19 March, 2019; originally announced March 2019.

    Comments: Accepted to INLG 2019

  22. arXiv:1812.10757  [pdf]

    cs.CL cs.AI

    Advancing the State of the Art in Open Domain Dialog Systems through the Alexa Prize

    Authors: Chandra Khatri, Behnam Hedayatnia, Anu Venkatesh, Jeff Nunn, Yi Pan, Qing Liu, Han Song, Anna Gottardi, Sanjeev Kwatra, Sanju Pancholi, Ming Cheng, Qinglang Chen, Lauren Stubel, Karthik Gopalakrishnan, Kate Bland, Raefer Gabriel, Arindam Mandal, Dilek Hakkani-Tur, Gene Hwang, Nate Michel, Eric King, Rohit Prasad

    Abstract: Building open domain conversational systems that allow users to have engaging conversations on topics of their choice is a challenging task. Alexa Prize was launched in 2016 to tackle the problem of achieving natural, sustained, coherent and engaging open-domain dialogs. In the second iteration of the competition in 2018, university teams advanced the state of the art by using context in dialog mo…

    Submitted 27 December, 2018; originally announced December 2018.

    Comments: 2018 Alexa Prize Proceedings

  23. arXiv:1811.12900  [pdf, other]

    cs.CL

    Detecting Offensive Content in Open-domain Conversations using Two Stage Semi-supervision

    Authors: Chandra Khatri, Behnam Hedayatnia, Rahul Goel, Anushree Venkatesh, Raefer Gabriel, Arindam Mandal

    Abstract: As open-ended human-chatbot interaction becomes commonplace, sensitive content detection gains importance. In this work, we propose a two stage semi-supervised approach to bootstrap large-scale data for automatic sensitive language detection from publicly available web resources. We explore various data selection methods including 1) using a blacklist to rank online discussion forums by the level…

    Submitted 30 November, 2018; originally announced November 2018.

    Comments: NIPS CONVAI Workshop 2018

  24. arXiv:1810.08135  [pdf, other]

    cs.CL

    Contextual Topic Modeling For Dialog Systems

    Authors: Chandra Khatri, Rahul Goel, Behnam Hedayatnia, Angeliki Metallinou, Anushree Venkatesh, Raefer Gabriel, Arindam Mandal

    Abstract: Accurate prediction of conversation topics can be a valuable signal for creating coherent and engaging dialog systems. In this work, we focus on context-aware topic classification methods for identifying topics in free-form human-chatbot dialogs. We extend previous work on neural topic classification and unsupervised topic keyword detection by incorporating conversational context and dialog act fe…

    Submitted 18 October, 2018; v1 submitted 18 October, 2018; originally announced October 2018.

  25. Contextual Language Model Adaptation for Conversational Agents

    Authors: Anirudh Raju, Behnam Hedayatnia, Linda Liu, Ankur Gandhe, Chandra Khatri, Angeliki Metallinou, Anu Venkatesh, Ariya Rastrow

    Abstract: Statistical language models (LM) play a key role in Automatic Speech Recognition (ASR) systems used by conversational agents. These ASR systems should provide a high accuracy under a variety of speaking styles, domains, vocabulary and argots. In this paper, we present a DNN-based method to adapt the LM to each user-agent interaction based on generalized contextual information, by predicting an opt…

    Submitted 31 July, 2018; v1 submitted 26 June, 2018; originally announced June 2018.

    Comments: Interspeech 2018 (accepted)

    ACM Class: I.2.7

    Journal ref: Proc. Interspeech 2018, 3333-3337

  26. arXiv:1801.03625  [pdf, ps, other]

    cs.CL cs.AI cs.CY cs.HC cs.MA

    On Evaluating and Comparing Open Domain Dialog Systems

    Authors: Anu Venkatesh, Chandra Khatri, Ashwin Ram, Fenfei Guo, Raefer Gabriel, Ashish Nagar, Rohit Prasad, Ming Cheng, Behnam Hedayatnia, Angeliki Metallinou, Rahul Goel, Shaohua Yang, Anirudh Raju

    Abstract: Conversational agents are exploding in popularity. However, much work remains in the area of non goal-oriented conversations, despite significant growth in research interest over recent years. To advance the state of the art in conversational AI, Amazon launched the Alexa Prize, a 2.5-million dollar university competition where sixteen selected university teams built conversational agents to deliv…

    Submitted 26 December, 2018; v1 submitted 10 January, 2018; originally announced January 2018.

    Comments: 10 pages, 5 tables. NIPS 2017 Conversational AI workshop. http://alborz-geramifard.com/workshops/nips17-Conversational-AI/Main.html

    MSC Class: 97R40 ACM Class: I.2.7

    Journal ref: NIPS.Workshop.ConversationalAI 2017-12-08 http://alborz-geramifard.com/workshops/nips17-Conversational-AI/Main.html accessed 2018-01-01

  27. arXiv:1801.03604  [pdf]

    cs.AI cs.CL cs.CY cs.HC cs.MA

    Conversational AI: The Science Behind the Alexa Prize

    Authors: Ashwin Ram, Rohit Prasad, Chandra Khatri, Anu Venkatesh, Raefer Gabriel, Qing Liu, Jeff Nunn, Behnam Hedayatnia, Ming Cheng, Ashish Nagar, Eric King, Kate Bland, Amanda Wartick, Yi Pan, Han Song, Sk Jayadevan, Gene Hwang, Art Pettigrue

    Abstract: Conversational agents are exploding in popularity. However, much work remains in the area of social conversation as well as free-form conversation over a broad range of domains and topics. To advance the state of the art in conversational AI, Amazon launched the Alexa Prize, a 2.5-million-dollar university competition where sixteen selected university teams were challenged to build conversational…

    Submitted 10 January, 2018; originally announced January 2018.

    Comments: 18 pages, 5 figures, Alexa Prize Proceedings Paper (https://developer.amazon.com/alexaprize/proceedings), Alexa Prize University Competition to advance Conversational AI

    MSC Class: 97R40 ACM Class: I.2.7

    Journal ref: Alexa.Prize.Proceedings https://developer.amazon.com/alexaprize/proceedings accessed (2018)-01-01