Skip to main content

Showing 1–17 of 17 results for author: Crook, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.04735  [pdf, other

    cs.CV

    SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM

    Authors: Jielin Qiu, Andrea Madotto, Zhaojiang Lin, Paul A. Crook, Yifan Ethan Xu, Xin Luna Dong, Christos Faloutsos, Lei Li, Babak Damavandi, Seungwhan Moon

    Abstract: Vision-extended LLMs have made significant strides in Visual Question Answering (VQA). Despite these advancements, VLLMs still encounter substantial difficulties in handling queries involving long-tail entities, with a tendency to produce erroneous or hallucinated responses. In this work, we introduce a novel evaluative benchmark named \textbf{SnapNTell}, specifically tailored for entity-centric V… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  2. arXiv:2402.10466  [pdf, other

    cs.CL cs.AI

    Large Language Models as Zero-shot Dialogue State Tracker through Function Calling

    Authors: Zekun Li, Zhiyu Zoey Chen, Mike Ross, Patrick Huber, Seungwhan Moon, Zhaojiang Lin, Xin Luna Dong, Adithya Sagar, Xifeng Yan, Paul A. Crook

    Abstract: Large language models (LLMs) are increasingly prevalent in conversational systems due to their advanced understanding and generative capabilities in general contexts. However, their effectiveness in task-oriented dialogues (TOD), which requires not only response generation but also effective dialogue state tracking (DST) within specific tasks and domains, remains less satisfying. In this work, we… ▽ More

    Submitted 30 May, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: ACL 2024 Main. Code available at: https://github.com/facebookresearch/FnCTOD

  3. arXiv:2205.05589  [pdf, other

    cs.CL

    KETOD: Knowledge-Enriched Task-Oriented Dialogue

    Authors: Zhiyu Chen, Bing Liu, Seungwhan Moon, Chinnadhurai Sankar, Paul Crook, William Yang Wang

    Abstract: Existing studies in dialogue system research mostly treat task-oriented dialogue and chit-chat as separate domains. Towards building a human-like assistant that can converse naturally and seamlessly with users, it is important to build a dialogue system that conducts both types of conversations effectively. In this work, we investigate how task-oriented dialogue and knowledge-grounded chit-chat ca… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: NAACL 2022 Findings

  4. arXiv:2112.08351  [pdf, other

    cs.CL

    Database Search Results Disambiguation for Task-Oriented Dialog Systems

    Authors: Kun Qian, Ahmad Beirami, Satwik Kottur, Shahin Shayandeh, Paul Crook, Alborz Geramifard, Zhou Yu, Chinnadhurai Sankar

    Abstract: As task-oriented dialog systems are becoming increasingly popular in our lives, more realistic tasks have been proposed and explored. However, new practical challenges arise. For instance, current dialog systems cannot effectively handle multiple search results when querying a database, due to the lack of such scenarios in existing public datasets. In this paper, we propose Database Search Result… ▽ More

    Submitted 15 December, 2021; originally announced December 2021.

  5. arXiv:2110.06905  [pdf, other

    cs.CL

    Teaching Models new APIs: Domain-Agnostic Simulators for Task Oriented Dialogue

    Authors: Moya Chen, Paul A. Crook, Stephen Roller

    Abstract: We demonstrate that large language models are able to simulate Task Oriented Dialogues in novel domains, provided only with an API implementation and a list of goals. We show these simulations can formulate online, automatic metrics that correlate well with human evaluations. Furthermore, by checking for whether the User's goals are met, we can use simulation to repeatedly generate training data a… ▽ More

    Submitted 13 October, 2021; originally announced October 2021.

  6. arXiv:2109.04655  [pdf, other

    cs.CL

    Zero-Shot Dialogue State Tracking via Cross-Task Transfer

    Authors: Zhaojiang Lin, Bing Liu, Andrea Madotto, Seungwhan Moon, Paul Crook, Zhenpeng Zhou, Zhiguang Wang, Zhou Yu, Eunjoon Cho, Rajen Subba, Pascale Fung

    Abstract: Zero-shot transfer learning for dialogue state tracking (DST) enables us to handle a variety of task-oriented dialogue domains without the expense of collecting in-domain data. In this work, we propose to transfer the \textit{cross-task} knowledge from general question answering (QA) corpora for the zero-shot DST task. Specifically, we propose TransferQA, a transferable generative QA model that se… ▽ More

    Submitted 9 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021

  7. arXiv:2105.04222  [pdf, other

    cs.CL

    Leveraging Slot Descriptions for Zero-Shot Cross-Domain Dialogue State Tracking

    Authors: Zhaojiang Lin, Bing Liu, Seungwhan Moon, Paul Crook, Zhenpeng Zhou, Zhiguang Wang, Zhou Yu, Andrea Madotto, Eunjoon Cho, Rajen Subba

    Abstract: Zero-shot cross-domain dialogue state tracking (DST) enables us to handle task-oriented dialogue in unseen domains without the expense of collecting in-domain data. In this paper, we propose a slot description enhanced generative approach for zero-shot cross-domain DST. Specifically, our model first encodes dialogue context and slots with a pre-trained self-attentive encoder, and generates slot va… ▽ More

    Submitted 10 May, 2021; originally announced May 2021.

    Comments: NAACL 2021

  8. arXiv:2012.15504  [pdf, other

    cs.CL cs.AI

    Continual Learning in Task-Oriented Dialogue Systems

    Authors: Andrea Madotto, Zhaojiang Lin, Zhenpeng Zhou, Seungwhan Moon, Paul Crook, Bing Liu, Zhou Yu, Eunjoon Cho, Zhiguang Wang

    Abstract: Continual learning in task-oriented dialogue systems can allow us to add new domains and functionalities through time without incurring the high cost of a whole system retraining. In this paper, we propose a continual learning benchmark for task-oriented dialogue systems with 37 domains to be learned continuously in four settings, such as intent recognition, state tracking, natural language genera… ▽ More

    Submitted 31 December, 2020; originally announced December 2020.

    Comments: 9 pages

  9. arXiv:2011.06486  [pdf, ps, other

    cs.CL

    Overview of the Ninth Dialog System Technology Challenge: DSTC9

    Authors: Chulaka Gunasekara, Seokhwan Kim, Luis Fernando D'Haro, Abhinav Rastogi, Yun-Nung Chen, Mihail Eric, Behnam Hedayatnia, Karthik Gopalakrishnan, Yang Liu, Chao-Wei Huang, Dilek Hakkani-Tür, **chao Li, Qi Zhu, Lingxiao Luo, Lars Liden, Kaili Huang, Shahin Shayandeh, Runze Liang, Baolin Peng, Zheng Zhang, Swadheen Shukla, Minlie Huang, Jianfeng Gao, Shikib Mehri, Yulan Feng , et al. (14 additional authors not shown)

    Abstract: This paper introduces the Ninth Dialog System Technology Challenge (DSTC-9). This edition of the DSTC focuses on applying end-to-end dialog technologies for four distinct tasks in dialog systems, namely, 1. Task-oriented dialog Modeling with unstructured knowledge access, 2. Multi-domain task-oriented dialog, 3. Interactive evaluation of dialog, and 4. Situated interactive multi-modal dialog. This… ▽ More

    Submitted 12 November, 2020; originally announced November 2020.

  10. arXiv:2011.05457  [pdf, other

    cs.CL cs.AI

    Resource Constrained Dialog Policy Learning via Differentiable Inductive Logic Programming

    Authors: Zhenpeng Zhou, Ahmad Beirami, Paul Crook, Pararth Shah, Rajen Subba, Alborz Geramifard

    Abstract: Motivated by the needs of resource constrained dialog policy learning, we introduce dialog policy via differentiable inductive logic (DILOG). We explore the tasks of one-shot learning and zero-shot domain transfer with DILOG on SimDial and MultiWoZ. Using a single representative dialog from the restaurant domain, we train DILOG on the SimDial dataset and obtain 99+% in-domain test accuracy. We als… ▽ More

    Submitted 10 November, 2020; originally announced November 2020.

  11. arXiv:2010.12757  [pdf, other

    cs.CL

    Adding Chit-Chat to Enhance Task-Oriented Dialogues

    Authors: Kai Sun, Seungwhan Moon, Paul Crook, Stephen Roller, Becka Silvert, Bing Liu, Zhiguang Wang, Honglei Liu, Eunjoon Cho, Claire Cardie

    Abstract: Existing dialogue corpora and models are typically designed under two disjoint motives: while task-oriented systems focus on achieving functional goals (e.g., booking hotels), open-domain chatbots aim at making socially engaging conversations. In this work, we propose to integrate both types of systems by Adding Chit-Chat to ENhance Task-ORiented dialogues (ACCENTOR), with the goal of making virtu… ▽ More

    Submitted 1 May, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: To appear in NAACL-HLT 2021

  12. arXiv:2006.01460  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    Situated and Interactive Multimodal Conversations

    Authors: Seungwhan Moon, Satwik Kottur, Paul A. Crook, Ankita De, Shivani Poddar, Theodore Levin, David Whitney, Daniel Difranco, Ahmad Beirami, Eunjoon Cho, Rajen Subba, Alborz Geramifard

    Abstract: Next generation virtual assistants are envisioned to handle multimodal inputs (e.g., vision, memories of previous interactions, in addition to the user's utterances), and perform multimodal actions (e.g., displaying a route in addition to generating the system's utterance). We introduce Situated Interactive MultiModal Conversations (SIMMC) as a new direction aimed at training agents that take mult… ▽ More

    Submitted 10 November, 2020; v1 submitted 2 June, 2020; originally announced June 2020.

    Comments: 20 pages, 5 figures, 11 tables, accepted to COLING 2020

  13. Information Seeking in the Spirit of Learning: a Dataset for Conversational Curiosity

    Authors: Pedro Rodriguez, Paul Crook, Seungwhan Moon, Zhiguang Wang

    Abstract: Open-ended human learning and information-seeking are increasingly mediated by digital assistants. However, such systems often ignore the user's pre-existing knowledge. Assuming a correlation between engagement and user responses such as "liking" messages or asking followup questions, we design a Wizard-of-Oz dialog task that tests the hypothesis that engagement increases when users are presented… ▽ More

    Submitted 9 November, 2020; v1 submitted 30 April, 2020; originally announced May 2020.

    Comments: EMNLP 2020: https://www.aclweb.org/anthology/2020.emnlp-main.655/

  14. arXiv:1911.02690  [pdf, other

    cs.CL cs.AI

    SIMMC: Situated Interactive Multi-Modal Conversational Data Collection And Evaluation Platform

    Authors: Paul A. Crook, Shivani Poddar, Ankita De, Semir Shafi, David Whitney, Alborz Geramifard, Rajen Subba

    Abstract: As digital virtual assistants become ubiquitous, it becomes increasingly important to understand the situated behaviour of users as they interact with these assistants. To this end, we introduce SIMMC, an extension to ParlAI for multi-modal conversational data collection and system evaluation. SIMMC simulates an immersive setup, where crowd workers are able to interact with environments constructe… ▽ More

    Submitted 30 January, 2020; v1 submitted 6 November, 2019; originally announced November 2019.

    Comments: ASRU 2019 (demonstration)

  15. arXiv:1909.03922  [pdf, other

    cs.CL cs.AI

    Recommendation as a Communication Game: Self-Supervised Bot-Play for Goal-oriented Dialogue

    Authors: Dongyeop Kang, Anusha Balakrishnan, Pararth Shah, Paul Crook, Y-Lan Boureau, Jason Weston

    Abstract: Traditional recommendation systems produce static rather than interactive recommendations invariant to a user's specific requests, clarifications, or current mood, and can suffer from the cold-start problem if their tastes are unknown. These issues can be alleviated by treating recommendation as an interactive dialogue task instead, where an expert recommender can sequentially ask about someone's… ▽ More

    Submitted 9 September, 2019; originally announced September 2019.

    Comments: EMNLP 2019

  16. arXiv:1612.00913  [pdf, ps, other

    cs.CL cs.LG

    End-to-End Joint Learning of Natural Language Understanding and Dialogue Manager

    Authors: Xuesong Yang, Yun-Nung Chen, Dilek Hakkani-Tur, Paul Crook, Xiujun Li, Jianfeng Gao, Li Deng

    Abstract: Natural language understanding and dialogue policy learning are both essential in conversational systems that predict the next system actions in response to a current user utterance. Conventional approaches aggregate separate models of natural language understanding (NLU) and system action prediction (SAP) as a pipeline that is sensitive to noisy outputs of error-prone NLU. To address the issues,… ▽ More

    Submitted 4 January, 2017; v1 submitted 2 December, 2016; originally announced December 2016.

    Comments: Accepted in The 42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2017)

  17. arXiv:1508.00986  [pdf, other

    cs.AI

    On the Linear Belief Compression of POMDPs: A re-examination of current methods

    Authors: Zhuoran Wang, Paul A. Crook, Wenshuo Tang, Oliver Lemon

    Abstract: Belief compression improves the tractability of large-scale partially observable Markov decision processes (POMDPs) by finding projections from high-dimensional belief space onto low-dimensional approximations, where solving to obtain action selection policies requires fewer computations. This paper develops a unified theoretical framework to analyse three existing linear belief compression approa… ▽ More

    Submitted 5 August, 2015; originally announced August 2015.