Skip to main content

Showing 1–3 of 3 results for author: Imbrasaite, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2308.15299  [pdf, other

    cs.CL

    TaskLAMA: Probing the Complex Task Understanding of Language Models

    Authors: Quan Yuan, Mehran Kazemi, Xin Xu, Isaac Noble, Vaiva Imbrasaite, Deepak Ramachandran

    Abstract: Structured Complex Task Decomposition (SCTD) is the problem of breaking down a complex real-world task (such as planning a wedding) into a directed acyclic graph over individual steps that contribute to achieving the task, with edges specifying temporal dependencies between them. SCTD is an important component of assistive planning tools, and a challenge for commonsense reasoning systems. We probe… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

  2. arXiv:2306.07934  [pdf, other

    cs.CL cs.AI cs.LG

    BoardgameQA: A Dataset for Natural Language Reasoning with Contradictory Information

    Authors: Mehran Kazemi, Quan Yuan, Deepti Bhatia, Najoung Kim, Xin Xu, Vaiva Imbrasaite, Deepak Ramachandran

    Abstract: Automated reasoning with unstructured natural text is a key requirement for many potential applications of NLP and for develo** robust AI systems. Recently, Language Models (LMs) have demonstrated complex reasoning capacities even without any finetuning. However, existing evaluation for automated reasoning assumes access to a consistent and coherent set of information over which models reason. W… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

  3. arXiv:2305.14128  [pdf, other

    cs.CL cs.AI

    Dr.ICL: Demonstration-Retrieved In-context Learning

    Authors: Man Luo, Xin Xu, Zhuyun Dai, Panupong Pasupat, Mehran Kazemi, Chitta Baral, Vaiva Imbrasaite, Vincent Y Zhao

    Abstract: In-context learning (ICL), teaching a large language model (LLM) to perform a task with few-shot demonstrations rather than adjusting the model parameters, has emerged as a strong paradigm for using LLMs. While early studies primarily used a fixed or random set of demonstrations for all test queries, recent research suggests that retrieving semantically similar demonstrations to the input from a p… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.