Skip to main content

Showing 1–10 of 10 results for author: Wolfson, T

.
  1. arXiv:2406.10922  [pdf, other

    cs.CL cs.AI cs.DB

    Generating Tables from the Parametric Knowledge of Language Models

    Authors: Yevgeni Berkovitch, Oren Glickman, Amit Somech, Tomer Wolfson

    Abstract: We explore generating factual and accurate tables from the parametric knowledge of large language models (LLMs). While LLMs have demonstrated impressive capabilities in recreating knowledge bases and generating free-form text, we focus on generating structured tabular data, which is crucial in domains like finance and healthcare. We examine the table generation abilities of four state-of-the-art L… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  2. arXiv:2406.05107  [pdf, other

    cs.DB

    LINX: A Language Driven Generative System for Goal-Oriented Automated Data Exploration

    Authors: Tavor Lipman, Tova Milo, Amit Somech, Tomer Wolfson, Oz Zafar

    Abstract: Data exploration is a challenging process in which users examine a dataset by iteratively employing a series of queries. While in some cases the user explores a new dataset to become familiar with it, more often, the exploration process is conducted with a specific analysis goal or question in mind. To assist users in exploring a new dataset, Automated Data Exploration (ADE) systems have been devi… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  3. arXiv:2310.01558  [pdf, other

    cs.CL cs.AI

    Making Retrieval-Augmented Language Models Robust to Irrelevant Context

    Authors: Ori Yoran, Tomer Wolfson, Ori Ram, Jonathan Berant

    Abstract: Retrieval-augmented language models (RALMs) hold promise to produce language understanding systems that are are factual, efficient, and up-to-date. An important desideratum of RALMs, is that retrieved information helps model performance when it is relevant, and does not harm performance when it is not. This is particularly important in multi-hop reasoning scenarios, where misuse of irrelevant evid… ▽ More

    Submitted 5 May, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

  4. arXiv:2304.13007  [pdf, other

    cs.CL cs.AI

    Answering Questions by Meta-Reasoning over Multiple Chains of Thought

    Authors: Ori Yoran, Tomer Wolfson, Ben Bogin, Uri Katz, Daniel Deutch, Jonathan Berant

    Abstract: Modern systems for multi-hop question answering (QA) typically break questions into a sequence of reasoning steps, termed chain-of-thought (CoT), before arriving at a final answer. Often, multiple chains are sampled and aggregated through a voting mechanism over the final answers, but the intermediate steps themselves are discarded. While such approaches improve performance, they do not consider t… ▽ More

    Submitted 17 October, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: Accepted for publication in The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023). Author's final version

  5. arXiv:2205.12665  [pdf, other

    cs.CL

    QAMPARI: An Open-domain Question Answering Benchmark for Questions with Many Answers from Multiple Paragraphs

    Authors: Samuel Joseph Amouyal, Tomer Wolfson, Ohad Rubin, Ori Yoran, Jonathan Herzig, Jonathan Berant

    Abstract: Existing benchmarks for open-domain question answering (ODQA) typically focus on questions whose answers can be extracted from a single paragraph. By contrast, many natural questions, such as "What players were drafted by the Brooklyn Nets?" have a list of answers. Answering such questions requires retrieving and reading from many passages, in a large corpus. We introduce QAMPARI, an ODQA benchmar… ▽ More

    Submitted 29 May, 2023; v1 submitted 25 May, 2022; originally announced May 2022.

  6. arXiv:2112.06311  [pdf, other

    cs.CL cs.AI cs.DB

    Weakly Supervised Text-to-SQL Parsing through Question Decomposition

    Authors: Tomer Wolfson, Daniel Deutch, Jonathan Berant

    Abstract: Text-to-SQL parsers are crucial in enabling non-experts to effortlessly query relational data. Training such parsers, by contrast, generally requires expertise in annotating natural language (NL) utterances with corresponding SQL queries. In this work, we propose a weak supervision approach for training text-to-SQL parsers. We take advantage of the recently proposed question meaning representation… ▽ More

    Submitted 26 April, 2022; v1 submitted 12 December, 2021; originally announced December 2021.

    Comments: Accepted for publication in Findings of NAACL 2022. Author's final version

  7. arXiv:2107.13935  [pdf, other

    cs.CL

    Break, Perturb, Build: Automatic Perturbation of Reasoning Paths Through Question Decomposition

    Authors: Mor Geva, Tomer Wolfson, Jonathan Berant

    Abstract: Recent efforts to create challenge benchmarks that test the abilities of natural language understanding models have largely depended on human annotations. In this work, we introduce the "Break, Perturb, Build" (BPB) framework for automatic reasoning-oriented perturbation of question-answer pairs. BPB represents a question by decomposing it into the reasoning steps that are required to answer it, s… ▽ More

    Submitted 18 October, 2021; v1 submitted 29 July, 2021; originally announced July 2021.

    Comments: Accepted for publication in Transactions of the Association for Computational Linguistics (TACL), 2021. Author's final version

  8. arXiv:2005.00724  [pdf, other

    cs.CL cs.AI cs.CV cs.LG

    Obtaining Faithful Interpretations from Compositional Neural Networks

    Authors: Sanjay Subramanian, Ben Bogin, Nitish Gupta, Tomer Wolfson, Sameer Singh, Jonathan Berant, Matt Gardner

    Abstract: Neural module networks (NMNs) are a popular approach for modeling compositionality: they achieve high accuracy when applied to problems in language and vision, while reflecting the compositional structure of the problem in the network architecture. However, prior work implicitly assumed that the structure of the network modules, describing the abstract reasoning process, provides a faithful explan… ▽ More

    Submitted 8 September, 2020; v1 submitted 2 May, 2020; originally announced May 2020.

    Comments: ACL 2020; first three authors contributed equally

  9. arXiv:2001.11770  [pdf, other

    cs.CL

    Break It Down: A Question Understanding Benchmark

    Authors: Tomer Wolfson, Mor Geva, Ankit Gupta, Matt Gardner, Yoav Goldberg, Daniel Deutch, Jonathan Berant

    Abstract: Understanding natural language questions entails the ability to break down a question into the requisite steps for computing its answer. In this work, we introduce a Question Decomposition Meaning Representation (QDMR) for questions. QDMR constitutes the ordered list of steps, expressed through natural language, that are necessary for answering a question. We develop a crowdsourcing pipeline, show… ▽ More

    Submitted 31 January, 2020; originally announced January 2020.

    Comments: Accepted for publication in Transactions of the Association for Computational Linguistics (TACL), 2020. Author's final version

  10. arXiv:1808.04614  [pdf, other

    cs.CL cs.AI

    Explaining Queries over Web Tables to Non-Experts

    Authors: Jonathan Berant, Daniel Deutch, Amir Globerson, Tova Milo, Tomer Wolfson

    Abstract: Designing a reliable natural language (NL) interface for querying tables has been a longtime goal of researchers in both the data management and natural language processing (NLP) communities. Such an interface receives as input an NL question, translates it into a formal query, executes the query and returns the results. Errors in the translation process are not uncommon, and users typically strug… ▽ More

    Submitted 14 August, 2018; originally announced August 2018.

    Comments: Short paper version to appear in ICDE 2019