Skip to main content

Showing 1–12 of 12 results for author: Holzenberger, N

.
  1. arXiv:2406.17186  [pdf, other

    cs.CL cs.CY

    CLERC: A Dataset for Legal Case Retrieval and Retrieval-Augmented Analysis Generation

    Authors: Abe Bohan Hou, Orion Weller, Guanghui Qin, Eugene Yang, Dawn Lawrie, Nils Holzenberger, Andrew Blair-Stanek, Benjamin Van Durme

    Abstract: Legal professionals need to write analyses that rely on citations to relevant precedents, i.e., previous case decisions. Intelligent systems assisting legal professionals in writing such documents provide great benefits but are challenging to design. Such systems need to help locate, summarize, and reason over salient precedents in order to be useful. To enable systems for such tasks, we work with… ▽ More

    Submitted 27 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2401.06715  [pdf, other

    cs.CL cs.AI

    Reframing Tax Law Entailment as Analogical Reasoning

    Authors: Xinrui Zou, Ming Zhang, Nathaniel Weir, Benjamin Van Durme, Nils Holzenberger

    Abstract: Statutory reasoning refers to the application of legislative provisions to a series of case facts described in natural language. We re-frame statutory reasoning as an analogy task, where each instance of the analogy task involves a combination of two instances of statutory reasoning. This increases the dataset size by two orders of magnitude, and introduces an element of interpretability. We show… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

  3. arXiv:2311.09693  [pdf, other

    cs.CL cs.AI

    BLT: Can Large Language Models Handle Basic Legal Text?

    Authors: Andrew Blair-Stanek, Nils Holzenberger, Benjamin Van Durme

    Abstract: We find that the best publicly available LLMs like GPT-4, Claude, and {PaLM 2} currently perform poorly at basic legal text handling. We introduce a benchmark consisting of tasks that lawyers and paralegals would expect LLMs to handle zero-shot, such as looking up the text at a line of a witness deposition or at a subsection of a contract. LLMs' poor performance on this benchmark casts into doubt… ▽ More

    Submitted 28 February, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    ACM Class: I.2.1; I.2.7; J.7

  4. arXiv:2309.09992  [pdf

    cs.AI cs.CL

    OpenAI Cribbed Our Tax Example, But Can GPT-4 Really Do Tax?

    Authors: Andrew Blair-Stanek, Nils Holzenberger, Benjamin Van Durme

    Abstract: The authors explain where OpenAI got the tax law example in its livestream demonstration of GPT-4, why GPT-4 got the wrong answer, and how it fails to reliably calculate taxes.

    Submitted 7 February, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: 5 pages

    ACM Class: I.2.7; I.2.0

    Journal ref: 180 TAX NOTES FEDERAL 1101 (AUG. 14, 2023)

  5. arXiv:2308.11462  [pdf, other

    cs.CL cs.AI cs.CY

    LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models

    Authors: Neel Guha, Julian Nyarko, Daniel E. Ho, Christopher RĂ©, Adam Chilton, Aditya Narayana, Alex Chohlas-Wood, Austin Peters, Brandon Waldon, Daniel N. Rockmore, Diego Zambrano, Dmitry Talisman, Enam Hoque, Faiz Surani, Frank Fagan, Galit Sarfaty, Gregory M. Dickinson, Haggai Porat, Jason Hegland, Jessica Wu, Joe Nudell, Joel Niklaus, John Nay, Jonathan H. Choi, Kevin Tobia , et al. (15 additional authors not shown)

    Abstract: The advent of large language models (LLMs) and their adoption by the legal community has given rise to the question: what types of legal reasoning can LLMs perform? To enable greater study of this question, we present LegalBench: a collaboratively constructed legal reasoning benchmark consisting of 162 tasks covering six different types of legal reasoning. LegalBench was built through an interdisc… ▽ More

    Submitted 20 August, 2023; originally announced August 2023.

    Comments: 143 pages, 79 tables, 4 figures

  6. arXiv:2302.06100  [pdf, other

    cs.CL cs.AI

    Can GPT-3 Perform Statutory Reasoning?

    Authors: Andrew Blair-Stanek, Nils Holzenberger, Benjamin Van Durme

    Abstract: Statutory reasoning is the task of reasoning with facts and statutes, which are rules written in natural language by a legislature. It is a basic legal skill. In this paper we explore the capabilities of the most capable GPT-3 model, text-davinci-003, on an established statutory-reasoning dataset called SARA. We consider a variety of approaches, including dynamic few-shot prompting, chain-of-thoug… ▽ More

    Submitted 10 May, 2023; v1 submitted 12 February, 2023; originally announced February 2023.

    Comments: 10 pages

  7. arXiv:2205.12643  [pdf, other

    cs.CL

    Asking the Right Questions in Low Resource Template Extraction

    Authors: Nils Holzenberger, Yunmo Chen, Benjamin Van Durme

    Abstract: Information Extraction (IE) researchers are map** tasks to Question Answering (QA) in order to leverage existing large QA resources, and thereby improve data efficiency. Especially in template extraction (TE), map** an ontology to a set of questions can be more time-efficient than collecting labeled examples. We ask whether end users of TE systems can design these questions, and whether it is… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

  8. arXiv:2105.07903  [pdf, other

    cs.CL

    Factoring Statutory Reasoning as Language Understanding Challenges

    Authors: Nils Holzenberger, Benjamin Van Durme

    Abstract: Statutory reasoning is the task of determining whether a legal statute, stated in natural language, applies to the text description of a case. Prior work introduced a resource that approached statutory reasoning as a monolithic textual entailment problem, with neural baselines performing nearly at-chance. To address this challenge, we decompose statutory reasoning into four types of language-under… ▽ More

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: 18 pages, 3 figures. To appear in ACL 2021

  9. arXiv:2104.08811  [pdf, other

    cs.CL cs.AI

    Human Schema Curation via Causal Association Rule Mining

    Authors: Noah Weber, Anton Belyy, Nils Holzenberger, Rachel Rudinger, Benjamin Van Durme

    Abstract: Event schemas are structured knowledge sources defining typical real-world scenarios (e.g., going to an airport). We present a framework for efficient human-in-the-loop construction of a schema library, based on a novel script induction system and a well-crafted interface that allows non-experts to "program" complex event structures. Associated with this work we release a schema library: a machine… ▽ More

    Submitted 23 May, 2022; v1 submitted 18 April, 2021; originally announced April 2021.

    Comments: 12 pages, 6 figures, 6 tables

  10. arXiv:2005.05257  [pdf, other

    cs.CL

    A Dataset for Statutory Reasoning in Tax Law Entailment and Question Answering

    Authors: Nils Holzenberger, Andrew Blair-Stanek, Benjamin Van Durme

    Abstract: Legislation can be viewed as a body of prescriptive rules expressed in natural language. The application of legislation to facts of a case we refer to as statutory reasoning, where those facts are also expressed in natural language. Computational statutory reasoning is distinct from most existing work in machine reading, in that much of the information needed for deciding a case is declared exactl… ▽ More

    Submitted 12 August, 2020; v1 submitted 11 May, 2020; originally announced May 2020.

  11. arXiv:1912.12766  [pdf, other

    cs.LG stat.ML

    Multiview Representation Learning for a Union of Subspaces

    Authors: Nils Holzenberger, Raman Arora

    Abstract: Canonical correlation analysis (CCA) is a popular technique for learning representations that are maximally correlated across multiple views in data. In this paper, we extend the CCA based framework for learning a multiview mixture model. We show that the proposed model and a set of simple heuristics yield improvements over standard CCA, as measured in terms of performance on downstream tasks. Our… ▽ More

    Submitted 29 December, 2019; originally announced December 2019.

  12. arXiv:1811.08890  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Learning from Multiview Correlations in Open-Domain Videos

    Authors: Nils Holzenberger, Shruti Palaskar, Pranava Madhyastha, Florian Metze, Raman Arora

    Abstract: An increasing number of datasets contain multiple views, such as video, sound and automatic captions. A basic challenge in representation learning is how to leverage multiple views to learn better representations. This is further complicated by the existence of a latent alignment between views, such as between speech and its transcription, and by the multitude of choices for the learning objective… ▽ More

    Submitted 1 March, 2019; v1 submitted 21 November, 2018; originally announced November 2018.