Showing 1–10 of 10 results for author: Scherrer, N

Searching in archive cs.
  1. arXiv:2404.12241  [pdf, other]

    cs.CL cs.AI

    Introducing v0.5 of the AI Safety Benchmark from MLCommons

    Authors: Bertie Vidgen, Adarsh Agrawal, Ahmed M. Ahmed, Victor Akinwande, Namir Al-Nuaimi, Najla Alfaraj, Elie Alhajjar, Lora Aroyo, Trupti Bavalatti, Max Bartolo, Borhane Blili-Hamelin, Kurt Bollacker, Rishi Bommasani, Marisa Ferrara Boston, Siméon Campos, Kal Chakra, Canyu Chen, Cody Coleman, Zacharie Delpierre Coudert, Leon Derczynski, Debojyoti Dutta, Ian Eisenberg, James Ezick, Heather Frase, Brian Fuller, et al. (75 additional authors not shown)

    Abstract: This paper introduces v0.5 of the AI Safety Benchmark, which has been created by the MLCommons AI Safety Working Group. The AI Safety Benchmark has been designed to assess the safety risks of AI systems that use chat-tuned language models. We introduce a principled approach to specifying and constructing the benchmark, which for v0.5 covers only a single use case (an adult chatting to a general-pu…

    Submitted 13 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  2. arXiv:2311.11944  [pdf, other]

    cs.CL cs.AI cs.CE stat.ML

    FinanceBench: A New Benchmark for Financial Question Answering

    Authors: Pranab Islam, Anand Kannappan, Douwe Kiela, Rebecca Qian, Nino Scherrer, Bertie Vidgen

    Abstract: FinanceBench is a first-of-its-kind test suite for evaluating the performance of LLMs on open book financial question answering (QA). It comprises 10,231 questions about publicly traded companies, with corresponding answers and evidence strings. The questions in FinanceBench are ecologically valid and cover a diverse set of scenarios. They are intended to be clear-cut and straightforward to answer…

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: Dataset is available at: https://huggingface.co/datasets/PatronusAI/financebench

  3. arXiv:2311.08370  [pdf, other]

    cs.CL

    SimpleSafetyTests: a Test Suite for Identifying Critical Safety Risks in Large Language Models

    Authors: Bertie Vidgen, Nino Scherrer, Hannah Rose Kirk, Rebecca Qian, Anand Kannappan, Scott A. Hale, Paul Röttger

    Abstract: The past year has seen rapid acceleration in the development of large language models (LLMs). However, without proper steering and safeguards, LLMs will readily follow malicious instructions, provide unsafe advice, and generate toxic content. We introduce SimpleSafetyTests (SST) as a new test suite for rapidly and systematically identifying such critical safety risks. The test suite comprises 100…

    Submitted 16 February, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

  4. arXiv:2309.05858  [pdf, other]

    cs.LG cs.AI

    Uncovering mesa-optimization algorithms in Transformers

    Authors: Johannes von Oswald, Eyvind Niklasson, Maximilian Schlegel, Seijin Kobayashi, Nicolas Zucchet, Nino Scherrer, Nolan Miller, Mark Sandler, Blaise Agüera y Arcas, Max Vladymyrov, Razvan Pascanu, João Sacramento

    Abstract: Transformers have become the dominant model in deep learning, but the reason for their superior performance is poorly understood. Here, we hypothesize that the strong performance of Transformers stems from an architectural bias towards mesa-optimization, a learned process running within the forward pass of a model consisting of the following two steps: (i) the construction of an internal learning…

    Submitted 11 September, 2023; originally announced September 2023.

  5. arXiv:2307.14324  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG

    Evaluating the Moral Beliefs Encoded in LLMs

    Authors: Nino Scherrer, Claudia Shi, Amir Feder, David M. Blei

    Abstract: This paper presents a case study on the design, administration, post-processing, and evaluation of surveys on large language models (LLMs). It comprises two components: (1) A statistical method for eliciting beliefs encoded in LLMs. We introduce statistical measures and evaluation metrics that quantify the probability of an LLM "making a choice", the associated uncertainty, and the consistency of…

    Submitted 26 July, 2023; originally announced July 2023.

  6. arXiv:2211.13715  [pdf, other]

    stat.ML cs.AI cs.LG stat.ME

    Trust Your $\nabla$: Gradient-based Intervention Targeting for Causal Discovery

    Authors: Mateusz Olko, Michał Zając, Aleksandra Nowak, Nino Scherrer, Yashas Annadani, Stefan Bauer, Łukasz Kuciński, Piotr Miłoś

    Abstract: Inferring causal structure from data is a challenging task of fundamental importance in science. Observational data are often insufficient to identify a system's causal structure uniquely. While conducting interventions (i.e., experiments) can improve the identifiability, such samples are usually challenging and expensive to obtain. Hence, experimental design approaches for causal discovery aim to…

    Submitted 3 April, 2024; v1 submitted 24 November, 2022; originally announced November 2022.

    Comments: Accepted to 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  7. arXiv:2211.03846  [pdf, other]

    cs.LG cs.MA stat.ME

    Federated Causal Discovery From Interventions

    Authors: Amin Abyaneh, Nino Scherrer, Patrick Schwab, Stefan Bauer, Bernhard Schölkopf, Arash Mehrjou

    Abstract: Causal discovery serves a pivotal role in mitigating model uncertainty through recovering the underlying causal mechanisms among variables. In many practical domains, such as healthcare, access to the data gathered by individual entities is limited, primarily for privacy and regulatory constraints. However, the majority of existing causal discovery methods require the data to be available in a cen…

    Submitted 11 February, 2024; v1 submitted 7 November, 2022; originally announced November 2022.

  8. arXiv:2206.04620  [pdf, other]

    cs.LG cs.AI stat.ML

    On the Generalization and Adaption Performance of Causal Models

    Authors: Nino Scherrer, Anirudh Goyal, Stefan Bauer, Yoshua Bengio, Nan Rosemary Ke

    Abstract: Learning models that offer robust out-of-distribution generalization and fast adaptation is a key challenge in modern machine learning. Modelling causal structure into neural networks holds the promise to accomplish robust zero and few-shot adaptation. Recent advances in differentiable causal discovery have proposed to factorize the data generating process into a set of modules, i.e. one module fo…

    Submitted 9 June, 2022; originally announced June 2022.

  9. arXiv:2109.02429  [pdf, other]

    stat.ML cs.LG

    Learning Neural Causal Models with Active Interventions

    Authors: Nino Scherrer, Olexa Bilaniuk, Yashas Annadani, Anirudh Goyal, Patrick Schwab, Bernhard Schölkopf, Michael C. Mozer, Yoshua Bengio, Stefan Bauer, Nan Rosemary Ke

    Abstract: Discovering causal structures from data is a challenging inference problem of fundamental importance in all areas of science. The appealing properties of neural networks have recently led to a surge of interest in differentiable neural network-based methods for learning causal structures from data. So far, differentiable causal discovery has focused on static datasets of observational or fixed int…

    Submitted 5 March, 2022; v1 submitted 6 September, 2021; originally announced September 2021.

  10. arXiv:2106.07635  [pdf, other]

    cs.LG cs.AI stat.ML

    Variational Causal Networks: Approximate Bayesian Inference over Causal Structures

    Authors: Yashas Annadani, Jonas Rothfuss, Alexandre Lacoste, Nino Scherrer, Anirudh Goyal, Yoshua Bengio, Stefan Bauer

    Abstract: Learning the causal structure that underlies data is a crucial step towards robust real-world decision making. The majority of existing work in causal inference focuses on determining a single directed acyclic graph (DAG) or a Markov equivalence class thereof. However, a crucial aspect to acting intelligently upon the knowledge about causal structure which has been inferred from finite data demand…

    Submitted 14 June, 2021; originally announced June 2021.

    Comments: 10 pages, 6 figures