Skip to main content

Showing 1–16 of 16 results for author: Binz, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.18225  [pdf, other

    cs.CL cs.AI cs.LG

    CogBench: a large language model walks into a psychology lab

    Authors: Julian Coda-Forno, Marcel Binz, Jane X. Wang, Eric Schulz

    Abstract: Large language models (LLMs) have significantly advanced the field of artificial intelligence. Yet, evaluating them comprehensively remains challenging. We argue that this is partly due to the predominant focus on performance metrics in most benchmarks. This paper introduces CogBench, a benchmark that includes ten behavioral metrics derived from seven cognitive psychology experiments. This novel a… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  2. arXiv:2402.03969  [pdf, other

    cs.LG

    In-context learning agents are asymmetric belief updaters

    Authors: Johannes A. Schubert, Akshay K. Jagadish, Marcel Binz, Eric Schulz

    Abstract: We study the in-context learning dynamics of large language models (LLMs) using three instrumental learning tasks adapted from cognitive psychology. We find that LLMs update their beliefs in an asymmetric manner and learn more from better-than-expected outcomes than from worse-than-expected ones. Furthermore, we show that this effect reverses when learning about counterfactual feedback and disappe… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  3. arXiv:2402.01821  [pdf, other

    cs.LG cs.AI

    Human-like Category Learning by Injecting Ecological Priors from Large Language Models into Neural Networks

    Authors: Akshay K. Jagadish, Julian Coda-Forno, Mirko Thalmann, Eric Schulz, Marcel Binz

    Abstract: Ecological rationality refers to the notion that humans are rational agents adapted to their environment. However, testing this theory remains challenging due to two reasons: the difficulty in defining what tasks are ecologically valid and building rational models for these tasks. In this work, we demonstrate that large language models can generate cognitive tasks, specifically category learning t… ▽ More

    Submitted 28 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 27 pages (9 pages of main text, 4 pages of references, and 14 pages of appendix), 13 figures, and 7 Tables

    Journal ref: Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024

  4. arXiv:2312.03759  [pdf, ps, other

    cs.CL cs.AI cs.CY cs.DL

    How should the advent of large language models affect the practice of science?

    Authors: Marcel Binz, Stephan Alaniz, Adina Roskies, Balazs Aczel, Carl T. Bergstrom, Colin Allen, Daniel Schad, Dirk Wulff, Jevin D. West, Qiong Zhang, Richard M. Shiffrin, Samuel J. Gershman, Ven Popov, Emily M. Bender, Marco Marelli, Matthew M. Botvinick, Zeynep Akata, Eric Schulz

    Abstract: Large language models (LLMs) are being increasingly incorporated into scientific workflows. However, we have yet to fully grasp the implications of this integration. How should the advent of large language models affect the practice of science? For this opinion piece, we have invited four diverse groups of scientists to reflect on this query, sharing their perspectives and engaging in debate. Schu… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

  5. arXiv:2310.19943  [pdf, other

    cs.LG q-bio.NC

    The Acquisition of Physical Knowledge in Generative Neural Networks

    Authors: Luca M. Schulze Buschoff, Eric Schulz, Marcel Binz

    Abstract: As children grow older, they develop an intuitive understanding of the physical processes around them. Their physical understanding develops in stages, moving along developmental trajectories which have been mapped out extensively in previous empirical research. Here, we investigate how the learning trajectories of deep generative neural networks compare to children's developmental trajectories us… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: Published as a conference paper at ICML 2023

  6. arXiv:2306.09377  [pdf, other

    cs.LG cs.AI cs.CV

    Language Aligned Visual Representations Predict Human Behavior in Naturalistic Learning Tasks

    Authors: Can Demircan, Tankred Saanum, Leonardo Pettini, Marcel Binz, Blazej M Baczkowski, Paula Kaanders, Christian F Doeller, Mona M Garvert, Eric Schulz

    Abstract: Humans possess the ability to identify and generalize relevant features of natural objects, which aids them in various situations. To investigate this phenomenon and determine the most effective representations for predicting human behavior, we conducted two experiments involving category learning and reward learning. Our experiments used realistic images as stimuli, and participants were tasked w… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

  7. arXiv:2306.03917  [pdf, other

    cs.CL cs.AI cs.LG

    Turning large language models into cognitive models

    Authors: Marcel Binz, Eric Schulz

    Abstract: Large language models are powerful systems that excel at many tasks, ranging from translation to mathematical reasoning. Yet, at the same time, these models often show unhuman-like characteristics. In the present paper, we address this gap and ask whether large language models can be turned into cognitive models. We find that -- after finetuning them on data from psychological experiments -- these… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

  8. arXiv:2305.17109  [pdf, other

    cs.LG

    Reinforcement Learning with Simple Sequence Priors

    Authors: Tankred Saanum, Noémi Éltető, Peter Dayan, Marcel Binz, Eric Schulz

    Abstract: Everything else being equal, simpler models should be preferred over more complex ones. In reinforcement learning (RL), simplicity is typically quantified on an action-by-action basis -- but this timescale ignores temporal regularities, like repetitions, often present in sequential strategies. We therefore propose an RL algorithm that learns to solve tasks with sequences of actions that are compre… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

  9. arXiv:2305.12907  [pdf, other

    cs.CL cs.AI cs.LG

    Meta-in-context learning in large language models

    Authors: Julian Coda-Forno, Marcel Binz, Zeynep Akata, Matthew Botvinick, Jane X. Wang, Eric Schulz

    Abstract: Large language models have shown tremendous performance in a variety of tasks. In-context learning -- the ability to improve at a task after being provided with a number of demonstrations -- is seen as one of the main contributors to their success. In the present paper, we demonstrate that the in-context learning abilities of large language models can be recursively improved via in-context learnin… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  10. arXiv:2304.11111  [pdf, other

    cs.CL cs.AI cs.LG

    Inducing anxiety in large language models increases exploration and bias

    Authors: Julian Coda-Forno, Kristin Witte, Akshay K. Jagadish, Marcel Binz, Zeynep Akata, Eric Schulz

    Abstract: Large language models are transforming research on machine learning while galvanizing public debates. Understanding not only when these models work well and succeed but also why they fail and misbehave is of great societal relevance. We propose to turn the lens of computational psychiatry, a framework used to computationally describe and modify aberrant behavior, to the outputs produced by these m… ▽ More

    Submitted 21 April, 2023; originally announced April 2023.

  11. arXiv:2304.06729  [pdf, other

    cs.AI cs.LG

    Meta-Learned Models of Cognition

    Authors: Marcel Binz, Ishita Dasgupta, Akshay Jagadish, Matthew Botvinick, Jane X. Wang, Eric Schulz

    Abstract: Meta-learning is a framework for learning learning algorithms through repeated interactions with an environment as opposed to designing them by hand. In recent years, this framework has established itself as a promising tool for building models of human cognition. Yet, a coherent research program around meta-learned models of cognition is still missing. The purpose of this article is to synthesize… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

  12. arXiv:2209.12344  [pdf, other

    cs.LG cs.AI

    Stochastic Gradient Descent Captures How Children Learn About Physics

    Authors: Luca M. Schulze Buschoff, Eric Schulz, Marcel Binz

    Abstract: As children grow older, they develop an intuitive understanding of the physical processes around them. They move along developmental trajectories, which have been mapped out extensively in previous empirical research. We investigate how children's developmental trajectories compare to the learning trajectories of artificial systems. Specifically, we examine the idea that cognitive development resu… ▽ More

    Submitted 25 September, 2022; originally announced September 2022.

    Comments: Submitted to SVRHM at NeurIPS 2022

  13. arXiv:2206.14576  [pdf, other

    cs.CL cs.AI cs.LG

    Using cognitive psychology to understand GPT-3

    Authors: Marcel Binz, Eric Schulz

    Abstract: We study GPT-3, a recent large language model, using tools from cognitive psychology. More specifically, we assess GPT-3's decision-making, information search, deliberation, and causal reasoning abilities on a battery of canonical experiments from the literature. We find that much of GPT-3's behavior is impressive: it solves vignette-based tasks similarly or better than human subjects, is able to… ▽ More

    Submitted 21 June, 2022; originally announced June 2022.

  14. arXiv:2201.11817  [pdf, other

    cs.LG

    Modeling Human Exploration Through Resource-Rational Reinforcement Learning

    Authors: Marcel Binz, Eric Schulz

    Abstract: Equip** artificial agents with useful exploration mechanisms remains a challenge to this day. Humans, on the other hand, seem to manage the trade-off between exploration and exploitation effortlessly. In the present article, we put forward the hypothesis that they accomplish this by making optimal use of limited computational resources. We study this hypothesis by meta-learning reinforcement lea… ▽ More

    Submitted 14 November, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

    Comments: NeurIPS 2022 final version

  15. arXiv:1902.07580  [pdf, other

    cs.LG stat.ML

    Where Do Human Heuristics Come From?

    Authors: Marcel Binz, Dominik Endres

    Abstract: Human decision-making deviates from the optimal solution, that maximizes cumulative rewards, in many situations. Here we approach this discrepancy from the perspective of bounded rationality and our goal is to provide a justification for such seemingly sub-optimal strategies. More specifically we investigate the hypothesis, that humans do not know optimal decision-making algorithms in advance, but… ▽ More

    Submitted 10 May, 2019; v1 submitted 20 February, 2019; originally announced February 2019.

    Comments: Final version for CogSci 2019

  16. arXiv:1902.07579  [pdf, other

    cs.LG stat.ML

    Emulating Human Developmental Stages with Bayesian Neural Networks

    Authors: Marcel Binz, Dominik Endres

    Abstract: We compare the acquisition of knowledge in humans and machines. Research from the field of developmental psychology indicates, that human-employed hypothesis are initially guided by simple rules, before evolving into more complex theories. This observation is shared across many tasks and domains. We investigate whether stages of development in artificial learning systems are based on the same char… ▽ More

    Submitted 20 February, 2019; originally announced February 2019.