
Showing 1–4 of 4 results for author: Buschoff, L M S

  1. arXiv:2311.16093  [pdf, other]

    cs.LG

    Visual cognition in multimodal large language models

    Authors: Luca M. Schulze Buschoff, Elif Akata, Matthias Bethge, Eric Schulz

    Abstract: A chief goal of artificial intelligence is to build machines that think like people. Yet it has been argued that deep neural network architectures fail to accomplish this. Researchers have asserted these models' limitations in the domains of causal reasoning, intuitive physics, and intuitive psychology. Yet recent advancements, namely the rise of large language models, particularly those designed…

    Submitted 24 January, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: Changed title and main text

  2. arXiv:2310.19943  [pdf, other]

    cs.LG q-bio.NC

    The Acquisition of Physical Knowledge in Generative Neural Networks

    Authors: Luca M. Schulze Buschoff, Eric Schulz, Marcel Binz

    Abstract: As children grow older, they develop an intuitive understanding of the physical processes around them. Their physical understanding develops in stages, moving along developmental trajectories which have been mapped out extensively in previous empirical research. Here, we investigate how the learning trajectories of deep generative neural networks compare to children's developmental trajectories us…

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: Published as a conference paper at ICML 2023

  3. arXiv:2209.12344  [pdf, other]

    cs.LG cs.AI

    Stochastic Gradient Descent Captures How Children Learn About Physics

    Authors: Luca M. Schulze Buschoff, Eric Schulz, Marcel Binz

    Abstract: As children grow older, they develop an intuitive understanding of the physical processes around them. They move along developmental trajectories, which have been mapped out extensively in previous empirical research. We investigate how children's developmental trajectories compare to the learning trajectories of artificial systems. Specifically, we examine the idea that cognitive development resu…

    Submitted 25 September, 2022; originally announced September 2022.

    Comments: Submitted to SVRHM at NeurIPS 2022

  4. arXiv:2110.05922  [pdf, other]

    cs.CV cs.AI cs.LG q-bio.NC

    Trivial or impossible -- dichotomous data difficulty masks model differences (on ImageNet and beyond)

    Authors: Kristof Meding, Luca M. Schulze Buschoff, Robert Geirhos, Felix A. Wichmann

    Abstract: "The power of a generalization system follows directly from its biases" (Mitchell 1980). Today, CNNs are incredibly powerful generalisation systems -- but to what degree have we understood how their inductive bias influences model decisions? We here attempt to disentangle the various aspects that determine how a model decides. In particular, we ask: what makes one model decide differently from ano…

    Submitted 27 April, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: Published as a conference paper at ICLR 2022