Skip to main content

Showing 1–9 of 9 results for author: Muttenthaler, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19087  [pdf, other

    cs.CV cs.AI cs.LG q-bio.QM

    Dimensions underlying the representational alignment of deep neural networks with humans

    Authors: Florian P. Mahner, Lukas Muttenthaler, Umut Güçlü, Martin N. Hebart

    Abstract: Determining the similarities and differences between humans and artificial intelligence is an important goal both in machine learning and cognitive neuroscience. However, similarities in representations only inform us about the degree of alignment, not the factors that determine it. Drawing upon recent developments in cognitive science, we propose a generic framework for yielding comparable repres… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2310.13018  [pdf, other

    q-bio.NC cs.AI cs.LG cs.NE

    Getting aligned on representational alignment

    Authors: Ilia Sucholutsky, Lukas Muttenthaler, Adrian Weller, Andi Peng, Andreea Bobu, Been Kim, Bradley C. Love, Erin Grant, Iris Groen, Jascha Achterberg, Joshua B. Tenenbaum, Katherine M. Collins, Katherine L. Hermann, Kerem Oktar, Klaus Greff, Martin N. Hebart, Nori Jacoby, Qiuyi Zhang, Raja Marjieh, Robert Geirhos, Sherol Chen, Simon Kornblith, Sunayana Rane, Talia Konkle, Thomas P. O'Connell , et al. (5 additional authors not shown)

    Abstract: Biological and artificial information processing systems form representations that they can use to categorize, reason, plan, navigate, and make decisions. How can we measure the extent to which the representations formed by these diverse systems agree? Do similarities in representations then translate into similar behavior? How can a system's representations be modified to better match those of an… ▽ More

    Submitted 2 November, 2023; v1 submitted 18 October, 2023; originally announced October 2023.

    Comments: Working paper, changes to be made in upcoming revisions

  3. arXiv:2307.02245  [pdf, other

    cs.LG cs.CV cs.IT

    Set Learning for Accurate and Calibrated Models

    Authors: Lukas Muttenthaler, Robert A. Vandermeulen, Qiuyi Zhang, Thomas Unterthiner, Klaus-Robert Müller

    Abstract: Model overconfidence and poor calibration are common in machine learning and difficult to account for when applying standard empirical risk minimization. In this work, we propose a novel method to alleviate these problems that we call odd-$k$-out learning (OKO), which minimizes the cross-entropy error for sets rather than for single examples. This naturally allows the model to capture correlations… ▽ More

    Submitted 12 February, 2024; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: Published as a conference paper at ICLR 2024

  4. arXiv:2306.04507  [pdf, other

    cs.CV cs.LG

    Improving neural network representations using human similarity judgments

    Authors: Lukas Muttenthaler, Lorenz Linhardt, Jonas Dippel, Robert A. Vandermeulen, Katherine Hermann, Andrew K. Lampinen, Simon Kornblith

    Abstract: Deep neural networks have reached human-level performance on many computer vision tasks. However, the objectives used to train these networks enforce only that similar images are embedded at similar locations in the representation space, and do not directly constrain the global structure of the resulting space. Here, we explore the impact of supervising this global structure by linearly aligning i… ▽ More

    Submitted 26 September, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: Published as a conference paper at NeurIPS 2023

  5. arXiv:2211.01201  [pdf, other

    cs.CV cs.AI cs.LG q-bio.NC

    Human alignment of neural network representations

    Authors: Lukas Muttenthaler, Jonas Dippel, Lorenz Linhardt, Robert A. Vandermeulen, Simon Kornblith

    Abstract: Today's computer vision models achieve human or near-human level performance across a wide variety of vision tasks. However, their architectures, data, and learning algorithms differ in numerous ways from those that give rise to human vision. In this paper, we investigate the factors that affect the alignment between the representations learned by neural networks and human mental representations i… ▽ More

    Submitted 3 April, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: Accepted for publication at ICLR 2023

  6. arXiv:2205.00756  [pdf, other

    cs.LG stat.AP stat.ML

    VICE: Variational Interpretable Concept Embeddings

    Authors: Lukas Muttenthaler, Charles Y. Zheng, Patrick McClure, Robert A. Vandermeulen, Martin N. Hebart, Francisco Pereira

    Abstract: A central goal in the cognitive sciences is the development of numerical models for mental representations of object concepts. This paper introduces Variational Interpretable Concept Embeddings (VICE), an approximate Bayesian method for embedding object concepts in a vector space using data collected from humans in a triplet odd-one-out task. VICE uses variational inference to obtain sparse, non-n… ▽ More

    Submitted 6 October, 2022; v1 submitted 2 May, 2022; originally announced May 2022.

    Comments: Accepted at NeurIPS 2022

  7. arXiv:2010.03222  [pdf, other

    cs.CL cs.AI

    Unsupervised Evaluation for Question Answering with Transformers

    Authors: Lukas Muttenthaler, Isabelle Augenstein, Johannes Bjerva

    Abstract: It is challenging to automatically evaluate the answer of a QA model at inference time. Although many models provide confidence scores, and simple heuristics can go a long way towards indicating answer correctness, such measures are heavily dataset-dependent and are unlikely to generalize. In this work, we begin by investigating the hidden representations of questions, answers, and contexts in tra… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: 8 pages, to be published in the Proceedings of the 2020 EMNLP Workshop BlackboxNLP: Analysing and Interpreting Neural Networks for NLP

  8. arXiv:2006.08342  [pdf, other

    cs.CL

    Subjective Question Answering: Deciphering the inner workings of Transformers in the realm of subjectivity

    Authors: Lukas Muttenthaler

    Abstract: Understanding subjectivity demands reasoning skills beyond the realm of common knowledge. It requires a machine learning model to process sentiment and to perform opinion mining. In this work, I've exploited a recently released dataset for span-selection Question Answering, namely SubjQA. SubjQA is the first QA dataset that contains questions that ask for subjective opinions corresponding to revie… ▽ More

    Submitted 14 October, 2020; v1 submitted 2 June, 2020; originally announced June 2020.

    Comments: 80 pages, Master's thesis in Computer Science (CS)

  9. arXiv:2006.05113  [pdf, other

    cs.CL cs.LG q-bio.NC

    Human brain activity for machine attention

    Authors: Lukas Muttenthaler, Nora Hollenstein, Maria Barrett

    Abstract: Cognitively inspired NLP leverages human-derived data to teach machines about language processing mechanisms. Recently, neural networks have been augmented with behavioral data to solve a range of NLP tasks spanning syntax and semantics. We are the first to exploit neuroscientific data, namely electroencephalography (EEG), to inform a neural attention model about language processing of the human b… ▽ More

    Submitted 2 October, 2020; v1 submitted 9 June, 2020; originally announced June 2020.