Skip to main content

Showing 1–13 of 13 results for author: Lepori, M

.
  1. arXiv:2406.15955  [pdf, other

    cs.CV cs.AI

    Beyond the Doors of Perception: Vision Transformers Represent Relations Between Objects

    Authors: Michael A. Lepori, Alexa R. Tartaglini, Wai Keen Vong, Thomas Serre, Brenden M. Lake, Ellie Pavlick

    Abstract: Though vision transformers (ViTs) have achieved state-of-the-art performance in a variety of settings, they exhibit surprising failures when performing tasks involving visual relations. This begs the question: how do ViTs attempt to perform tasks that require computing visual relations between objects? Prior efforts to interpret ViTs tend to focus on characterizing relevant low-level visual featur… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

  2. arXiv:2406.12094  [pdf, other

    cs.CL cs.AI

    Who's asking? User personas and the mechanics of latent misalignment

    Authors: Asma Ghandeharioun, Ann Yuan, Marius Guerard, Emily Reif, Michael A. Lepori, Lucas Dixon

    Abstract: Despite investments in improving model safety, studies show that misaligned capabilities remain latent in safety-tuned models. In this work, we shed light on the mechanics of this phenomenon. First, we show that even when model generations are safe, harmful content can persist in hidden representations and can be extracted by decoding from earlier layers. Then, we show that whether the model divul… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2406.00053  [pdf, other

    cs.CL cs.LG

    Dual Process Learning: Controlling Use of In-Context vs. In-Weights Strategies with Weight Forgetting

    Authors: Suraj Anand, Michael A. Lepori, Jack Merullo, Ellie Pavlick

    Abstract: Language models have the ability to perform in-context learning (ICL), allowing them to flexibly adapt their behavior based on context. This contrasts with in-weights learning, where information is statically encoded in model parameters from iterated observations of the data. Despite this apparent ability to learn in-context, language models are known to struggle when faced with unseen or rarely s… ▽ More

    Submitted 1 July, 2024; v1 submitted 28 May, 2024; originally announced June 2024.

    Comments: 9 pages, 5 figures

  4. arXiv:2311.04354  [pdf, other

    cs.CL

    Uncovering Intermediate Variables in Transformers using Circuit Probing

    Authors: Michael A. Lepori, Thomas Serre, Ellie Pavlick

    Abstract: Neural network models have achieved high performance on a wide variety of complex tasks, but the algorithms that they implement are notoriously difficult to interpret. In order to understand these algorithms, it is often necessary to hypothesize intermediate variables involved in the network's computation. For example, does a language model depend on particular syntactic properties when generating… ▽ More

    Submitted 17 November, 2023; v1 submitted 7 November, 2023; originally announced November 2023.

  5. arXiv:2310.10899  [pdf, other

    cs.LG cs.AI

    Instilling Inductive Biases with Subnetworks

    Authors: Enyan Zhang, Michael A. Lepori, Ellie Pavlick

    Abstract: Despite the recent success of artificial neural networks on a variety of tasks, we have little knowledge or control over the exact solutions these models implement. Instilling inductive biases -- preferences for some solutions over others -- into these models is one promising path toward understanding and controlling their behavior. Much work has been done to study the inherent inductive biases of… ▽ More

    Submitted 31 January, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

  6. arXiv:2310.09612  [pdf, other

    cs.CV cs.AI

    Deep Neural Networks Can Learn Generalizable Same-Different Visual Relations

    Authors: Alexa R. Tartaglini, Sheridan Feucht, Michael A. Lepori, Wai Keen Vong, Charles Lovering, Brenden M. Lake, Ellie Pavlick

    Abstract: Although deep neural networks can achieve human-level performance on many object recognition benchmarks, prior work suggests that these same models fail to learn simple abstract relations, such as determining whether two objects are the same or different. Much of this prior work focuses on training convolutional neural networks to classify images of two same or two different abstract shapes, testi… ▽ More

    Submitted 14 October, 2023; originally announced October 2023.

  7. arXiv:2309.00244  [pdf, other

    cs.LG cs.CL

    NeuroSurgeon: A Toolkit for Subnetwork Analysis

    Authors: Michael A. Lepori, Ellie Pavlick, Thomas Serre

    Abstract: Despite recent advances in the field of explainability, much remains unknown about the algorithms that neural networks learn to represent. Recent work has attempted to understand trained models by decomposing them into functional circuits (Csordás et al., 2020; Lepori et al., 2023). To advance this research, we developed NeuroSurgeon, a python library that can be used to discover and manipulate su… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

  8. arXiv:2301.10884  [pdf, other

    cs.CL cs.AI

    Break It Down: Evidence for Structural Compositionality in Neural Networks

    Authors: Michael A. Lepori, Thomas Serre, Ellie Pavlick

    Abstract: Though modern neural networks have achieved impressive performance in both vision and language tasks, we know little about the functions that they implement. One possibility is that neural networks implicitly break down complex tasks into subroutines, implement modular solutions to these subroutines, and compose them into an overall solution to a task - a property we term structural compositionali… ▽ More

    Submitted 6 November, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

  9. Noise2Astro: Astronomical Image Denoising With Self-Supervised NeuralNetworks

    Authors: Yunchong Zhang, Brian Nord, Amanda Pagul, Michael Lepori

    Abstract: In observational astronomy, noise obscures signals of interest. Large-scale astronomical surveys are growing in size and complexity, which will produce more data and increase the workload of data processing. Develo** automated tools, such as convolutional neural networks (CNN), for denoising has become a promising area of research. We investigate the feasibility of CNN-based self-supervised lear… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

    Journal ref: Res. Notes AAS 6 187 (2022)

  10. arXiv:2011.12086  [pdf, other

    cs.CL

    Unequal Representations: Analyzing Intersectional Biases in Word Embeddings Using Representational Similarity Analysis

    Authors: Michael A. Lepori

    Abstract: We present a new approach for detecting human-like social biases in word embeddings using representational similarity analysis. Specifically, we probe contextualized and non-contextualized embeddings for evidence of intersectional biases against Black women. We show that these embeddings represent Black women as simultaneously less feminine than White women, and less Black than Black men. This fin… ▽ More

    Submitted 24 November, 2020; originally announced November 2020.

  11. arXiv:2011.12073  [pdf, other

    cs.CL

    Picking BERT's Brain: Probing for Linguistic Dependencies in Contextualized Embeddings Using Representational Similarity Analysis

    Authors: Michael A. Lepori, R. Thomas McCoy

    Abstract: As the name implies, contextualized representations of language are typically motivated by their ability to encode context. Which aspects of context are captured by such representations? We introduce an approach to address this question using Representational Similarity Analysis (RSA). As case studies, we investigate the degree to which a verb embedding encodes the verb's subject, a pronoun embedd… ▽ More

    Submitted 24 November, 2020; originally announced November 2020.

  12. arXiv:2005.00019  [pdf, other

    cs.CL

    Representations of Syntax [MASK] Useful: Effects of Constituency and Dependency Structure in Recursive LSTMs

    Authors: Michael A. Lepori, Tal Linzen, R. Thomas McCoy

    Abstract: Sequence-based neural networks show significant sensitivity to syntactic structure, but they still perform less well on syntactic tasks than tree-based networks. Such tree-based networks can be provided with a constituency parse, a dependency parse, or both. We evaluate which of these two representational schemes more effectively introduces biases for syntactic structure that increase performance… ▽ More

    Submitted 30 April, 2020; originally announced May 2020.

    Comments: To appear in Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL-2020)

  13. arXiv:2003.12362  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Can you hear me $\textit{now}$? Sensitive comparisons of human and machine perception

    Authors: Michael A Lepori, Chaz Firestone

    Abstract: The rise of machine-learning systems that process sensory input has brought with it a rise in comparisons between human and machine perception. But such comparisons face a challenge: Whereas machine perception of some stimulus can often be probed through direct and explicit measures, much of human perceptual knowledge is latent, incomplete, or unavailable for explicit report. Here, we explore how… ▽ More

    Submitted 2 August, 2022; v1 submitted 27 March, 2020; originally announced March 2020.

    Comments: 24 pages; 4 figures