Skip to main content

Showing 1–10 of 10 results for author: Oikarinen, T

.
  1. arXiv:2405.06855  [pdf, other

    cs.LG cs.CV

    Linear Explanations for Individual Neurons

    Authors: Tuomas Oikarinen, Tsui-Wei Weng

    Abstract: In recent years many methods have been developed to understand the internal workings of neural networks, often by describing the function of individual neurons in the model. However, these methods typically only focus on explaining the very highest activations of a neuron. In this paper we show this is not sufficient, and that the highest activation range is only responsible for a very small perce… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: Published in ICML 2024

  2. arXiv:2403.13771  [pdf, other

    cs.CV cs.LG

    Describe-and-Dissect: Interpreting Neurons in Vision Networks with Language Models

    Authors: Nicholas Bai, Rahul A. Iyer, Tuomas Oikarinen, Tsui-Wei Weng

    Abstract: In this paper, we propose Describe-and-Dissect (DnD), a novel method to describe the roles of hidden neurons in vision networks. DnD utilizes recent advancements in multimodal deep learning to produce complex natural language descriptions, without the need for labeled training data or a predefined set of concepts to choose from. Additionally, DnD is training-free, meaning we don't train any new mo… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  3. arXiv:2310.16332  [pdf, other

    cs.LG

    Corrupting Neuron Explanations of Deep Visual Features

    Authors: Divyansh Srivastava, Tuomas Oikarinen, Tsui-Wei Weng

    Abstract: The inability of DNNs to explain their black-box behavior has led to a recent surge of explainability methods. However, there are growing concerns that these explainability methods are not robust and trustworthy. In this work, we perform the first robustness analysis of Neuron Explanation Methods under a unified pipeline and show that these explanations can be significantly corrupted by random noi… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023, pp. 1877-1886

  4. arXiv:2310.06200  [pdf, other

    cs.CL cs.LG

    The Importance of Prompt Tuning for Automated Neuron Explanations

    Authors: Justin Lee, Tuomas Oikarinen, Arjun Chatha, Keng-Chi Chang, Yilan Chen, Tsui-Wei Weng

    Abstract: Recent advances have greatly increased the capabilities of large language models (LLMs), but our understanding of the models and their safety has not progressed as fast. In this paper we aim to understand LLMs deeper by studying their individual neurons. We build upon previous work showing large language models such as GPT-4 can be useful in explaining what each neuron in a language model does. Sp… ▽ More

    Submitted 11 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

  5. arXiv:2304.13346  [pdf, other

    cs.LG cs.CV

    Concept-Monitor: Understanding DNN training through individual neurons

    Authors: Mohammad Ali Khan, Tuomas Oikarinen, Tsui-Wei Weng

    Abstract: In this work, we propose a general framework called Concept-Monitor to help demystify the black-box DNN training processes automatically using a novel unified embedding space and concept diversity metric. Concept-Monitor enables human-interpretable visualization and indicators of the DNN training processes and facilitates transparency as well as deeper understanding on how DNNs develop along the d… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

  6. arXiv:2304.06129  [pdf, other

    cs.LG cs.CV

    Label-Free Concept Bottleneck Models

    Authors: Tuomas Oikarinen, Subhro Das, Lam M. Nguyen, Tsui-Wei Weng

    Abstract: Concept bottleneck models (CBM) are a popular way of creating more interpretable neural networks by having hidden layer neurons correspond to human-understandable concepts. However, existing CBMs and their variants have two crucial limitations: first, they need to collect labeled data for each of the predefined concepts, which is time consuming and labor intensive; second, the accuracy of a CBM is… ▽ More

    Submitted 5 June, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

    Comments: Published at ICLR 2023. New v2(5 June 2023): added crowdsourced human study in Appendix B

  7. arXiv:2204.10965  [pdf, other

    cs.CV cs.AI cs.LG

    CLIP-Dissect: Automatic Description of Neuron Representations in Deep Vision Networks

    Authors: Tuomas Oikarinen, Tsui-Wei Weng

    Abstract: In this paper, we propose CLIP-Dissect, a new technique to automatically describe the function of individual hidden neurons inside vision networks. CLIP-Dissect leverages recent advances in multimodal vision/language models to label internal neurons with open-ended concepts without the need for any labeled data or human examples. We show that CLIP-Dissect provides more accurate descriptions than e… ▽ More

    Submitted 5 June, 2023; v1 submitted 22 April, 2022; originally announced April 2022.

    Comments: Published in ICLR 2023 Conference (Spotlight). New v5(5 June 2023) - Added crowdsourced user study in Appendix B, not included in ICLR publication

  8. arXiv:2010.13668  [pdf, other

    cs.LG cs.AI

    GraphMDN: Leveraging graph structure and deep learning to solve inverse problems

    Authors: Tuomas P. Oikarinen, Daniel C. Hannah, Sohrob Kazerounian

    Abstract: The recent introduction of Graph Neural Networks (GNNs) and their growing popularity in the past few years has enabled the application of deep learning algorithms to non-Euclidean, graph-structured data. GNNs have achieved state-of-the-art results across an impressive array of graph-based machine learning problems. Nevertheless, despite their rapid pace of development, much of the work on GNNs has… ▽ More

    Submitted 26 October, 2020; originally announced October 2020.

  9. arXiv:2008.01976  [pdf, other

    cs.LG cs.AI cs.CR stat.ML

    Robust Deep Reinforcement Learning through Adversarial Loss

    Authors: Tuomas Oikarinen, Wang Zhang, Alexandre Megretski, Luca Daniel, Tsui-Wei Weng

    Abstract: Recent studies have shown that deep reinforcement learning agents are vulnerable to small adversarial perturbations on the agent's inputs, which raises concerns about deploying such agents in the real world. To address this issue, we propose RADIAL-RL, a principled framework to train reinforcement learning agents with improved robustness against $l_p$-norm bounded adversarial attacks. Our framewor… ▽ More

    Submitted 10 November, 2021; v1 submitted 5 August, 2020; originally announced August 2020.

  10. arXiv:1906.06151  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    Landslide Geohazard Assessment With Convolutional Neural Networks Using Sentinel-2 Imagery Data

    Authors: Silvia L. Ullo, Maximillian S. Langenkamp, Tuomas P. Oikarinen, Maria P. Del Rosso, Alessandro Sebastianelli, Federica Piccirillo, Stefania Sica

    Abstract: In this paper, the authors aim to combine the latest state of the art models in image recognition with the best publicly available satellite images to create a system for landslide risk mitigation. We focus first on landslide detection and further propose a similar system to be used for prediction. Such models are valuable as they could easily be scaled up to provide data for hazard evaluation, as… ▽ More

    Submitted 10 June, 2019; originally announced June 2019.

    Comments: 4 pages, 3 figures, 1 table, accepted to 2019 IEEE IGARSS Conference that will be held in Japan next July