Skip to main content

Showing 1–50 of 77 results for author: Locatello, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.15057  [pdf, other

    cs.LG

    Latent Space Translation via Inverse Relative Projection

    Authors: Valentino Maiorca, Luca Moschella, Marco Fumero, Francesco Locatello, Emanuele Rodolà

    Abstract: The emergence of similar representations between independently trained neural models has sparked significant interest in the representation learning community, leading to the development of various methods to obtain communication between latent spaces. "Latent space communication" can be achieved in two ways: i) by independently map** the original spaces to a shared or relative one; ii) by direc… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2311.00664, arXiv:2406.11014

  2. arXiv:2406.14183  [pdf, other

    cs.LG

    Latent Functional Maps

    Authors: Marco Fumero, Marco Pegoraro, Valentino Maiorca, Francesco Locatello, Emanuele Rodolà

    Abstract: Neural models learn data representations that lie on low-dimensional manifolds, yet modeling the relation between these representational spaces is an ongoing challenge. By integrating spectral geometry principles into neural modeling, we show that this problem can be better addressed in the functional domain, mitigating complexity, while enhancing interpretability and performances on downstream ta… ▽ More

    Submitted 21 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  3. arXiv:2406.13507  [pdf, other

    cs.LG

    Scalable unsupervised alignment of general metric and non-metric structures

    Authors: Sanketh Vedula, Valentino Maiorca, Lorenzo Basile, Francesco Locatello, Alex Bronstein

    Abstract: Aligning data from different domains is a fundamental problem in machine learning with broad applications across very different areas, most notably aligning experimental readouts in single-cell multiomics. Mathematically, this problem can be formulated as the minimization of disagreement of pair-wise quantities such as distances and is related to the Gromov-Hausdorff and Gromov-Wasserstein distanc… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  4. arXiv:2406.09196  [pdf, other

    cs.CV cs.LG

    Adaptive Slot Attention: Object Discovery with Dynamic Slot Number

    Authors: Ke Fan, Zechen Bai, Tianjun Xiao, Tong He, Max Horn, Yanwei Fu, Francesco Locatello, Zheng Zhang

    Abstract: Object-centric learning (OCL) extracts the representation of objects with slots, offering an exceptional blend of flexibility and interpretability for abstracting low-level perceptual features. A widely adopted method within OCL is slot attention, which utilizes attention mechanisms to iteratively refine slot representations. However, a major drawback of most object-centric models, including slot… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: CVPR 2024

  5. arXiv:2406.07141  [pdf, other

    cs.LG cs.AI

    Identifiable Object-Centric Representation Learning via Probabilistic Slot Attention

    Authors: Avinash Kori, Francesco Locatello, Ainkaran Santhirasekaram, Francesca Toni, Ben Glocker, Fabio De Sousa Ribeiro

    Abstract: Learning modular object-centric representations is crucial for systematic generalization. Existing methods show promising object-binding capabilities empirically, but theoretical identifiability guarantees remain relatively underdeveloped. Understanding when object-centric representations can theoretically be identified is crucial for scaling slot-based methods to high-dimensional images with corr… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  6. arXiv:2405.17151  [pdf, other

    cs.LG

    Smoke and Mirrors in Causal Downstream Tasks

    Authors: Riccardo Cadei, Lukas Lindorfer, Sylvia Cremer, Cordelia Schmid, Francesco Locatello

    Abstract: Machine Learning and AI have the potential to transform data-driven scientific discovery, enabling accurate predictions for several scientific phenomena. As many scientific questions are inherently causal, this paper looks at the causal inference task of treatment effect estimation, where we assume binary effects that are recorded as high-dimensional images in a Randomized Controlled Trial (RCT).… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  7. arXiv:2405.16924  [pdf, other

    cs.LG stat.ML

    Demystifying amortized causal discovery with transformers

    Authors: Francesco Montagna, Max Cairney-Leeming, Dhanya Sridhar, Francesco Locatello

    Abstract: Supervised learning approaches for causal discovery from observational data often achieve competitive performance despite seemingly avoiding explicit assumptions that traditional methods make for identifiability. In this work, we investigate CSIvA (Ke et al., 2023), a transformer-based model promising to train on synthetic data and transfer to real data. First, we bridge the gap with existing iden… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  8. arXiv:2405.13888  [pdf, other

    cs.LG stat.ML

    Marrying Causal Representation Learning with Dynamical Systems for Science

    Authors: Dingling Yao, Caroline Muller, Francesco Locatello

    Abstract: Causal representation learning promises to extend causal models to hidden causal variables from raw entangled measurements. However, most progress has focused on proving identifiability results in different settings, and we are not aware of any successful real-world application. At the same time, the field of dynamical systems benefited from deep learning and scaled to countless applications but d… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 21 pages, 8 figures, 6 tables

  9. arXiv:2404.03392  [pdf, other

    cs.CV

    Two Tricks to Improve Unsupervised Segmentation Learning

    Authors: Alp Eren Sari, Francesco Locatello, Paolo Favaro

    Abstract: We present two practical improvement techniques for unsupervised segmentation learning. These techniques address limitations in the resolution and accuracy of predicted segmentation maps of recent state-of-the-art methods. Firstly, we leverage image post-processing techniques such as guided filtering to refine the output masks, improving accuracy while avoiding substantial computational costs. Sec… ▽ More

    Submitted 8 April, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

  10. arXiv:2403.08335  [pdf, other

    cs.LG cs.AI stat.ML

    A Sparsity Principle for Partially Observable Causal Representation Learning

    Authors: Danru Xu, Dingling Yao, Sébastien Lachapelle, Perouz Taslakian, Julius von Kügelgen, Francesco Locatello, Sara Magliacane

    Abstract: Causal representation learning aims at identifying high-level causal variables from perceptual data. Most methods assume that all latent causal variables are captured in the high-dimensional observations. We instead consider a partially observed setting, in which each measurement only provides information about a subset of the underlying causal state. Prior work has studied this setting with multi… ▽ More

    Submitted 15 June, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: 45 pages, 32 figures, 16 tables

  11. arXiv:2402.13368  [pdf, other

    cs.LG cs.CV

    Unsupervised Concept Discovery Mitigates Spurious Correlations

    Authors: Md Rifat Arefin, Yan Zhang, Aristide Baratin, Francesco Locatello, Irina Rish, Dianbo Liu, Kenji Kawaguchi

    Abstract: Models prone to spurious correlations in training data often produce brittle predictions and introduce unintended biases. Addressing this challenge typically involves methods relying on prior knowledge and group annotation to remove spurious correlations, which may not be readily available in many applications. In this paper, we establish a novel connection between unsupervised object-centric lear… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  12. arXiv:2402.13077  [pdf, other

    cs.LG cs.AI cs.NE

    Mechanistic Neural Networks for Scientific Machine Learning

    Authors: Adeel Pervez, Francesco Locatello, Efstratios Gavves

    Abstract: This paper presents Mechanistic Neural Networks, a neural network design for machine learning applications in the sciences. It incorporates a new Mechanistic Block in standard architectures to explicitly learn governing differential equations as representations, revealing the underlying dynamics of data and enhancing interpretability and efficiency in data modeling. Central to our approach is a no… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  13. arXiv:2402.05627  [pdf, other

    cs.LG cs.AI cs.CV q-bio.NC

    Binding Dynamics in Rotating Features

    Authors: Sindy Löwe, Francesco Locatello, Max Welling

    Abstract: In human cognition, the binding problem describes the open question of how the brain flexibly integrates diverse information into cohesive object representations. Analogously, in machine learning, there is a pursuit for models capable of strong generalization and reasoning by learning object-centric representations in an unsupervised manner. Drawing from neuroscientific theories, Rotating Features… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  14. arXiv:2311.04056  [pdf, other

    cs.LG cs.AI

    Multi-View Causal Representation Learning with Partial Observability

    Authors: Dingling Yao, Danru Xu, Sébastien Lachapelle, Sara Magliacane, Perouz Taslakian, Georg Martius, Julius von Kügelgen, Francesco Locatello

    Abstract: We present a unified framework for studying the identifiability of representations learned from simultaneously observed views, such as different data modalities. We allow a partially observed setting in which each view constitutes a nonlinear mixture of a subset of underlying latent variables, which can be causally related. We prove that the information shared across all subsets of any number of v… ▽ More

    Submitted 8 March, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: 28 pages, 10 figures, 11 tables

  15. arXiv:2311.00664  [pdf, other

    cs.LG

    Latent Space Translation via Semantic Alignment

    Authors: Valentino Maiorca, Luca Moschella, Antonio Norelli, Marco Fumero, Francesco Locatello, Emanuele Rodolà

    Abstract: While different neural models often exhibit latent spaces that are alike when exposed to semantically related data, this intrinsic similarity is not always immediately discernible. Towards a better understanding of this phenomenon, our work shows how representations learned from these neural modules can be translated between different pre-trained networks via simpler transformations than previousl… ▽ More

    Submitted 11 February, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: Accepted at NeurIPS 2023. 21 pages, 13 figures, 8 tables

  16. arXiv:2310.18123  [pdf, ps, other

    cs.LG stat.ML

    Sample Complexity Bounds for Score-Matching: Causal Discovery and Generative Modeling

    Authors: Zhenyu Zhu, Francesco Locatello, Volkan Cevher

    Abstract: This paper provides statistical sample complexity bounds for score-matching and its applications in causal discovery. We demonstrate that accurate estimation of the score function is achievable by training a standard deep ReLU neural network using stochastic gradient descent. We establish bounds on the error rate of recovering causal relationships using the score-matching-based causal discovery me… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: Accepted in NeurIPS 2023

  17. arXiv:2310.14246  [pdf, other

    stat.ME cs.LG

    Shortcuts for causal discovery of nonlinear models by score matching

    Authors: Francesco Montagna, Nicoletta Noceti, Lorenzo Rosasco, Francesco Locatello

    Abstract: The use of simulated data in the field of causal discovery is ubiquitous due to the scarcity of annotated real data. Recently, Reisach et al., 2021 highlighted the emergence of patterns in simulated linear data, which displays increasing marginal variance in the casual direction. As an ablation in their experiments, Montagna et al., 2023 found that similar patterns may emerge in nonlinear models f… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

  18. arXiv:2310.13387  [pdf, other

    stat.ME cs.LG

    Assumption violations in causal discovery and the robustness of score matching

    Authors: Francesco Montagna, Atalanti A. Mastakouri, Elias Eulig, Nicoletta Noceti, Lorenzo Rosasco, Dominik Janzing, Bryon Aragam, Francesco Locatello

    Abstract: When domain knowledge is limited and experimentation is restricted by ethical, financial, or time constraints, practitioners turn to observational causal discovery methods to recover the causal structure, exploiting the statistical properties of their data. Because causal discovery without further assumptions is an ill-posed problem, each algorithm comes with its own set of usually untestable assu… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

  19. arXiv:2309.09858  [pdf, other

    cs.CV

    Unsupervised Open-Vocabulary Object Localization in Videos

    Authors: Ke Fan, Zechen Bai, Tianjun Xiao, Dominik Zietlow, Max Horn, Zixu Zhao, Carl-Johann Simon-Gabriel, Mike Zheng Shou, Francesco Locatello, Bernt Schiele, Thomas Brox, Zheng Zhang, Yanwei Fu, Tong He

    Abstract: In this paper, we show that recent advances in video representation learning and pre-trained vision-language models allow for substantial improvements in self-supervised video object localization. We propose a method that first localizes objects in videos via an object-centric approach with slot attention and then assigns text to the obtained slots. The latter is achieved by an unsupervised way to… ▽ More

    Submitted 26 June, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: Accepted by ICCV 2023; Presented on CVPR 2024 Workshop CORR; Project Page:https://github.com/amazon-science/object-centric-vol

  20. arXiv:2309.00233  [pdf, other

    cs.CV

    Object-Centric Multiple Object Tracking

    Authors: Zixu Zhao, Jiaze Wang, Max Horn, Yizhuo Ding, Tong He, Zechen Bai, Dominik Zietlow, Carl-Johann Simon-Gabriel, Bing Shuai, Zhuowen Tu, Thomas Brox, Bernt Schiele, Yanwei Fu, Francesco Locatello, Zheng Zhang, Tianjun Xiao

    Abstract: Unsupervised object-centric learning methods allow the partitioning of scenes into entities without additional localization information and are excellent candidates for reducing the annotation burden of multiple-object tracking (MOT) pipelines. Unfortunately, they lack two key properties: objects are often split into parts and are not consistently tracked over time. In fact, state-of-the-art model… ▽ More

    Submitted 5 September, 2023; v1 submitted 31 August, 2023; originally announced September 2023.

    Comments: ICCV 2023 camera-ready version

  21. arXiv:2307.09552  [pdf, other

    cs.LG stat.ME stat.ML

    Self-Compatibility: Evaluating Causal Discovery without Ground Truth

    Authors: Philipp M. Faller, Leena Chennuru Vankadara, Atalanti A. Mastakouri, Francesco Locatello, Dominik Janzing

    Abstract: As causal ground truth is incredibly rare, causal discovery algorithms are commonly only evaluated on simulated data. This is concerning, given that simulations reflect preconceptions about generating processes regarding noise distributions, model classes, and more. In this work, we propose a novel method for falsifying the output of a causal discovery algorithm in the absence of ground truth. Our… ▽ More

    Submitted 15 March, 2024; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: AISTATS 2024

  22. arXiv:2307.09437  [pdf, other

    cs.LG cs.AI cs.CV

    Grounded Object Centric Learning

    Authors: Avinash Kori, Francesco Locatello, Fabio De Sousa Ribeiro, Francesca Toni, Ben Glocker

    Abstract: The extraction of modular object-centric representations for downstream tasks is an emerging area of research. Learning grounded representations of objects that are guaranteed to be stable and invariant promises robust performance across different tasks and environments. Slot Attention (SA) learns object-centric representations by assigning objects to \textit{slots}, but presupposes a \textit{sing… ▽ More

    Submitted 25 January, 2024; v1 submitted 18 July, 2023; originally announced July 2023.

  23. arXiv:2306.00600  [pdf, other

    cs.LG cs.AI cs.CV

    Rotating Features for Object Discovery

    Authors: Sindy Löwe, Phillip Lippe, Francesco Locatello, Max Welling

    Abstract: The binding problem in human cognition, concerning how the brain represents and connects objects within a fixed network of neural connections, remains a subject of intense debate. Most machine learning efforts addressing this issue in an unsupervised setting have focused on slot-based methods, which may be limiting due to their discrete nature and difficulty to express uncertainty. Recently, the C… ▽ More

    Submitted 17 October, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: Oral presentation at NeurIPS 2023

  24. arXiv:2305.19377  [pdf, other

    cs.LG

    Benign Overfitting in Deep Neural Networks under Lazy Training

    Authors: Zhenyu Zhu, Fanghui Liu, Grigorios G Chrysos, Francesco Locatello, Volkan Cevher

    Abstract: This paper focuses on over-parameterized deep neural networks (DNNs) with ReLU activation functions and proves that when the data distribution is well-separated, DNNs can achieve Bayes-optimal test error for classification while obtaining (nearly) zero-training error under the lazy training regime. For this purpose, we unify three interrelated concepts of overparameterization, benign overfitting,… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: Accepted in ICML 2023

  25. arXiv:2304.10253  [pdf, other

    cs.CV cs.LG

    Image retrieval outperforms diffusion models on data augmentation

    Authors: Max F. Burg, Florian Wenzel, Dominik Zietlow, Max Horn, Osama Makansi, Francesco Locatello, Chris Russell

    Abstract: Many approaches have been proposed to use diffusion models to augment training datasets for downstream tasks, such as classification. However, diffusion models are themselves trained on large datasets, often with noisy annotations, and it remains an open question to which extent these models contribute to downstream classification performance. In particular, it remains unclear if they generalize e… ▽ More

    Submitted 30 November, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

  26. arXiv:2304.07939  [pdf, other

    cs.LG

    Leveraging sparse and shared feature activations for disentangled representation learning

    Authors: Marco Fumero, Florian Wenzel, Luca Zancato, Alessandro Achille, Emanuele Rodolà, Stefano Soatto, Bernhard Schölkopf, Francesco Locatello

    Abstract: Recovering the latent factors of variation of high dimensional data has so far focused on simple synthetic settings. Mostly building on unsupervised and weakly-supervised objectives, prior work missed out on the positive implications for representation learning on real world data. In this work, we propose to leverage knowledge extracted from a diversified set of supervised tasks to learn a common… ▽ More

    Submitted 12 December, 2023; v1 submitted 16 April, 2023; originally announced April 2023.

  27. arXiv:2304.03382  [pdf, other

    cs.LG stat.ML

    Scalable Causal Discovery with Score Matching

    Authors: Francesco Montagna, Nicoletta Noceti, Lorenzo Rosasco, Kun Zhang, Francesco Locatello

    Abstract: This paper demonstrates how to discover the whole causal graph from the second derivative of the log-likelihood in observational non-linear additive Gaussian noise models. Leveraging scalable machine learning approaches to approximate the score function $\nabla \log p(\mathbf{X})$, we extend the work of Rolland et al. (2022) that only recovers the topological order from the score and requires an e… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Journal ref: 2nd Conference on Causal Learning and Reasoning (CLeaR 2023)

  28. arXiv:2304.03265  [pdf, other

    cs.LG stat.ME

    Causal Discovery with Score Matching on Additive Models with Arbitrary Noise

    Authors: Francesco Montagna, Nicoletta Noceti, Lorenzo Rosasco, Kun Zhang, Francesco Locatello

    Abstract: Causal discovery methods are intrinsically constrained by the set of assumptions needed to ensure structure identifiability. Moreover additional restrictions are often imposed in order to simplify the inference task: this is the case for the Gaussian noise assumption on additive non-linear models, which is common to many causal discovery approaches. In this paper we show the shortcomings of infere… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Journal ref: 2nd Conference on Causal Learning and Reasoning (CLeaR 2023)

  29. arXiv:2304.01430  [pdf, other

    cs.CV cs.AI cs.LG

    Divided Attention: Unsupervised Multi-Object Discovery with Contextually Separated Slots

    Authors: Dong Lao, Zhengyang Hu, Francesco Locatello, Yanchao Yang, Stefano Soatto

    Abstract: We introduce a method to segment the visual field into independently moving regions, trained with no ground truth or supervision. It consists of an adversarial conditional encoder-decoder architecture based on Slot Attention, modified to use the image as context to decode optical flow without attempting to reconstruct the image itself. In the resulting multi-modal representation, one modality (flo… ▽ More

    Submitted 22 June, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

  30. arXiv:2301.05169  [pdf, other

    cs.LG cs.AI cs.CV

    Causal Triplet: An Open Challenge for Intervention-centric Causal Representation Learning

    Authors: Yuejiang Liu, Alexandre Alahi, Chris Russell, Max Horn, Dominik Zietlow, Bernhard Schölkopf, Francesco Locatello

    Abstract: Recent years have seen a surge of interest in learning high-level causal representations from low-level image pairs under interventions. Yet, existing efforts are largely limited to simple synthetic settings that are far away from real-world problems. In this paper, we present Causal Triplet, a causal representation learning benchmark featuring not only visually more complex scenes, but also two c… ▽ More

    Submitted 3 April, 2023; v1 submitted 12 January, 2023; originally announced January 2023.

    Comments: Conference on Causal Learning and Reasoning (CLeaR) 2023

  31. arXiv:2211.02348  [pdf, other

    cs.LG cs.AI cs.CY

    A General Purpose Neural Architecture for Geospatial Systems

    Authors: Nasim Rahaman, Martin Weiss, Frederik Träuble, Francesco Locatello, Alexandre Lacoste, Yoshua Bengio, Chris Pal, Li Erran Li, Bernhard Schölkopf

    Abstract: Geospatial Information Systems are used by researchers and Humanitarian Assistance and Disaster Response (HADR) practitioners to support a wide variety of important applications. However, collaboration between these actors is difficult due to the heterogeneous nature of geospatial data modalities (e.g., multi-spectral images of various resolutions, timeseries, weather data) and diversity of tasks… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: Presented at AI + HADR Workshop at NeurIPS 2022

  32. arXiv:2210.12733  [pdf, other

    cs.CV

    Self-supervised Amodal Video Object Segmentation

    Authors: Jian Yao, Yuxin Hong, Chiyu Wang, Tianjun Xiao, Tong He, Francesco Locatello, David Wipf, Yanwei Fu, Zheng Zhang

    Abstract: Amodal perception requires inferring the full shape of an object that is partially occluded. This task is particularly challenging on two levels: (1) it requires more information than what is contained in the instant retina or imaging sensor, (2) it is difficult to obtain enough well-annotated amodal labels for supervision. To this end, this paper develops a new framework of Self-supervised amodal… ▽ More

    Submitted 23 October, 2022; originally announced October 2022.

    Comments: accepted in Neurips2022

  33. arXiv:2210.08031  [pdf, other

    cs.LG cs.AI cs.CV cs.NE stat.ML

    Neural Attentive Circuits

    Authors: Nasim Rahaman, Martin Weiss, Francesco Locatello, Chris Pal, Yoshua Bengio, Bernhard Schölkopf, Li Erran Li, Nicolas Ballas

    Abstract: Recent work has seen the development of general purpose neural architectures that can be trained to perform tasks across diverse data modalities. General purpose models typically make few assumptions about the underlying data-structure and are known to perform well in the large-data regime. At the same time, there has been growing interest in modular neural architectures that represent the data us… ▽ More

    Submitted 19 October, 2022; v1 submitted 14 October, 2022; originally announced October 2022.

    Comments: To appear at NeurIPS 2022

  34. arXiv:2210.01738  [pdf, other

    cs.LG cs.AI cs.CV

    ASIF: Coupled Data Turns Unimodal Models to Multimodal Without Training

    Authors: Antonio Norelli, Marco Fumero, Valentino Maiorca, Luca Moschella, Emanuele Rodolà, Francesco Locatello

    Abstract: CLIP proved that aligning visual and language spaces is key to solving many vision tasks without explicit training, but required to train image and text encoders from scratch on a huge dataset. LiT improved this by only training the text encoder and using a pre-trained vision network. In this paper, we show that a common space can be created without any training at all, using single-domain encoder… ▽ More

    Submitted 10 November, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: 17 pages

  35. arXiv:2209.15430  [pdf, other

    cs.LG cs.AI

    Relative representations enable zero-shot latent space communication

    Authors: Luca Moschella, Valentino Maiorca, Marco Fumero, Antonio Norelli, Francesco Locatello, Emanuele Rodolà

    Abstract: Neural networks embed the geometric structure of a data manifold lying in a high-dimensional space into latent representations. Ideally, the distribution of the data points in the latent space should depend only on the task, the data, the loss, and other architecture-specific constraints. However, factors such as the random weights initialization, training hyperparameters, or other sources of rand… ▽ More

    Submitted 7 March, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: ICLR 2023 notable top 5%, 26 pages, 11 figures, 18 tables

    MSC Class: 68T07 ACM Class: I.2.6

  36. arXiv:2209.14860  [pdf, other

    cs.CV cs.LG

    Bridging the Gap to Real-World Object-Centric Learning

    Authors: Maximilian Seitzer, Max Horn, Andrii Zadaianchuk, Dominik Zietlow, Tianjun Xiao, Carl-Johann Simon-Gabriel, Tong He, Zheng Zhang, Bernhard Schölkopf, Thomas Brox, Francesco Locatello

    Abstract: Humans naturally decompose their environment into entities at the appropriate level of abstraction to act in the world. Allowing machine learning algorithms to derive this decomposition in an unsupervised way has become an important line of research. However, current methods are restricted to simulated data or require additional information in the form of motion or depth in order to successfully d… ▽ More

    Submitted 6 March, 2023; v1 submitted 29 September, 2022; originally announced September 2022.

    Comments: ICLR 2023 camera-ready version

  37. arXiv:2209.11459  [pdf, other

    cs.CV cs.LG

    TeST: Test-time Self-Training under Distribution Shift

    Authors: Samarth Sinha, Peter Gehler, Francesco Locatello, Bernt Schiele

    Abstract: Despite their recent success, deep neural networks continue to perform poorly when they encounter distribution shifts at test time. Many recently proposed approaches try to counter this by aligning the model to the new distribution prior to inference. With no labels available this requires unsupervised objectives to adapt the model on the observed test data. In this paper, we propose Test-Time Sel… ▽ More

    Submitted 23 September, 2022; originally announced September 2022.

    Journal ref: WACV 2023

  38. arXiv:2207.09239  [pdf, other

    cs.LG stat.ML

    Assaying Out-Of-Distribution Generalization in Transfer Learning

    Authors: Florian Wenzel, Andrea Dittadi, Peter Vincent Gehler, Carl-Johann Simon-Gabriel, Max Horn, Dominik Zietlow, David Kernert, Chris Russell, Thomas Brox, Bernt Schiele, Bernhard Schölkopf, Francesco Locatello

    Abstract: Since out-of-distribution generalization is a generally ill-posed problem, various proxy targets (e.g., calibration, adversarial robustness, algorithmic corruptions, invariance across shifts) were studied across different research programs resulting in different recommendations. While sharing the same aspirational goal, these approaches have never been tested under the same experimental conditions… ▽ More

    Submitted 21 October, 2022; v1 submitted 19 July, 2022; originally announced July 2022.

  39. arXiv:2207.05027  [pdf, other

    cs.CV

    Unsupervised Semantic Segmentation with Self-supervised Object-centric Representations

    Authors: Andrii Zadaianchuk, Matthaeus Kleindessner, Yi Zhu, Francesco Locatello, Thomas Brox

    Abstract: In this paper, we show that recent advances in self-supervised feature learning enable unsupervised object discovery and semantic segmentation with a performance that matches the state of the field on supervised semantic segmentation 10 years ago. We propose a methodology based on unsupervised saliency masks and self-supervised feature clustering to kickstart object discovery followed by training… ▽ More

    Submitted 30 April, 2023; v1 submitted 11 July, 2022; originally announced July 2022.

  40. arXiv:2204.04440  [pdf, other

    cs.LG

    Are Two Heads the Same as One? Identifying Disparate Treatment in Fair Neural Networks

    Authors: Michael Lohaus, Matthäus Kleindessner, Krishnaram Kenthapadi, Francesco Locatello, Chris Russell

    Abstract: We show that deep networks trained to satisfy demographic parity often do so through a form of race or gender awareness, and that the more we force a network to be fair, the more accurately we can recover race or gender from the internal state of the network. Based on this observation, we investigate an alternative fairness approach: we add a second classification head to the network to explicitly… ▽ More

    Submitted 19 November, 2022; v1 submitted 9 April, 2022; originally announced April 2022.

    Comments: Accepted at NeurIPS 2022

  41. arXiv:2203.04913  [pdf, other

    cs.CV cs.LG

    Leveling Down in Computer Vision: Pareto Inefficiencies in Fair Deep Classifiers

    Authors: Dominik Zietlow, Michael Lohaus, Guha Balakrishnan, Matthäus Kleindessner, Francesco Locatello, Bernhard Schölkopf, Chris Russell

    Abstract: Algorithmic fairness is frequently motivated in terms of a trade-off in which overall performance is decreased so as to improve performance on disadvantaged groups where the algorithm would otherwise be less accurate. Contrary to this, we find that applying existing fairness approaches to computer vision improve fairness by degrading the performance of classifiers across all groups (with increased… ▽ More

    Submitted 31 March, 2022; v1 submitted 9 March, 2022; originally announced March 2022.

  42. arXiv:2203.04413  [pdf, ps, other

    cs.LG stat.ML

    Score matching enables causal discovery of nonlinear additive noise models

    Authors: Paul Rolland, Volkan Cevher, Matthäus Kleindessner, Chris Russel, Bernhard Schölkopf, Dominik Janzing, Francesco Locatello

    Abstract: This paper demonstrates how to recover causal graphs from the score of the data distribution in non-linear additive (Gaussian) noise models. Using score matching algorithms as a building block, we show how to design a new generation of scalable causal discovery methods. To showcase our approach, we also propose a new efficient method for approximating the score's Jacobian, enabling to recover the… ▽ More

    Submitted 8 March, 2022; originally announced March 2022.

  43. arXiv:2202.13212  [pdf, other

    cs.LG math.OC

    Faster One-Sample Stochastic Conditional Gradient Method for Composite Convex Minimization

    Authors: Gideon Dresdner, Maria-Luiza Vladarean, Gunnar Rätsch, Francesco Locatello, Volkan Cevher, Alp Yurtsever

    Abstract: We propose a stochastic conditional gradient method (CGM) for minimizing convex finite-sum objectives formed as a sum of smooth and non-smooth terms. Existing CGM variants for this template either suffer from slow convergence rates, or require carefully increasing the batch size over the course of the algorithm's execution, which leads to computing full gradients. In contrast, the proposed method,… ▽ More

    Submitted 17 April, 2022; v1 submitted 26 February, 2022; originally announced February 2022.

    Comments: Artificial Intelligence and Statistics (AISTATS) 2022

  44. arXiv:2201.13388  [pdf, other

    cs.RO cs.LG stat.ML

    Compositional Multi-Object Reinforcement Learning with Linear Relation Networks

    Authors: Davide Mambelli, Frederik Träuble, Stefan Bauer, Bernhard Schölkopf, Francesco Locatello

    Abstract: Although reinforcement learning has seen remarkable progress over the last years, solving robust dexterous object-manipulation tasks in multi-object settings remains a challenge. In this paper, we focus on models that can learn manipulation tasks in fixed multi-object settings and extrapolate this skill zero-shot without any drop in performance when the number of objects changes. We consider the g… ▽ More

    Submitted 31 January, 2022; originally announced January 2022.

  45. arXiv:2111.13693  [pdf, other

    cs.LG cs.AI

    Enforcing and Discovering Structure in Machine Learning

    Authors: Francesco Locatello

    Abstract: The world is structured in countless ways. It may be prudent to enforce corresponding structural properties to a learning algorithm's solution, such as incorporating prior beliefs, natural constraints, or causal structures. Doing so may translate to faster, more accurate, and more flexible models, which may directly relate to real-world impact. In this dissertation, we consider two different resea… ▽ More

    Submitted 26 November, 2021; originally announced November 2021.

    Comments: Updated version of my PhD thesis, with fixed typos. Will keep updated as new typos are discovered

  46. arXiv:2110.06562  [pdf, other

    cs.CV cs.LG stat.ML

    Unsupervised Object Learning via Common Fate

    Authors: Matthias Tangemann, Steffen Schneider, Julius von Kügelgen, Francesco Locatello, Peter Gehler, Thomas Brox, Matthias Kümmerer, Matthias Bethge, Bernhard Schölkopf

    Abstract: Learning generative object models from unlabelled videos is a long standing problem and required for causal scene modeling. We decompose this problem into three easier subtasks, and provide candidate solutions for each of them. Inspired by the Common Fate Principle of Gestalt Psychology, we first extract (noisy) masks of moving objects via unsupervised motion segmentation. Second, generative model… ▽ More

    Submitted 15 May, 2023; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: Published at CLeaR 2023

  47. arXiv:2110.06399  [pdf, other

    cs.LG cs.CV

    Dynamic Inference with Neural Interpreters

    Authors: Nasim Rahaman, Muhammad Waleed Gondal, Shruti Joshi, Peter Gehler, Yoshua Bengio, Francesco Locatello, Bernhard Schölkopf

    Abstract: Modern neural network architectures can leverage large amounts of data to generalize well within the training distribution. However, they are less capable of systematic generalization to data drawn from unseen but related distributions, a feat that is hypothesized to require compositional reasoning and reuse of knowledge. In this work, we present Neural Interpreters, an architecture that factorize… ▽ More

    Submitted 12 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2021

  48. arXiv:2110.05304  [pdf, other

    cs.LG

    You Mostly Walk Alone: Analyzing Feature Attribution in Trajectory Prediction

    Authors: Osama Makansi, Julius von Kügelgen, Francesco Locatello, Peter Gehler, Dominik Janzing, Thomas Brox, Bernhard Schölkopf

    Abstract: Predicting the future trajectory of a moving agent can be easy when the past trajectory continues smoothly but is challenging when complex interactions with other agents are involved. Recent deep learning approaches for trajectory prediction show promising performance and partially attribute this to successful reasoning about agent-agent interactions. However, it remains unclear which features suc… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

  49. arXiv:2107.08221  [pdf, other

    cs.LG cs.CV

    Visual Representation Learning Does Not Generalize Strongly Within the Same Domain

    Authors: Lukas Schott, Julius von Kügelgen, Frederik Träuble, Peter Gehler, Chris Russell, Matthias Bethge, Bernhard Schölkopf, Francesco Locatello, Wieland Brendel

    Abstract: An important component for generalization in machine learning is to uncover underlying latent factors of variation as well as the mechanism through which each factor acts in the world. In this paper, we test whether 17 unsupervised, weakly supervised, and fully supervised representation learning approaches correctly infer the generative factors of variation in simple datasets (dSprites, Shapes3D,… ▽ More

    Submitted 12 February, 2022; v1 submitted 17 July, 2021; originally announced July 2021.

  50. arXiv:2107.05686  [pdf, other

    cs.LG stat.ML

    The Role of Pretrained Representations for the OOD Generalization of Reinforcement Learning Agents

    Authors: Andrea Dittadi, Frederik Träuble, Manuel Wüthrich, Felix Widmaier, Peter Gehler, Ole Winther, Francesco Locatello, Olivier Bachem, Bernhard Schölkopf, Stefan Bauer

    Abstract: Building sample-efficient agents that generalize out-of-distribution (OOD) in real-world settings remains a fundamental unsolved problem on the path towards achieving higher-level cognition. One particularly promising approach is to begin with low-dimensional, pretrained representations of our world, which should facilitate efficient downstream learning and generalization. By training 240 represen… ▽ More

    Submitted 16 April, 2022; v1 submitted 12 July, 2021; originally announced July 2021.

    Comments: Published at ICLR 2022