Skip to main content

Showing 1–11 of 11 results for author: Petrini, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.16154  [pdf

    cs.LG

    Breaking the Curse of Dimensionality in Deep Neural Networks by Learning Invariant Representations

    Authors: Leonardo Petrini

    Abstract: Artificial intelligence, particularly the subfield of machine learning, has seen a paradigm shift towards data-driven models that learn from and adapt to data. This has resulted in unprecedented advancements in various domains such as natural language processing and computer vision, largely attributed to deep learning, a special class of machine learning models. Deep learning arguably surpasses tr… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: PhD Thesis @ EPFL

  2. arXiv:2307.02129  [pdf, other

    cs.LG cs.CV stat.ML

    How Deep Neural Networks Learn Compositional Data: The Random Hierarchy Model

    Authors: Francesco Cagnetta, Leonardo Petrini, Umberto M. Tomasini, Alessandro Favero, Matthieu Wyart

    Abstract: Deep learning algorithms demonstrate a surprising ability to learn high-dimensional tasks from limited examples. This is commonly attributed to the depth of neural networks, enabling them to build a hierarchy of abstract, low-dimensional data representations. However, how many training examples are required to learn such representations remains unknown. To quantitatively study this question, we in… ▽ More

    Submitted 3 July, 2024; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: 9 pages, 8 figures

    Journal ref: Phys. Rev. X 14, 031001 (2024)

  3. arXiv:2210.01506  [pdf, other

    cs.LG cs.CV

    How deep convolutional neural networks lose spatial information with training

    Authors: Umberto M. Tomasini, Leonardo Petrini, Francesco Cagnetta, Matthieu Wyart

    Abstract: A central question of machine learning is how deep nets manage to learn tasks in high dimensions. An appealing hypothesis is that they achieve this feat by building a representation of the data where information irrelevant to the task is lost. For image datasets, this view is supported by the observation that after (and not before) training, the neural representation becomes less and less sensitiv… ▽ More

    Submitted 23 November, 2022; v1 submitted 4 October, 2022; originally announced October 2022.

  4. arXiv:2206.12314  [pdf, other

    stat.ML cs.LG

    Learning sparse features can lead to overfitting in neural networks

    Authors: Leonardo Petrini, Francesco Cagnetta, Eric Vanden-Eijnden, Matthieu Wyart

    Abstract: It is widely believed that the success of deep networks lies in their ability to learn a meaningful representation of the features of the data. Yet, understanding when and how this feature learning improves performance remains a challenge: for example, it is beneficial for modern architectures trained to classify images, whereas it is detrimental for fully-connected networks trained for the same t… ▽ More

    Submitted 12 October, 2022; v1 submitted 24 June, 2022; originally announced June 2022.

  5. arXiv:2112.03111  [pdf, ps, other

    cs.CV cs.CY cs.LG

    Ethics and Creativity in Computer Vision

    Authors: Negar Rostamzadeh, Emily Denton, Linda Petrini

    Abstract: This paper offers a retrospective of what we learnt from organizing the workshop *Ethical Considerations in Creative applications of Computer Vision* at CVPR 2021 conference and, prior to that, a series of workshops on *Computer Vision for Fashion, Art and Design* at ECCV 2018, ICCV 2019, and CVPR 2020. We hope this reflection will bring artists and machine learning researchers into conversation a… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: Neural Information Processing System 2021 workshop on Machine Learning for Creativity and Design

    Journal ref: NeurIPS 2021 workshop on Machine Learning for Creativity and Design

  6. Relative stability toward diffeomorphisms indicates performance in deep nets

    Authors: Leonardo Petrini, Alessandro Favero, Mario Geiger, Matthieu Wyart

    Abstract: Understanding why deep nets can classify data in large dimensions remains a challenge. It has been proposed that they do so by becoming stable to diffeomorphisms, yet existing empirical measurements support that it is often not the case. We revisit this question by defining a maximum-entropy distribution on diffeomorphisms, that allows to study typical diffeomorphisms of a given norm. We confirm t… ▽ More

    Submitted 4 November, 2021; v1 submitted 6 May, 2021; originally announced May 2021.

    Comments: NeurIPS 2021 Conference

  7. arXiv:2104.02646  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    gradSim: Differentiable simulation for system identification and visuomotor control

    Authors: Krishna Murthy Jatavallabhula, Miles Macklin, Florian Golemo, Vikram Voleti, Linda Petrini, Martin Weiss, Breandan Considine, Jerome Parent-Levesque, Kevin Xie, Kenny Erleben, Liam Paull, Florian Shkurti, Derek Nowrouzezahrai, Sanja Fidler

    Abstract: We consider the problem of estimating an object's physical properties such as mass, friction, and elasticity directly from video sequences. Such a system identification problem is fundamentally ill-posed due to the loss of information during image formation. Current solutions require precise 3D labels which are labor-intensive to gather, and infeasible to create for many systems such as deformable… ▽ More

    Submitted 6 April, 2021; originally announced April 2021.

    Comments: ICLR 2021. Project page (and a dynamic web version of the article): https://gradsim.github.io

  8. arXiv:2012.15110  [pdf, other

    cs.LG

    Perspective: A Phase Diagram for Deep Learning unifying Jamming, Feature Learning and Lazy Training

    Authors: Mario Geiger, Leonardo Petrini, Matthieu Wyart

    Abstract: Deep learning algorithms are responsible for a technological revolution in a variety of tasks including image recognition or Go playing. Yet, why they work is not understood. Ultimately, they manage to classify data lying in high dimension -- a feat generically impossible due to the geometry of high dimensional space and the associated curse of dimensionality. Understanding what kind of structure,… ▽ More

    Submitted 30 December, 2020; originally announced December 2020.

  9. arXiv:2010.13320  [pdf, other

    cs.CV cs.LG

    Zero-Shot Learning from scratch (ZFS): leveraging local compositional representations

    Authors: Tristan Sylvain, Linda Petrini, R Devon Hjelm

    Abstract: Zero-shot classification is a generalization task where no instance from the target classes is seen during training. To allow for test-time transfer, each class is annotated with semantic information, commonly in the form of attributes or text descriptions. While classical zero-shot learning does not explicitly forbid using information from other datasets, the approaches that achieve the best abso… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: ICML 2019 Workshop on Understanding and Improving General-ization in Deep Learning, Long Beach, California, 2019 Spotlight presentation. arXiv admin note: text overlap with arXiv:1912.12179

  10. Geometric compression of invariant manifolds in neural nets

    Authors: Jonas Paccolat, Leonardo Petrini, Mario Geiger, Kevin Tyloo, Matthieu Wyart

    Abstract: We study how neural networks compress uninformative input space in models where data lie in $d$ dimensions, but whose label only vary within a linear manifold of dimension $d_\parallel < d$. We show that for a one-hidden layer network initialized with infinitesimal weights (i.e. in the feature learning regime) trained with gradient descent, the first layer of weights evolve to become nearly insens… ▽ More

    Submitted 11 March, 2021; v1 submitted 22 July, 2020; originally announced July 2020.

    Journal ref: Journal of Statistical Mechanics: Theory and Experiment, Volume 2021, April 2021

  11. arXiv:1912.12179  [pdf, other

    cs.CV cs.LG

    Locality and compositionality in zero-shot learning

    Authors: Tristan Sylvain, Linda Petrini, Devon Hjelm

    Abstract: In this work we study locality and compositionality in the context of learning representations for Zero Shot Learning (ZSL). In order to well-isolate the importance of these properties in learned representations, we impose the additional constraint that, differently from most recent work in ZSL, no pre-training on different datasets (e.g. ImageNet) is performed. The results of our experiments show… ▽ More

    Submitted 20 December, 2019; originally announced December 2019.

    Comments: Published at ICLR 2020