Skip to main content

Showing 1–18 of 18 results for author: Ortiz-Jimenez, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00106  [pdf, other

    cs.LG cs.AI cs.CL cs.CR

    UnUnlearning: Unlearning is not sufficient for content regulation in advanced generative AI

    Authors: Ilia Shumailov, Jamie Hayes, Eleni Triantafillou, Guillermo Ortiz-Jimenez, Nicolas Papernot, Matthew Jagielski, Itay Yona, Heidi Howard, Eugene Bagdasaryan

    Abstract: Exact unlearning was first introduced as a privacy mechanism that allowed a user to retract their data from machine learning models on request. Shortly after, inexact schemes were proposed to mitigate the impractical costs associated with exact unlearning. More recently unlearning is often discussed as an approach for removal of impermissible knowledge i.e. knowledge that the model should not poss… ▽ More

    Submitted 27 June, 2024; originally announced July 2024.

  2. arXiv:2405.07813  [pdf, other

    cs.LG cs.CV

    Localizing Task Information for Improved Model Merging and Compression

    Authors: Ke Wang, Nikolaos Dimitriadis, Guillermo Ortiz-Jimenez, François Fleuret, Pascal Frossard

    Abstract: Model merging and task arithmetic have emerged as promising scalable approaches to merge multiple single-task checkpoints to one multi-task model, but their applicability is reduced by significant performance loss. Previous works have linked these drops to interference in the weight space and erasure of important task-specific features. Instead, in this work we show that the information required t… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: Accepted ICML 2024; The first two authors contributed equally to this work; Project website: https://tall-masks.github.io

  3. arXiv:2310.06600  [pdf, other

    cs.LG cs.CV

    Pi-DUAL: Using Privileged Information to Distinguish Clean from Noisy Labels

    Authors: Ke Wang, Guillermo Ortiz-Jimenez, Rodolphe Jenatton, Mark Collier, Efi Kokiopoulou, Pascal Frossard

    Abstract: Label noise is a pervasive problem in deep learning that often compromises the generalization performance of trained models. Recently, leveraging privileged information (PI) -- information available only during training but not at test time -- has emerged as an effective approach to mitigate this issue. Yet, existing PI-based methods have failed to consistently outperform their no-PI counterparts… ▽ More

    Submitted 28 May, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted ICML 2024

  4. arXiv:2305.12827  [pdf, other

    cs.LG cs.CV

    Task Arithmetic in the Tangent Space: Improved Editing of Pre-Trained Models

    Authors: Guillermo Ortiz-Jimenez, Alessandro Favero, Pascal Frossard

    Abstract: Task arithmetic has recently emerged as a cost-effective and scalable approach to edit pre-trained models directly in weight space: By adding the fine-tuned weights of different tasks, the model's performance can be improved on these tasks, while negating them leads to task forgetting. Yet, our understanding of the effectiveness of task arithmetic and its underlying principles remains limited. We… ▽ More

    Submitted 21 November, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Journal ref: Advances in Neural Information Processing Systems 36 (NeurIPS 2023)

  5. arXiv:2303.01806  [pdf, other

    cs.LG cs.CV

    When does Privileged Information Explain Away Label Noise?

    Authors: Guillermo Ortiz-Jimenez, Mark Collier, Anant Nawalgaria, Alexander D'Amour, Jesse Berent, Rodolphe Jenatton, Effrosyni Kokiopoulou

    Abstract: Leveraging privileged information (PI), or features available during training but not at test time, has recently been shown to be an effective method for addressing label noise. However, the reasons for its effectiveness are not well understood. In this study, we investigate the role played by different properties of the PI in explaining away label noise. Through experiments on multiple datasets w… ▽ More

    Submitted 1 June, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: Accepted ICML 2023, Honolulu

  6. arXiv:2206.08242  [pdf, other

    cs.LG cs.AI cs.CV

    Catastrophic overfitting can be induced with discriminative non-robust features

    Authors: Guillermo Ortiz-Jiménez, Pau de Jorge, Amartya Sanyal, Adel Bibi, Puneet K. Dokania, Pascal Frossard, Gregory Rogéz, Philip H. S. Torr

    Abstract: Adversarial training (AT) is the de facto method for building robust neural networks, but it can be computationally expensive. To mitigate this, fast single-step attacks can be used, but this may lead to catastrophic overfitting (CO). This phenomenon appears when networks gain non-trivial robustness during the first stages of AT, but then reach a breaking point where they become vulnerable in just… ▽ More

    Submitted 15 August, 2023; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: Published in Transactions on Machine Learning Research (TMLR)

  7. arXiv:2203.07159  [pdf, other

    cs.LG cs.CR

    On the benefits of knowledge distillation for adversarial robustness

    Authors: Javier Maroto, Guillermo Ortiz-Jiménez, Pascal Frossard

    Abstract: Knowledge distillation is normally used to compress a big network, or teacher, onto a smaller one, the student, by training it to match its outputs. Recently, some works have shown that robustness against adversarial attacks can also be distilled effectively to achieve good rates of robustness on mobile-friendly models. In this work, however, we take a different point of view, and show that knowle… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

  8. arXiv:2112.13547  [pdf, other

    cs.CV cs.LG

    PRIME: A few primitives can boost robustness to common corruptions

    Authors: Apostolos Modas, Rahul Rade, Guillermo Ortiz-Jiménez, Seyed-Mohsen Moosavi-Dezfooli, Pascal Frossard

    Abstract: Despite their impressive performance on image classification tasks, deep networks have a hard time generalizing to unforeseen corruptions of their data. To fix this vulnerability, prior works have built complex data augmentation strategies, combining multiple methods to enrich the training data. However, introducing intricate design choices or heuristics makes it hard to understand which elements… ▽ More

    Submitted 13 March, 2022; v1 submitted 27 December, 2021; originally announced December 2021.

    Comments: Code available at: https://github.com/amodas/PRIME-augmentations

    Journal ref: European Conference on Computer Vision (ECCV) 2022

  9. arXiv:2112.01917  [pdf, other

    cs.LG cs.CV

    A Structured Dictionary Perspective on Implicit Neural Representations

    Authors: Gizem Yüce, Guillermo Ortiz-Jiménez, Beril Besbinar, Pascal Frossard

    Abstract: Implicit neural representations (INRs) have recently emerged as a promising alternative to classical discretized representations of signals. Nevertheless, despite their practical success, we still do not understand how INRs represent signals. We propose a novel unified perspective to theoretically analyse INRs. Leveraging results from harmonic analysis and deep learning theory, we show that most I… ▽ More

    Submitted 25 March, 2022; v1 submitted 3 December, 2021; originally announced December 2021.

    Comments: Accepted to IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022 (26 pages, 16 figures)

  10. arXiv:2106.06770  [pdf, other

    cs.LG cs.AI

    What can linearized neural networks actually say about generalization?

    Authors: Guillermo Ortiz-Jiménez, Seyed-Mohsen Moosavi-Dezfooli, Pascal Frossard

    Abstract: For certain infinitely-wide neural networks, the neural tangent kernel (NTK) theory fully characterizes generalization, but for the networks used in practice, the empirical NTK only provides a rough first-order approximation. Still, a growing body of work keeps leveraging this approximation to successfully analyze important deep learning phenomena and design algorithms for new applications. In our… ▽ More

    Submitted 13 October, 2021; v1 submitted 12 June, 2021; originally announced June 2021.

    Comments: 18 pages, 16 figures

    Journal ref: Advances on Neural Information Processing Systems (NeurIPS 2021)

  11. arXiv:2104.14372  [pdf, other

    cs.LG cs.AI

    A neural anisotropic view of underspecification in deep learning

    Authors: Guillermo Ortiz-Jimenez, Itamar Franco Salazar-Reque, Apostolos Modas, Seyed-Mohsen Moosavi-Dezfooli, Pascal Frossard

    Abstract: The underspecification of most machine learning pipelines means that we cannot rely solely on validation performance to assess the robustness of deep learning systems to naturally occurring distribution shifts. Instead, making sure that a neural network can generalize across a large number of different situations requires to understand the specific way in which it solves a task. In this work, we p… ▽ More

    Submitted 29 April, 2021; originally announced April 2021.

    Comments: Presented as a RobustML workshop paper at ICLR 2021

  12. arXiv:2010.09624  [pdf, other

    cs.LG cs.AI cs.CV

    Optimism in the Face of Adversity: Understanding and Improving Deep Learning through Adversarial Robustness

    Authors: Guillermo Ortiz-Jimenez, Apostolos Modas, Seyed-Mohsen Moosavi-Dezfooli, Pascal Frossard

    Abstract: Driven by massive amounts of data and important advances in computational resources, new deep learning systems have achieved outstanding results in a large spectrum of applications. Nevertheless, our current theoretical understanding on the mathematical foundations of deep learning lags far behind its empirical success. Towards solving the vulnerability of neural networks, however, the field of ad… ▽ More

    Submitted 28 January, 2021; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: 24 pages, 14 figures

  13. arXiv:2006.09717  [pdf, other

    cs.LG cs.CV stat.ML

    Neural Anisotropy Directions

    Authors: Guillermo Ortiz-Jimenez, Apostolos Modas, Seyed-Mohsen Moosavi-Dezfooli, Pascal Frossard

    Abstract: In this work, we analyze the role of the network architecture in sha** the inductive bias of deep classifiers. To that end, we start by focusing on a very simple problem, i.e., classifying a class of linearly separable distributions, and show that, depending on the direction of the discriminative feature of the distribution, many state-of-the-art deep convolutional neural networks (CNNs) have a… ▽ More

    Submitted 14 October, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: Accepted to the 34th Conference on Neural Information Processing Systems (NeurIPS 2020) (39 pages, 22 figures)

  14. arXiv:2002.06349  [pdf, other

    cs.LG cs.CV stat.ML

    Hold me tight! Influence of discriminative features on deep network boundaries

    Authors: Guillermo Ortiz-Jimenez, Apostolos Modas, Seyed-Mohsen Moosavi-Dezfooli, Pascal Frossard

    Abstract: Important insights towards the explainability of neural networks reside in the characteristics of their decision boundaries. In this work, we borrow tools from the field of adversarial robustness, and propose a new perspective that relates dataset features to the distance of samples to the decision boundary. This enables us to carefully tweak the position of the training samples and measure the in… ▽ More

    Submitted 15 October, 2020; v1 submitted 15 February, 2020; originally announced February 2020.

    Comments: Accepted to the 34th Conference on Neural Information Processing Systems (NeurIPS) 2020 (30 pages, 38 figures)

  15. arXiv:1911.05384  [pdf, other

    cs.SI cs.LG eess.SP stat.ML

    On the choice of graph neural network architectures

    Authors: Clément Vignac, Guillermo Ortiz-Jiménez, Pascal Frossard

    Abstract: Seminal works on graph neural networks have primarily targeted semi-supervised node classification problems with few observed labels and high-dimensional signals. With the development of graph networks, this setup has become a de facto benchmark for a significant body of research. Interestingly, several works have recently shown that in this particular setting, graph neural networks do not perform… ▽ More

    Submitted 10 February, 2020; v1 submitted 13 November, 2019; originally announced November 2019.

    Comments: 5 pages, 1 figure, accepted at ICASSP 2020

  16. arXiv:1909.11448  [pdf, other

    stat.ML cs.LG

    Forward-Backward Splitting for Optimal Transport based Problems

    Authors: Guillermo Ortiz-Jimenez, Mireille El Gheche, Effrosyni Simou, Hermina Petric Maretic, Pascal Frossard

    Abstract: Optimal transport aims to estimate a transportation plan that minimizes a displacement cost. This is realized by optimizing the scalar product between the sought plan and the given cost, over the space of doubly stochastic matrices. When the entropy regularization is added to the problem, the transportation plan can be efficiently computed with the Sinkhorn algorithm. Thanks to this breakthrough,… ▽ More

    Submitted 4 November, 2019; v1 submitted 19 September, 2019; originally announced September 2019.

  17. arXiv:1807.00145  [pdf, other

    eess.SP cs.IT

    Sampling and Reconstruction of Signals on Product Graphs

    Authors: Guillermo Ortiz-Jiménez, Mario Coutino, Sundeep Prabhakar Chepuri, Geert Leus

    Abstract: In this paper, we consider the problem of subsampling and reconstruction of signals that reside on the vertices of a product graph, such as sensor network time series, genomic signals, or product ratings in a social network. Specifically, we leverage the product structure of the underlying domain and sample nodes from the graph factors. The proposed scheme is particularly useful for processing sig… ▽ More

    Submitted 30 June, 2018; originally announced July 2018.

    Comments: 5 pages, 3 figures

  18. Sparse Sampling for Inverse Problems with Tensors

    Authors: Guillermo Ortiz-Jiménez, Mario Coutino, Sundeep Prabhakar Chepuri, Geert Leus

    Abstract: We consider the problem of designing sparse sampling strategies for multidomain signals, which can be represented using tensors that admit a known multilinear decomposition. We leverage the multidomain structure of tensor signals and propose to acquire samples using a Kronecker-structured sensing function, thereby circumventing the curse of dimensionality. For designing such sensing functions, we… ▽ More

    Submitted 28 June, 2018; originally announced June 2018.

    Comments: 13 pages, 7 figures