Skip to main content

Showing 1–9 of 9 results for author: Virmaux, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.10198  [pdf, other

    cs.LG stat.ML

    SAMformer: Unlocking the Potential of Transformers in Time Series Forecasting with Sharpness-Aware Minimization and Channel-Wise Attention

    Authors: Romain Ilbert, Ambroise Odonnat, Vasilii Feofanov, Aladin Virmaux, Giuseppe Paolo, Themis Palpanas, Ievgen Redko

    Abstract: Transformer-based architectures achieved breakthrough performance in natural language processing and computer vision, yet they remain inferior to simpler linear baselines in multivariate long-term forecasting. To better understand this phenomenon, we start by studying a toy linear forecasting problem for which we show that transformers are incapable of converging to their true solution despite the… ▽ More

    Submitted 3 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Accepted as an Oral at ICML 2024, Vienna. The first two authors contributed equally

  2. arXiv:2310.13434  [pdf, other

    cs.LG cs.AI stat.ML

    Random Matrix Analysis to Balance between Supervised and Unsupervised Learning under the Low Density Separation Assumption

    Authors: Vasilii Feofanov, Malik Tiomoko, Aladin Virmaux

    Abstract: We propose a theoretical framework to analyze semi-supervised classification under the low density separation assumption in a high-dimensional regime. In particular, we introduce QLDS, a linear classification model, where the low density separation assumption is implemented via quadratic margin maximization. The algorithm has an explicit solution with rich theoretical properties, and we show that… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Journal ref: Proceedings of the 40th International Conference on Machine Learning, PMLR 202:10008-10033, 2023

  3. arXiv:2110.02716  [pdf, other

    cs.LG

    Knothe-Rosenblatt transport for Unsupervised Domain Adaptation

    Authors: Aladin Virmaux, Illyyne Saffar, Jianfeng Zhang, Balázs Kégl

    Abstract: Unsupervised domain adaptation (UDA) aims at exploiting related but different data sources to tackle a common task in a target domain. UDA remains a central yet challenging problem in machine learning. In this paper, we present an approach tailored to moderate-dimensional tabular problems which are hugely important in industrial applications and less well-served by the plethora of methods designed… ▽ More

    Submitted 6 October, 2021; originally announced October 2021.

    Comments: 16 pages, 3 figures

    ACM Class: I.2.6; G.3

  4. arXiv:2103.04886  [pdf, other

    cs.LG stat.ML

    Lipschitz Normalization for Self-Attention Layers with Application to Graph Neural Networks

    Authors: George Dasoulas, Kevin Scaman, Aladin Virmaux

    Abstract: Attention based neural networks are state of the art in a large range of applications. However, their performance tends to degrade when the number of layers increases. In this work, we show that enforcing Lipschitz continuity by normalizing the attention scores can significantly improve the performance of deep attention models. First, we show that, for deep graph attention networks (GAT), gradient… ▽ More

    Submitted 13 September, 2021; v1 submitted 8 March, 2021; originally announced March 2021.

    Comments: 18 pages. Proceedings of the 38th International Conference on Machine Learning, PMLR 139, 2021. Copyright 2021 by the author(s)

  5. arXiv:2102.09012  [pdf, other

    cs.LG

    Improving Hierarchical Adversarial Robustness of Deep Neural Networks

    Authors: Avery Ma, Aladin Virmaux, Kevin Scaman, Juwei Lu

    Abstract: Do all adversarial examples have the same consequences? An autonomous driving system misclassifying a pedestrian as a car may induce a far more dangerous -- and even potentially lethal -- behavior than, for instance, a car as a bus. In order to better tackle this important problematic, we introduce the concept of hierarchical adversarial robustness. Given a dataset whose classes can be grouped int… ▽ More

    Submitted 17 February, 2021; originally announced February 2021.

  6. arXiv:2102.08735  [pdf, other

    cs.LG cs.SI

    Ego-based Entropy Measures for Structural Representations on Graphs

    Authors: George Dasoulas, Giannis Nikolentzos, Kevin Scaman, Aladin Virmaux, Michalis Vazirgiannis

    Abstract: Machine learning on graph-structured data has attracted high research interest due to the emergence of Graph Neural Networks (GNNs). Most of the proposed GNNs are based on the node homophily, i.e neighboring nodes share similar characteristics. However, in many complex networks, nodes that lie to distant parts of the graph share structurally equivalent characteristics and exhibit similar roles (e.… ▽ More

    Submitted 17 February, 2021; originally announced February 2021.

    Comments: Accepted to ICASSP 2021. arXiv admin note: substantial text overlap with arXiv:2003.00553

  7. arXiv:2003.00553  [pdf, other

    cs.LG cs.SI stat.ML

    Ego-based Entropy Measures for Structural Representations

    Authors: George Dasoulas, Giannis Nikolentzos, Kevin Scaman, Aladin Virmaux, Michalis Vazirgiannis

    Abstract: In complex networks, nodes that share similar structural characteristics often exhibit similar roles (e.g type of users in a social network or the hierarchical position of employees in a company). In order to leverage this relationship, a growing literature proposed latent representations that identify structurally equivalent nodes. However, most of the existing methods require high time and space… ▽ More

    Submitted 1 March, 2020; originally announced March 2020.

    Comments: 7 pages, 4 figures

  8. arXiv:1912.06058  [pdf, other

    cs.LG stat.ML

    Coloring graph neural networks for node disambiguation

    Authors: George Dasoulas, Ludovic Dos Santos, Kevin Scaman, Aladin Virmaux

    Abstract: In this paper, we show that a simple coloring scheme can improve, both theoretically and empirically, the expressive power of Message Passing Neural Networks(MPNNs). More specifically, we introduce a graph neural network called Colored Local Iterative Procedure (CLIP) that uses colors to disambiguate identical node attributes, and show that this representation is a universal approximator of contin… ▽ More

    Submitted 12 December, 2019; originally announced December 2019.

    Comments: 17 pages, 2 figures

  9. arXiv:1805.10965  [pdf, other

    stat.ML cs.LG

    Lipschitz regularity of deep neural networks: analysis and efficient estimation

    Authors: Kevin Scaman, Aladin Virmaux

    Abstract: Deep neural networks are notorious for being sensitive to small well-chosen perturbations, and estimating the regularity of such architectures is of utmost importance for safe and robust practical applications. In this paper, we investigate one of the key characteristics to assess the regularity of such methods: the Lipschitz constant of deep learning architectures. First, we show that, even for t… ▽ More

    Submitted 25 October, 2019; v1 submitted 28 May, 2018; originally announced May 2018.

    Comments: 12 pages, 3 figures