Showing 1–7 of 7 results for author: Sabanayagam, M

Searching in archive cs.
  1. arXiv:2307.11672  [pdf, other]

    cs.LG cs.CR

    Fast Adaptive Test-Time Defense with Robust Features

    Authors: Anurag Singh, Mahalakshmi Sabanayagam, Krikamol Muandet, Debarghya Ghoshdastidar

    Abstract: Adaptive test-time defenses are used to improve the robustness of deep neural networks to adversarial examples. However, existing methods significantly increase the inference time due to additional optimization on the model parameters or the input at test time. In this work, we propose a novel adaptive test-time defense strategy that is easy to integrate with any existing (robust) training procedu…

    Submitted 21 July, 2023; originally announced July 2023.

  2. arXiv:2307.02693  [pdf, other]

    cs.LG stat.ML

    Kernels, Data & Physics

    Authors: Francesco Cagnetta, Deborah Oliveira, Mahalakshmi Sabanayagam, Nikolaos Tsilivis, Julia Kempe

    Abstract: Lecture notes from the course given by Professor Julia Kempe at the summer school "Statistical physics of Machine Learning" in Les Houches. The notes discuss the so-called NTK approach to problems in machine learning, which consists of gaining an understanding of generally unsolvable problems by finding a tractable kernel formulation. The notes are mainly focused on practical applications such as…

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: These are notes from the lecture of Julia Kempe given at the summer school "Statistical Physics & Machine Learning", which took place at the Les Houches School of Physics in France from 4th to 29th July 2022

  3. arXiv:2306.07104  [pdf, other]

    cs.LG cond-mat.dis-nn stat.ML

    Unveiling the Hessian's Connection to the Decision Boundary

    Authors: Mahalakshmi Sabanayagam, Freya Behrens, Urte Adomaityte, Anna Dawid

    Abstract: Understanding the properties of well-generalizing minima is at the heart of deep learning research. On the one hand, the generalization of neural networks has been connected to the decision boundary complexity, which is hard to study in the high-dimensional input space. Conversely, the flatness of a minimum has become a controversial proxy for generalization. In this work, we provide the missing l…

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: 14 pages, 6 figures + 18-page appendices with 19 figures. Any feedback is very welcome! Code is available at https://github.com/Shmoo137/Hessian-and-Decision-Boundary

  4. arXiv:2212.01046  [pdf, other]

    cs.LG

    Improved Representation Learning Through Tensorized Autoencoders

    Authors: Pascal Mattia Esser, Satyaki Mukherjee, Mahalakshmi Sabanayagam, Debarghya Ghoshdastidar

    Abstract: The central question in representation learning is what constitutes a good or meaningful representation. In this work we argue that if we consider data with inherent cluster structures, where clusters can be characterized through different means and covariances, those data structures should be represented in the embedding as well. While Autoencoders (AE) are widely used in practice for unsupervise…

    Submitted 2 December, 2022; originally announced December 2022.

  5. arXiv:2210.09809  [pdf, other]

    cs.LG

    Analysis of Convolutions, Non-linearity and Depth in Graph Neural Networks using Neural Tangent Kernel

    Authors: Mahalakshmi Sabanayagam, Pascal Esser, Debarghya Ghoshdastidar

    Abstract: The fundamental principle of Graph Neural Networks (GNNs) is to exploit the structural information of the data by aggregating the neighboring nodes using a 'graph convolution' in conjunction with a suitable choice for the network architecture, such as depth and activation functions. Therefore, understanding the influence of each of the design choices on the network performance is crucial. Convoluti…

    Submitted 31 October, 2023; v1 submitted 18 October, 2022; originally announced October 2022.

    Comments: 39 pages, 24 figures. Code available at https://github.com/mahalakshmi-sabanayagam/NTK_GCN

  6. arXiv:2110.04060  [pdf, other]

    cs.LG stat.ML

    New Insights into Graph Convolutional Networks using Neural Tangent Kernels

    Authors: Mahalakshmi Sabanayagam, Pascal Esser, Debarghya Ghoshdastidar

    Abstract: Graph Convolutional Networks (GCNs) have emerged as powerful tools for learning on network structured data. Although empirically successful, GCNs exhibit certain behaviour that has no rigorous explanation -- for instance, the performance of GCNs significantly degrades with increasing network depth, whereas it improves marginally with depth using skip connections. This paper focuses on semi-supervi…

    Submitted 4 November, 2023; v1 submitted 8 October, 2021; originally announced October 2021.

  7. arXiv:2110.02722  [pdf, other]

    cs.LG stat.ML

    Graphon based Clustering and Testing of Networks: Algorithms and Theory

    Authors: Mahalakshmi Sabanayagam, Leena Chennuru Vankadara, Debarghya Ghoshdastidar

    Abstract: Network-valued data are encountered in a wide range of applications and pose challenges in learning due to their complex structure and absence of vertex correspondence. Typical examples of such problems include classification or grouping of protein structures and social networks. Various methods, ranging from graph kernels to graph neural networks, have been proposed that achieve some success in g…

    Submitted 7 November, 2021; v1 submitted 6 October, 2021; originally announced October 2021.