Skip to main content

Showing 1–35 of 35 results for author: Ghoshdastidar, D

.
  1. arXiv:2403.08673  [pdf, other

    cs.LG stat.ML

    When can we Approximate Wide Contrastive Models with Neural Tangent Kernels and Principal Component Analysis?

    Authors: Gautham Govind Anil, Pascal Esser, Debarghya Ghoshdastidar

    Abstract: Contrastive learning is a paradigm for learning representations from unlabelled data that has been highly successful for image and text data. Several recent works have examined contrastive losses to claim that contrastive models effectively learn spectral embeddings, while few works show relations between (wide) contrastive models and kernel principal component analysis (PCA). However, it is not k… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  2. arXiv:2402.13108  [pdf, ps, other

    cs.LG

    On the Stability of Gradient Descent for Large Learning Rate

    Authors: Alexandru Crăciun, Debarghya Ghoshdastidar

    Abstract: There currently is a significant interest in understanding the Edge of Stability (EoS) phenomenon, which has been observed in neural networks training, characterized by a non-monotonic decrease of the loss function over epochs, while the sharpness of the loss (spectral norm of the Hessian) progressively approaches and stabilizes around 2/(learning rate). Reasons for the existence of EoS when train… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  3. arXiv:2402.09881  [pdf, other

    cs.LG

    Explaining Kernel Clustering via Decision Trees

    Authors: Maximilian Fleissner, Leena Chennuru Vankadara, Debarghya Ghoshdastidar

    Abstract: Despite the growing popularity of explainable and interpretable machine learning, there is still surprisingly limited work on inherently interpretable clustering methods. Recently, there has been a surge of interest in explaining the classic k-means algorithm, leading to efficient algorithms that approximate k-means clusters using axis-aligned decision trees. However, interpretable variants of k-m… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

  4. arXiv:2309.02028  [pdf, other

    cs.LG

    Non-Parametric Representation Learning with Kernels

    Authors: Pascal Esser, Maximilian Fleissner, Debarghya Ghoshdastidar

    Abstract: Unsupervised and self-supervised representation learning has become popular in recent years for learning useful features from unlabelled data. Representation learning has been mostly developed in the neural network literature, and other models for representation learning are surprisingly unexplored. In this work, we introduce and analyze several kernel-based representation learning approaches: Fir… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

  5. arXiv:2309.02011  [pdf, other

    cs.LG

    Representation Learning Dynamics of Self-Supervised Models

    Authors: Pascal Esser, Satyaki Mukherjee, Debarghya Ghoshdastidar

    Abstract: Self-Supervised Learning (SSL) is an important paradigm for learning representations from unlabelled data, and SSL with neural networks has been highly successful in practice. However current theoretical analysis of SSL is mostly restricted to generalisation error bounds. In contrast, learning dynamics often provide a precise characterisation of the behaviour of neural networks based models but, s… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

  6. arXiv:2307.11672  [pdf, other

    cs.LG cs.CR

    Fast Adaptive Test-Time Defense with Robust Features

    Authors: Anurag Singh, Mahalakshmi Sabanayagam, Krikamol Muandet, Debarghya Ghoshdastidar

    Abstract: Adaptive test-time defenses are used to improve the robustness of deep neural networks to adversarial examples. However, existing methods significantly increase the inference time due to additional optimization on the model parameters or the input at test time. In this work, we propose a novel adaptive test-time defense strategy that is easy to integrate with any existing (robust) training procedu… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

  7. arXiv:2302.12693  [pdf, ps, other

    cs.LG math.ST stat.ML

    Wasserstein Projection Pursuit of Non-Gaussian Signals

    Authors: Satyaki Mukherjee, Soumendu Sundar Mukherjee, Debarghya Ghoshdastidar

    Abstract: We consider the general dimensionality reduction problem of locating in a high-dimensional data cloud, a $k$-dimensional non-Gaussian subspace of interesting features. We use a projection pursuit approach -- we search for mutually orthogonal unit directions which maximise the 2-Wasserstein distance of the empirical distribution of data-projections along these directions from a standard Gaussian. U… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

  8. arXiv:2212.01046  [pdf, other

    cs.LG

    Improved Representation Learning Through Tensorized Autoencoders

    Authors: Pascal Mattia Esser, Satyaki Mukherjee, Mahalakshmi Sabanayagam, Debarghya Ghoshdastidar

    Abstract: The central question in representation learning is what constitutes a good or meaningful representation. In this work we argue that if we consider data with inherent cluster structures, where clusters can be characterized through different means and covariances, those data structures should be represented in the embedding as well. While Autoencoders (AE) are widely used in practice for unsupervise… ▽ More

    Submitted 2 December, 2022; originally announced December 2022.

  9. arXiv:2211.16459  [pdf, other

    cs.LG stat.ML

    A Revenue Function for Comparison-Based Hierarchical Clustering

    Authors: Aishik Mandal, Michaël Perrot, Debarghya Ghoshdastidar

    Abstract: Comparison-based learning addresses the problem of learning when, instead of explicit features or pairwise similarities, one only has access to comparisons of the form: \emph{Object $A$ is more similar to $B$ than to $C$.} Recently, it has been shown that, in Hierarchical Clustering, single and complete linkage can be directly implemented using only such comparisons while several algorithms have b… ▽ More

    Submitted 2 April, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: 26 pages, 6 figures, 5 tables. Transactions on Machine Learning Research (2023)

  10. arXiv:2211.01903  [pdf, ps, other

    stat.ML cs.LG

    A Consistent Estimator for Confounding Strength

    Authors: Luca Rendsburg, Leena Chennuru Vankadara, Debarghya Ghoshdastidar, Ulrike von Luxburg

    Abstract: Regression on observational data can fail to capture a causal relationship in the presence of unobserved confounding. Confounding strength measures this mismatch, but estimating it requires itself additional assumptions. A common assumption is the independence of causal mechanisms, which relies on concentration phenomena in high dimensions. While high dimensions enable the estimation of confoundin… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

    Comments: 21 pages

  11. arXiv:2210.09809  [pdf, other

    cs.LG

    Analysis of Convolutions, Non-linearity and Depth in Graph Neural Networks using Neural Tangent Kernel

    Authors: Mahalakshmi Sabanayagam, Pascal Esser, Debarghya Ghoshdastidar

    Abstract: The fundamental principle of Graph Neural Networks (GNNs) is to exploit the structural information of the data by aggregating the neighboring nodes using a `graph convolution' in conjunction with a suitable choice for the network architecture, such as depth and activation functions. Therefore, understanding the influence of each of the design choice on the network performance is crucial. Convoluti… ▽ More

    Submitted 31 October, 2023; v1 submitted 18 October, 2022; originally announced October 2022.

    Comments: 39 pages, 24 figures. Code available at https://github.com/mahalakshmi-sabanayagam/NTK_GCN

  12. arXiv:2202.09054  [pdf, other

    stat.ML cs.LG

    Interpolation and Regularization for Causal Learning

    Authors: Leena Chennuru Vankadara, Luca Rendsburg, Ulrike von Luxburg, Debarghya Ghoshdastidar

    Abstract: We study the problem of learning causal models from observational data through the lens of interpolation and its counterpart -- regularization. A large volume of recent theoretical, as well as empirical work, suggests that, in highly complex model classes, interpolating estimators can have good statistical generalization properties and can even be optimal for statistical learning. Motivated by an… ▽ More

    Submitted 18 February, 2022; originally announced February 2022.

  13. arXiv:2112.03968  [pdf, other

    cs.LG stat.ML

    Learning Theory Can (Sometimes) Explain Generalisation in Graph Neural Networks

    Authors: Pascal Mattia Esser, Leena Chennuru Vankadara, Debarghya Ghoshdastidar

    Abstract: In recent years, several results in the supervised learning setting suggested that classical statistical learning-theoretic measures, such as VC dimension, do not adequately explain the performance of deep learning models which prompted a slew of work in the infinite-width and iteration regimes. However, there is little theoretical explanation for the success of neural networks beyond the supervis… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

    Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  14. arXiv:2111.09831  [pdf, other

    stat.ML cs.LG

    Causal Forecasting:Generalization Bounds for Autoregressive Models

    Authors: Leena Chennuru Vankadara, Philipp Michael Faller, Michaela Hardt, Lenon Minorics, Debarghya Ghoshdastidar, Dominik Janzing

    Abstract: Despite the increasing relevance of forecasting methods, causal implications of these algorithms remain largely unexplored. This is concerning considering that, even under simplifying assumptions such as causal sufficiency, the statistical risk of a model can differ significantly from its \textit{causal risk}. Here, we study the problem of \textit{causal generalization} -- generalizing from the ob… ▽ More

    Submitted 8 September, 2022; v1 submitted 18 November, 2021; originally announced November 2021.

  15. arXiv:2110.09476  [pdf, other

    cs.LG stat.ML

    Recovery Guarantees for Kernel-based Clustering under Non-parametric Mixture Models

    Authors: Leena Chennuru Vankadara, Sebastian Bordt, Ulrike von Luxburg, Debarghya Ghoshdastidar

    Abstract: Despite the ubiquity of kernel-based clustering, surprisingly few statistical guarantees exist beyond settings that consider strong structural assumptions on the data generation process. In this work, we take a step towards bridging this gap by studying the statistical performance of kernel-based clustering algorithms under non-parametric mixture models. We provide necessary and sufficient separab… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

  16. arXiv:2110.04060  [pdf, other

    cs.LG stat.ML

    New Insights into Graph Convolutional Networks using Neural Tangent Kernels

    Authors: Mahalakshmi Sabanayagam, Pascal Esser, Debarghya Ghoshdastidar

    Abstract: Graph Convolutional Networks (GCNs) have emerged as powerful tools for learning on network structured data. Although empirically successful, GCNs exhibit certain behaviour that has no rigorous explanation -- for instance, the performance of GCNs significantly degrades with increasing network depth, whereas it improves marginally with depth using skip connections. This paper focuses on semi-supervi… ▽ More

    Submitted 4 November, 2023; v1 submitted 8 October, 2021; originally announced October 2021.

  17. arXiv:2110.02722  [pdf, other

    cs.LG stat.ML

    Graphon based Clustering and Testing of Networks: Algorithms and Theory

    Authors: Mahalakshmi Sabanayagam, Leena Chennuru Vankadara, Debarghya Ghoshdastidar

    Abstract: Network-valued data are encountered in a wide range of applications and pose challenges in learning due to their complex structure and absence of vertex correspondence. Typical examples of such problems include classification or grou** of protein structures and social networks. Various methods, ranging from graph kernels to graph neural networks, have been proposed that achieve some success in g… ▽ More

    Submitted 7 November, 2021; v1 submitted 6 October, 2021; originally announced October 2021.

  18. arXiv:2108.02789  [pdf, other

    astro-ph.CO astro-ph.HE astro-ph.IM

    HOLISMOKES -- VII. Time-delay measurement of strongly lensed Type Ia supernovae using machine learning

    Authors: S. Huber, S. H. Suyu, D. Ghoshdastidar, S. Taubenberger, V. Bonvin, J. H. H. Chan, M. Kromer, U. M. Noebauer, S. A. Sim, L. Leal-Taixé

    Abstract: The Hubble constant ($H_0$) is one of the fundamental parameters in cosmology, but there is a heated debate around the $>$4$σ$ tension between the local Cepheid distance ladder and the early Universe measurements. Strongly lensed Type Ia supernovae (LSNe Ia) are an independent and direct way to measure $H_0$, where a time-delay measurement between the multiple supernova (SN) images is required. In… ▽ More

    Submitted 21 December, 2021; v1 submitted 5 August, 2021; originally announced August 2021.

    Comments: 25 pages, 28 figures; accepted for publication in A&A

    Journal ref: A&A 658, A157 (2022)

  19. arXiv:2010.03918  [pdf, other

    cs.LG stat.ML

    Near-Optimal Comparison Based Clustering

    Authors: Michaël Perrot, Pascal Mattia Esser, Debarghya Ghoshdastidar

    Abstract: The goal of clustering is to group similar objects into meaningful partitions. This process is well understood when an explicit similarity measure between the objects is given. However, far less is known when this information is not readily available and, instead, one only observes ordinal comparisons such as "object i is more similar to j than to k." In this paper, we tackle this problem using a… ▽ More

    Submitted 9 October, 2020; v1 submitted 8 October, 2020; originally announced October 2020.

    Comments: To appear in NeurIPS 2020

  20. arXiv:1912.00458  [pdf, other

    stat.ML cs.LG

    On the optimality of kernels for high-dimensional clustering

    Authors: Leena Chennuru Vankadara, Debarghya Ghoshdastidar

    Abstract: This paper studies the optimality of kernel methods in high-dimensional data clustering. Recent works have studied the large sample performance of kernel clustering in the high-dimensional regime, where Euclidean distance becomes less informative. However, it is unknown whether popular methods, such as kernel k-means, are optimal in this regime. We consider the problem of high-dimensional Gaussian… ▽ More

    Submitted 1 December, 2019; originally announced December 2019.

  21. arXiv:1811.12752  [pdf, ps, other

    stat.ML cs.LG

    Practical methods for graph two-sample testing

    Authors: Debarghya Ghoshdastidar, Ulrike von Luxburg

    Abstract: Hypothesis testing for graphs has been an important tool in applied research fields for more than two decades, and still remains a challenging problem as one often needs to draw inference from few replicates of large graphs. Recent studies in statistics and learning theory have provided some theoretical insights about such high-dimensional graph testing problems, but the practicality of the develo… ▽ More

    Submitted 30 November, 2018; originally announced November 2018.

    Comments: To appear in Neural Information Processing Systems 2018

  22. arXiv:1811.00928  [pdf, other

    stat.ML cs.LG

    Foundations of Comparison-Based Hierarchical Clustering

    Authors: Debarghya Ghoshdastidar, Michaël Perrot, Ulrike von Luxburg

    Abstract: We address the classical problem of hierarchical clustering, but in a framework where one does not have access to a representation of the objects or their pairwise similarities. Instead, we assume that only a set of comparisons between objects is available, that is, statements of the form "objects $i$ and $j$ are more similar than objects $k$ and $l$." Such a scenario is commonly encountered in cr… ▽ More

    Submitted 12 June, 2019; v1 submitted 2 November, 2018; originally announced November 2018.

    Comments: 26 pages

  23. arXiv:1807.02821  [pdf, ps, other

    physics.bio-ph cond-mat.soft q-bio.BM

    Ionic liquids make DNA rigid

    Authors: Ashok Garai, Debostuti Ghoshdastidar, Sanjib Senapati, Prabal K. Maiti

    Abstract: Persistence length of dsDNA is known to decrease with increase in ionic concentration of the solution. In contrast to this, here we show that persistence length of dsDNA increases dramatically as a function of ionic liquid (IL) concentration. Using all atomic explicit solvent molecular dynamics simulations and theoretical models we present, for the first time, a systematic study to determine the m… ▽ More

    Submitted 8 July, 2018; originally announced July 2018.

    Comments: 16 pages, 3 figures, Supplementary Information (Accepted for publication in the Journal of Chemical Physics, AIP (USA))

  24. arXiv:1707.00833  [pdf, ps, other

    stat.ME stat.ML

    Two-sample Hypothesis Testing for Inhomogeneous Random Graphs

    Authors: Debarghya Ghoshdastidar, Maurilio Gutzeit, Alexandra Carpentier, Ulrike von Luxburg

    Abstract: The study of networks leads to a wide range of high dimensional inference problems. In many practical applications, one needs to draw inference from one or few large sparse networks. The present paper studies hypothesis testing of graphs in this high-dimensional regime, where the goal is to test between two populations of inhomogeneous random graphs defined on the same set of $n$ vertices. The siz… ▽ More

    Submitted 17 July, 2019; v1 submitted 4 July, 2017; originally announced July 2017.

    Comments: To appear in the Annals of Statistics. This 54-page version includes the supplementary material (appendix to the main paper)

    MSC Class: 62H15; 62C20; 05C80; 60B20

    Journal ref: Ann. Statist. Volume 48, Number 4 (2020), 2208-2229

  25. arXiv:1705.06168  [pdf, ps, other

    stat.ME stat.ML

    Two-Sample Tests for Large Random Graphs Using Network Statistics

    Authors: Debarghya Ghoshdastidar, Maurilio Gutzeit, Alexandra Carpentier, Ulrike von Luxburg

    Abstract: We consider a two-sample hypothesis testing problem, where the distributions are defined on the space of undirected graphs, and one has access to only one observation from each model. A motivating example for this problem is comparing the friendship networks on Facebook and LinkedIn. The practical approach to such problems is to compare the networks based on certain network statistics. In this pap… ▽ More

    Submitted 26 May, 2017; v1 submitted 17 May, 2017; originally announced May 2017.

    Comments: To be presented in COLT 2017 (author sequence, funding details and minor typos updated in version 2)

  26. arXiv:1704.01460  [pdf, other

    stat.ML cs.DS cs.LG

    Comparison Based Nearest Neighbor Search

    Authors: Siavash Haghiri, Debarghya Ghoshdastidar, Ulrike von Luxburg

    Abstract: We consider machine learning in a comparison-based setting where we are given a set of points in a metric space, but we have no access to the actual distances between the points. Instead, we can only ask an oracle whether the distance between two points $i$ and $j$ is smaller than the distance between the points $i$ and $k$. We are concerned with data structures and algorithms to find nearest neig… ▽ More

    Submitted 5 April, 2017; originally announced April 2017.

    Comments: 16 Pages, 3 Figures

  27. arXiv:1602.06516  [pdf, other

    cs.LG stat.ML

    Uniform Hypergraph Partitioning: Provable Tensor Methods and Sampling Techniques

    Authors: Debarghya Ghoshdastidar, Ambedkar Dukkipati

    Abstract: In a series of recent works, we have generalised the consistency results in the stochastic block model literature to the case of uniform and non-uniform hypergraphs. The present paper continues the same line of study, where we focus on partitioning weighted uniform hypergraphs---a problem often encountered in computer vision. This work is motivated by two issues that arise when a hypergraph partit… ▽ More

    Submitted 17 May, 2017; v1 submitted 21 February, 2016; originally announced February 2016.

    Comments: To appear in Journal of Machine Learning Research (vol 18)

  28. arXiv:1507.00763  [pdf, ps, other

    math.CO

    Coloring Random Non-Uniform Bipartite Hypergraphs

    Authors: Debarghya Ghoshdastidar, Ambedkar Dukkipati

    Abstract: Let $H_{n,(p_m)_{m=2,\ldots,M}}$ be a random non-uniform hypergraph of dimension $M$ on $2n$ vertices, where the vertices are split into two disjoint sets of size $n$, and colored by two distinct colors. Each non-monochromatic edge of size $m=2,\ldots,M$ is independently added with probability $p_m$. We show that if $p_2,\ldots,p_M$ are such that the expected number of edges in the hypergraph is a… ▽ More

    Submitted 17 November, 2015; v1 submitted 2 July, 2015; originally announced July 2015.

    Comments: 15 pages

  29. Consistency of Spectral Hypergraph Partitioning under Planted Partition Model

    Authors: Debarghya Ghoshdastidar, Ambedkar Dukkipati

    Abstract: Hypergraph partitioning lies at the heart of a number of problems in machine learning and network sciences. Many algorithms for hypergraph partitioning have been proposed that extend standard approaches for graph partitioning to the case of hypergraphs. However, theoretical aspects of such methods have seldom received attention in the literature as compared to the extensive studies on the guarante… ▽ More

    Submitted 3 February, 2016; v1 submitted 7 May, 2015; originally announced May 2015.

    Comments: 35 pages, 2 figures, 1 table

    Journal ref: Ann. Statist. Volume 45, Number 1 (2017), 289-315

  30. Spectral Clustering with Jensen-type kernels and their multi-point extensions

    Authors: Debarghya Ghoshdastidar, Ambedkar Dukkipati, Ajay P. Adsul, Aparna S. Vijayan

    Abstract: Motivated by multi-distribution divergences, which originate in information theory, we propose a notion of `multi-point' kernels, and study their applications. We study a class of kernels based on Jensen type divergences and show that these can be extended to measure similarity among multiple points. We study tensor flattening methods and develop a multi-point (kernel) spectral clustering (MSC) me… ▽ More

    Submitted 18 March, 2014; originally announced March 2014.

    Comments: To appear in IEEE Computer Society Conference on Computer Vision and Pattern Recognition

  31. Newton based Stochastic Optimization using q-Gaussian Smoothed Functional Algorithms

    Authors: Debarghya Ghoshdastidar, Ambedkar Dukkipati, Shalabh Bhatnagar

    Abstract: We present the first q-Gaussian smoothed functional (SF) estimator of the Hessian and the first Newton-based stochastic optimization algorithm that estimates both the Hessian and the gradient of the objective function using q-Gaussian perturbations. Our algorithm requires only two system simulations (regardless of the parameter dimension) and estimates both the gradient and the Hessian at each upd… ▽ More

    Submitted 19 May, 2014; v1 submitted 10 November, 2013; originally announced November 2013.

    Comments: This is a longer of version of the paper with the same title accepted in Automatica

  32. arXiv:1206.4832  [pdf, other

    cs.IT cs.LG stat.ME

    Smoothed Functional Algorithms for Stochastic Optimization using q-Gaussian Distributions

    Authors: Debarghya Ghoshdastidar, Ambedkar Dukkipati, Shalabh Bhatnagar

    Abstract: Smoothed functional (SF) schemes for gradient estimation are known to be efficient in stochastic optimization algorithms, specially when the objective is to improve the performance of a stochastic system. However, the performance of these methods depends on several parameters, such as the choice of a suitable smoothing kernel. Different kernels have been studied in literature, which include Gaussi… ▽ More

    Submitted 3 July, 2014; v1 submitted 21 June, 2012; originally announced June 2012.

    ACM Class: G.1.6; I.6.8

  33. arXiv:1205.0651  [pdf, ps, other

    cs.IT cs.LG

    Generative Maximum Entropy Learning for Multiclass Classification

    Authors: Ambedkar Dukkipati, Gaurav Pandey, Debarghya Ghoshdastidar, Paramita Koley, D. M. V. Satya Sriram

    Abstract: Maximum entropy approach to classification is very well studied in applied statistics and machine learning and almost all the methods that exists in literature are discriminative in nature. In this paper, we introduce a maximum entropy classification method with feature selection for large dimensional data such as text datasets that is generative in nature. To tackle the curse of dimensionality of… ▽ More

    Submitted 30 December, 2013; v1 submitted 3 May, 2012; originally announced May 2012.

  34. arXiv:1204.1800  [pdf, other

    cs.LG cs.IT stat.ML

    On Power-law Kernels, corresponding Reproducing Kernel Hilbert Space and Applications

    Authors: Debarghya Ghoshdastidar, Ambedkar Dukkipati

    Abstract: The role of kernels is central to machine learning. Motivated by the importance of power-law distributions in statistical modeling, in this paper, we propose the notion of power-law kernels to investigate power-laws in learning problem. We propose two power-law kernels by generalizing Gaussian and Laplacian kernels. This generalization is based on distributions, arising out of maximization of a ge… ▽ More

    Submitted 1 April, 2013; v1 submitted 9 April, 2012; originally announced April 2012.

    Comments: 7 pages, 3 figures, 3 tables

  35. q-Gaussian based Smoothed Functional Algorithm for Stochastic Optimization

    Authors: Debarghya Ghoshdastidar, Ambedkar Dukkipati, Shalabh Bhatnagar

    Abstract: The q-Gaussian distribution results from maximizing certain generalizations of Shannon entropy under some constraints. The importance of q-Gaussian distributions stems from the fact that they exhibit power-law behavior, and also generalize Gaussian distributions. In this paper, we propose a Smoothed Functional (SF) scheme for gradient estimation using q-Gaussian distribution, and also propose an a… ▽ More

    Submitted 16 June, 2012; v1 submitted 25 February, 2012; originally announced February 2012.

    Comments: 5 pages, 1 figure