Skip to main content

Showing 1–13 of 13 results for author: Mukherjee, S S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.03437  [pdf, other

    cs.LG

    Transfer Learning for Latent Variable Network Models

    Authors: Akhil Jalan, Arya Mazumdar, Soumendu Sundar Mukherjee, Purnamrita Sarkar

    Abstract: We study transfer learning for estimation in latent variable network models. In our setting, the conditional edge probability matrices given the latent variables are represented by $P$ for the source and $Q$ for the target. We wish to estimate $Q$ given two kinds of data: (1) edge data from a subgraph induced by an $o(1)$ fraction of the nodes of $Q$, and (2) edge data from all of $P$. If the sour… ▽ More

    Submitted 6 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

  2. arXiv:2402.17595  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Implicit Regularization via Spectral Neural Networks and Non-linear Matrix Sensing

    Authors: Hong T. M. Chu, Subhro Ghosh, Chi Thanh Lam, Soumendu Sundar Mukherjee

    Abstract: The phenomenon of implicit regularization has attracted interest in recent years as a fundamental aspect of the remarkable generalizing ability of neural networks. In a nutshell, it entails that gradient descent dynamics in many neural nets, even without any explicit regularizer in the loss function, converges to the solution of a regularized learning problem. However, known results attempting to… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  3. arXiv:2312.07839  [pdf, ps, other

    math.ST cs.LG math.PR stat.ML

    Minimax-optimal estimation for sparse multi-reference alignment with collision-free signals

    Authors: Subhro Ghosh, Soumendu Sundar Mukherjee, **g Bin Pan

    Abstract: The Multi-Reference Alignment (MRA) problem aims at the recovery of an unknown signal from repeated observations under the latent action of a group of cyclic isometries, in the presence of additive noise of high intensity $σ$. It is a more tractable version of the celebrated cryo EM model. In the crucial high noise regime, it is known that its sample complexity scales as $σ^6$. Recent investigatio… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  4. arXiv:2308.02344  [pdf, ps, other

    math.ST cs.LG stat.CO stat.ME stat.ML

    Learning Networks from Gaussian Graphical Models and Gaussian Free Fields

    Authors: Subhro Ghosh, Soumendu Sundar Mukherjee, Hoang-Son Tran, Ujan Gangopadhyay

    Abstract: We investigate the problem of estimating the structure of a weighted network from repeated measurements of a Gaussian Graphical Model (GGM) on the network. In this vein, we consider GGMs whose covariance structures align with the geometry of the weighted network on which they are based. Such GGMs have been of longstanding interest in statistical physics, and are referred to as the Gaussian Free Fi… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

  5. arXiv:2307.12982  [pdf, other

    math.ST cs.IT stat.ME stat.ML

    Consistent model selection in the spiked Wigner model via AIC-type criteria

    Authors: Soumendu Sundar Mukherjee

    Abstract: Consider the spiked Wigner model \[ X = \sum_{i = 1}^k λ_i u_i u_i^\top + σG, \] where $G$ is an $N \times N$ GOE random matrix, and the eigenvalues $λ_i$ are all spiked, i.e. above the Baik-Ben Arous-Péché (BBP) threshold $σ$. We consider AIC-type model selection criteria of the form \[ -2 \, (\text{maximised log-likelihood}) + γ\, (\text{number of parameters}) \] for estimating the number $k$ of… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: 14 pages, 1 figure, 3 tables

  6. arXiv:2302.12693  [pdf, ps, other

    cs.LG math.ST stat.ML

    Wasserstein Projection Pursuit of Non-Gaussian Signals

    Authors: Satyaki Mukherjee, Soumendu Sundar Mukherjee, Debarghya Ghoshdastidar

    Abstract: We consider the general dimensionality reduction problem of locating in a high-dimensional data cloud, a $k$-dimensional non-Gaussian subspace of interesting features. We use a projection pursuit approach -- we search for mutually orthogonal unit directions which maximise the 2-Wasserstein distance of the empirical distribution of data-projections along these directions from a standard Gaussian. U… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

  7. arXiv:2207.00118  [pdf, other

    cs.LG cs.AI cs.CV

    ProSelfLC: Progressive Self Label Correction Towards A Low-Temperature Entropy State

    Authors: Xinshao Wang, Yang Hua, Elyor Kodirov, Sankha Subhra Mukherjee, David A. Clifton, Neil M. Robertson

    Abstract: There is a family of label modification approaches including self and non-self label correction (LC), and output regularisation. They are widely used for training robust deep neural networks (DNNs), but have not been mathematically and thoroughly analysed together. We study them and discover three key issues: (1) We are more interested in adopting Self LC as it leverages its own knowledge and requ… ▽ More

    Submitted 6 September, 2022; v1 submitted 30 June, 2022; originally announced July 2022.

    Comments: To ease the reading, a summary of changes is put in the beginning. Our source code is available at https://github.com/XinshaoAmosWang/ProSelfLC-AT

  8. arXiv:2201.08326  [pdf, other

    stat.ME cs.LG econ.EM math.ST stat.CO stat.ML

    Learning with latent group sparsity via heat flow dynamics on networks

    Authors: Subhroshekhar Ghosh, Soumendu Sundar Mukherjee

    Abstract: Group or cluster structure on explanatory variables in machine learning problems is a very general phenomenon, which has attracted broad interest from practitioners and theoreticians alike. In this work we contribute an approach to learning under such group structure, that does not require prior information on the group identities. Our paradigm is motivated by the Laplacian geometry of an underlyi… ▽ More

    Submitted 20 January, 2022; originally announced January 2022.

    Comments: 36 pages, 3 figures, 3 tables

  9. arXiv:2112.00827  [pdf, other

    cs.CL cs.IR cs.LG stat.ME stat.ML

    Changepoint Analysis of Topic Proportions in Temporal Text Data

    Authors: Avinandan Bose, Soumendu Sundar Mukherjee

    Abstract: Changepoint analysis deals with unsupervised detection and/or estimation of time-points in time-series data, when the distribution generating the data changes. In this article, we consider \emph{offline} changepoint detection in the context of large scale textual data. We build a specialised temporal topic model with provisions for changepoints in the distribution of topic proportions. As full lik… ▽ More

    Submitted 29 November, 2021; originally announced December 2021.

    Comments: 32 pages, 9 figures

  10. arXiv:2009.02112  [pdf, ps, other

    stat.ME cs.SI

    Consistent detection and optimal localization of all detectable change points in piecewise stationary arbitrarily sparse network-sequences

    Authors: Sharmodeep Bhattacharyya, Shirshendu Chatterjee, Soumendu Sundar Mukherjee

    Abstract: We consider the offline change point detection and localization problem in the context of piecewise stationary networks, where the observable is a finite sequence of networks. We develop algorithms involving some suitably modified CUSUM statistics based on adaptively trimmed adjacency matrices of the observed networks for both detection and localization of single or multiple change points present… ▽ More

    Submitted 4 September, 2020; originally announced September 2020.

    Comments: 24 pages

    MSC Class: 62H30; 62F12

  11. arXiv:1906.00494  [pdf, other

    stat.ML cs.LG stat.ME

    Graphon Estimation from Partially Observed Network Data

    Authors: Soumendu Sundar Mukherjee, Sayak Chakrabarti

    Abstract: We consider estimating the edge-probability matrix of a network generated from a graphon model when the full network is not observed---only some overlap** subgraphs are. We extend the neighbourhood smoothing (NBS) algorithm of Zhang et al. (2017) to this missing-data set-up and show experimentally that, for a wide range of graphons, the extended NBS algorithm achieves significantly smaller error… ▽ More

    Submitted 27 June, 2019; v1 submitted 2 June, 2019; originally announced June 2019.

    Comments: 12 pages, 7 figures, 1 table

  12. arXiv:1901.00109  [pdf, other

    cs.LG cs.CV cs.NE stat.ML

    Morphological Network: How Far Can We Go with Morphological Neurons?

    Authors: Ranjan Mondal, Sanchayan Santra, Soumendu Sundar Mukherjee, Bhabatosh Chanda

    Abstract: Morphological neurons, that is morphological operators such as dilation and erosion with learnable structuring elements, have intrigued researchers for quite some time because of the power these operators bring to the table despite their simplicity. These operators are known to be powerful nonlinear tools, but for a given problem coming up with a sequence of operations and their structuring elemen… ▽ More

    Submitted 13 December, 2022; v1 submitted 1 January, 2019; originally announced January 2019.

    Comments: Accepted at BMVC 2022

  13. IEGAN: Multi-purpose Perceptual Quality Image Enhancement Using Generative Adversarial Network

    Authors: Soumya Shubhra Ghosh, Yang Hua, Sankha Subhra Mukherjee, Neil Robertson

    Abstract: Despite the breakthroughs in quality of image enhancement, an end-to-end solution for simultaneous recovery of the finer texture details and sharpness for degraded images with low resolution is still unsolved. Some existing approaches focus on minimizing the pixel-wise reconstruction error which results in a high peak signal-to-noise ratio. The enhanced images fail to provide high-frequency detail… ▽ More

    Submitted 22 November, 2018; originally announced November 2018.

    Comments: Accepted at IEEE WACV 2019