Skip to main content

Showing 1–32 of 32 results for author: Ghosal, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2407.00797  [pdf, other

    stat.ME

    A placement-value based approach to concave ROC analysis

    Authors: Soutik Ghosal, Zhen Chen

    Abstract: The receiver operating characteristic (ROC) curve is an important graphic tool for evaluating a test in a wide range of disciplines. While useful, an ROC curve can cross the chance line, either by having an S-shape or a hook at the extreme specificity. These non-concave ROC curves are sub-optimal according to decision theory, as there are points that are superior than those corresponding to the po… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 18 pages, 6 figures, 2 tables

  2. arXiv:2406.13938  [pdf, other

    stat.ME

    Coverage of Credible Sets for Regression under Variable Selection

    Authors: Samhita Pal, Subhashis Ghosal

    Abstract: We study the asymptotic frequentist coverage of credible sets based on a novel Bayesian approach for a multiple linear regression model under variable selection. We initially ignore the issue of variable selection, which allows us to put a conjugate normal prior on the coefficient vector. The variable selection step is incorporated directly in the posterior through a sparsity-inducing map and uses… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  3. arXiv:2404.13284  [pdf, other

    stat.ME

    Impact of methodological assumptions and covariates on the cutoff estimation in ROC analysis

    Authors: Soutik Ghosal

    Abstract: The Receiver Operating Characteristic (ROC) curve stands as a cornerstone in assessing the efficacy of biomarkers for disease diagnosis. Beyond merely evaluating performance, it provides with an optimal cutoff for biomarker values, crucial for disease categorization. While diverse methodologies exist for threshold estimation, less attention has been paid to integrating covariate impact into this p… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

  4. arXiv:2403.04915  [pdf, other

    stat.ME math.ST

    Bayesian Inference for High-dimensional Time Series by Latent Process Modeling

    Authors: Arkaprava Roy, Anindya Roy, Subhashis Ghosal

    Abstract: Time series data arising in many applications nowadays are high-dimensional. A large number of parameters describe features of these time series. We propose a novel approach to modeling a high-dimensional time series through several independent univariate time series, which are then orthogonally rotated and sparsely linearly transformed. With this approach, any specified intrinsic relations among… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  5. arXiv:2306.05202  [pdf, ps, other

    math.ST stat.ME

    Bayesian Inference for Multivariate Monotone Densities

    Authors: Kang Wang, Subhashis Ghosal

    Abstract: We consider a nonparametric Bayesian approach to estimation and testing for a multivariate monotone density. Instead of following the conventional Bayesian route of putting a prior distribution complying with the monotonicity restriction, we put a prior on the step heights through binning and a Dirichlet distribution. An arbitrary piece-wise constant probability density is converted to a monotone… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

  6. arXiv:2306.05173  [pdf, other

    math.ST stat.ME

    Bayesian Inference for $k$-Monotone Densities with Applications to Multiple Testing

    Authors: Kang Wang, Subhashis Ghosal

    Abstract: Shape restriction, like monotonicity or convexity, imposed on a function of interest, such as a regression or density function, allows for its estimation without smoothness assumptions. The concept of $k$-monotonicity encompasses a family of shape restrictions, including decreasing and convex decreasing as special cases corresponding to $k=1$ and $k=2$. We consider Bayesian approaches to estimate… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

  7. arXiv:2104.10335  [pdf, other

    stat.ME math.ST

    Optimal Bayesian Smoothing of Functional Observations over a Large Graph

    Authors: Arkaprava Roy, Shubhashis Ghosal

    Abstract: In modern contexts, some types of data are observed in high-resolution, essentially continuously in time. Such data units are best described as taking values in a space of functions. Subject units carrying the observations may have intrinsic relations among themselves, and are best described by the nodes of a large graph. It is often sensible to think that the underlying signals in these functiona… ▽ More

    Submitted 19 July, 2021; v1 submitted 20 April, 2021; originally announced April 2021.

  8. Interpretable and synergistic deep learning for visual explanation and statistical estimations of segmentation of disease features from medical images

    Authors: Sambuddha Ghosal, Pratik Shah

    Abstract: Deep learning (DL) models for disease classification or segmentation from medical images are increasingly trained using transfer learning (TL) from unrelated natural world images. However, shortcomings and utility of TL for specialized tasks in the medical imaging domain remain unknown and are based on assumptions that increasing training data will improve performance. We report detailed compariso… ▽ More

    Submitted 11 November, 2020; originally announced November 2020.

    Journal ref: Published in Cell Reports Methods 1, 100107, 2021. "A deep-learning toolkit for visualization and interpretation of segmented medical images"

  9. arXiv:2007.00797  [pdf, other

    stat.ME

    Bayesian Multivariate Quantile Regression Using Dependent Dirichlet Process Prior

    Authors: Indrabati Bhattacharya, Subhashis Ghosal

    Abstract: In this article, we consider a non-parametric Bayesian approach to multivariate quantile regression. The collection of related conditional distributions of a response vector Y given a univariate covariate X is modeled using a Dependent Dirichlet Process (DDP) prior. The DDP is used to introduce dependence across x. As the realizations from a Dirichlet process prior are almost surely discrete, we n… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

  10. arXiv:2001.03798  [pdf, ps, other

    stat.ML cs.LG stat.AP

    Bayesian Semi-supervised learning under nonparanormality

    Authors: Rui Zhu, Subhashis Ghosal

    Abstract: Semi-supervised learning is a classification method which makes use of both labeled data and unlabeled data for training. In this paper, we propose a semi-supervised learning algorithm using a Bayesian semi-supervised model. We make a general assumption that the observations will follow two multivariate normal distributions depending on their true labels after the same unknown transformation. We u… ▽ More

    Submitted 11 January, 2020; originally announced January 2020.

  11. arXiv:1911.04699  [pdf, other

    cs.LG stat.ML

    Deep Generative Models Strike Back! Improving Understanding and Evaluation in Light of Unmet Expectations for OoD Data

    Authors: John Just, Sambuddha Ghosal

    Abstract: Advances in deep generative and density models have shown impressive capacity to model complex probability density functions in lower-dimensional space. Also, applying such models to high-dimensional image data to model the PDF has shown poor generalization, with out-of-distribution data being assigned equal or higher likelihood than in-sample data. Methods to deal with this have been proposed tha… ▽ More

    Submitted 12 November, 2019; originally announced November 2019.

  12. arXiv:1906.01626  [pdf, other

    cs.LG eess.IV stat.ML

    Encoding Invariances in Deep Generative Models

    Authors: Viraj Shah, Ameya Joshi, Sambuddha Ghosal, Balaji Pokuri, Soumik Sarkar, Baskar Ganapathysubramanian, Chinmay Hegde

    Abstract: Reliable training of generative adversarial networks (GANs) typically require massive datasets in order to model complicated distributions. However, in several applications, training samples obey invariances that are \textit{a priori} known; for example, in complex physics simulations, the training data obey universal laws encoded as well-defined mathematical equations. In this paper, we propose a… ▽ More

    Submitted 4 June, 2019; originally announced June 2019.

  13. arXiv:1812.04442  [pdf, ps, other

    stat.ME

    Regression-Based Bayesian Estimation and Structure Learning for Nonparanormal Graphical Models

    Authors: Jami J. Mulgrave, Subhashis Ghosal

    Abstract: A nonparanormal graphical model is a semiparametric generalization of a Gaussian graphical model for continuous variables in which it is assumed that the variables follow a Gaussian graphical model only after some unknown smooth monotone transformations. We consider a Bayesian approach to inference in a nonparanormal graphical model in which we put priors on the unknown transformations through a r… ▽ More

    Submitted 20 February, 2021; v1 submitted 8 December, 2018; originally announced December 2018.

    Comments: arXiv admin note: text overlap with arXiv:1812.02884

  14. arXiv:1812.02884  [pdf, other

    stat.ME

    Bayesian Analysis of Nonparanormal Graphical Models Using Rank-Likelihood

    Authors: Jami J. Mulgrave, Subhashis Ghosal

    Abstract: Gaussian graphical models, where it is assumed that the variables of interest jointly follow a multivariate normal distribution with a sparse precision matrix, have been used to study intrinsic dependence among variables, but the normality assumption may be restrictive in many settings. A nonparanormal graphical model is a semiparametric generalization of a Gaussian graphical model for continuous… ▽ More

    Submitted 16 April, 2019; v1 submitted 6 December, 2018; originally announced December 2018.

    Comments: arXiv admin note: text overlap with arXiv:1812.04442

  15. arXiv:1811.06067  [pdf, other

    cs.LG cs.CV stat.ML

    Interpretable deep learning for guided structure-property explorations in photovoltaics

    Authors: Balaji Sesha Sarath Pokuri, Sambuddha Ghosal, Apurva Kokate, Baskar Ganapathysubramanian, Soumik Sarkar

    Abstract: The performance of an organic photovoltaic device is intricately connected to its active layer morphology. This connection between the active layer and device performance is very expensive to evaluate, either experimentally or computationally. Hence, designing morphologies to achieve higher performances is non-trivial and often intractable. To solve this, we first introduce a deep convolutional ne… ▽ More

    Submitted 11 December, 2018; v1 submitted 14 November, 2018; originally announced November 2018.

    Comments: Workshop on Machine Learning for Molecules and Materials (MLMM), Neural Information Processing Systems (NeurIPS) 2018, Montreal, Canada

    Journal ref: npj Comput Mater 5, 95 (2019)

  16. arXiv:1808.01236  [pdf, other

    stat.ME

    Bayesian Change Point Detection for Functional Data

    Authors: Xiuqi Li, Subhashis Ghosal

    Abstract: We propose a Bayesian method to detect change points for functional data. We extract the features of a sequence of functional data by the discrete wavelet transform (DWT), and treat each sequence of feature independently. We believe there is potentially a change in each feature at possibly different time points. The functional data evolves through such changes throughout the sequences of observati… ▽ More

    Submitted 3 August, 2018; originally announced August 2018.

    Comments: 22 pages, 9 figures

  17. arXiv:1808.00662  [pdf, other

    stat.ME

    Bayesian Classification of Multiclass Functional Data

    Authors: Xiuqi Li, Subhashis Ghosal

    Abstract: We propose a Bayesian approach to estimating parameters in multiclass functional models. Unordered multinomial probit, ordered multinomial probit and multinomial logistic models are considered. We use finite random series priors based on a suitable basis such as B-splines in these three multinomial models, and classify the functional data using the Bayes rule. We average over models based on the m… ▽ More

    Submitted 2 August, 2018; originally announced August 2018.

    Comments: 26 pages, 2 figures

  18. Bayesian Inference in Nonparanormal Graphical Models

    Authors: Jami J. Mulgrave, Subhashis Ghosal

    Abstract: Gaussian graphical models have been used to study intrinsic dependence among several variables, but the Gaussianity assumption may be restrictive in many applications. A nonparanormal graphical model is a semiparametric generalization for continuous variables where it is assumed that the variables follow a Gaussian graphical model only after some unknown smooth monotone transformations on each of… ▽ More

    Submitted 11 April, 2019; v1 submitted 12 June, 2018; originally announced June 2018.

  19. arXiv:1803.06735  [pdf, other

    stat.AP stat.ME

    Bayesian ROC surface estimation under verification bias

    Authors: Rui Zhu, Subhashis Ghosal

    Abstract: The Receiver Operating Characteristic (ROC) surface is a generalization of ROC curve and is widely used for assessment of the accuracy of diagnostic tests on three categories. A complication called the verification bias, meaning that not all subjects have their true disease status verified often occur in real application of ROC analysis. This is a common problem since the gold standard test, which… ▽ More

    Submitted 18 March, 2018; originally announced March 2018.

  20. arXiv:1801.06282  [pdf, other

    stat.ME

    Bayesian method for causal inference in spatially-correlated multivariate time series

    Authors: Bo Ning, Subhashis Ghosal, Jewell Thomas

    Abstract: Measuring the causal impact of an advertising campaign on sales is an essential task for advertising companies. Challenges arise when companies run advertising campaigns in multiple stores which are spatially correlated, and when the sales data have a low signal-to-noise ratio which makes the advertising effects hard to detect. This paper proposes a solution to address both of these challenges. A… ▽ More

    Submitted 12 March, 2018; v1 submitted 18 January, 2018; originally announced January 2018.

    Comments: 28 pages, 6 figures

  21. High-dimensional single-index Bayesian modeling of brain atrophy

    Authors: Arkaprava Roy, Subhashis Ghosal, Kingshuk Roy Choudhury

    Abstract: We propose a model of brain atrophy as a function of high-dimensional genetic information and low dimensional covariates such as gender, age, APOE gene, and disease status. A nonparametric single-index Bayesian model of high dimension is proposed to model the relationship with B-spline series prior on the unknown functions and Dirichlet process scale mixture of centered normal prior on the distrib… ▽ More

    Submitted 11 February, 2019; v1 submitted 18 December, 2017; originally announced December 2017.

    Journal ref: Bayesian Analysis (2019)

  22. arXiv:1710.08619  [pdf, other

    stat.ML cs.LG

    Interpretable Deep Learning applied to Plant Stress Phenoty**

    Authors: Sambuddha Ghosal, David Blystone, Asheesh K. Singh, Baskar Ganapathysubramanian, Arti Singh, Soumik Sarkar

    Abstract: Availability of an explainable deep learning model that can be applied to practical real world scenarios and in turn, can consistently, rapidly and accurately identify specific and minute traits in applicable fields of biological sciences, is scarce. Here we consider one such real world example viz., accurate identification, classification and quantification of biotic and abiotic stresses in crop… ▽ More

    Submitted 28 October, 2017; v1 submitted 24 October, 2017; originally announced October 2017.

  23. Bayesian Modeling of the Structural Connectome for Studying Alzheimer Disease

    Authors: Arkaprava Roy, Subhashis Ghosal, Jeffrey Prescott, Kingshuk Roy Choudhury

    Abstract: We study possible relations between the structure of the connectome, white matter connecting different regions of brain, and Alzheimer disease. Regression models in covariates including age, gender and disease status for the extent of white matter connecting each pair of regions of brain are proposed. Subject We study possible relations between the Alzheimer's disease progression and the structure… ▽ More

    Submitted 31 March, 2019; v1 submitted 12 October, 2017; originally announced October 2017.

    Report number: AOAS1257

    Journal ref: Annals of Applied Statistics 2019, Vol. 13, No. 3, 1791-1816

  24. arXiv:1709.05552  [pdf, other

    stat.ML

    Multivariate Gaussian Network Structure Learning

    Authors: Xingqi Du, Subhashis Ghosal

    Abstract: We consider a graphical model where a multivariate normal vector is associated with each node of the underlying graph and estimate the graphical structure. We minimize a loss function obtained by regressing the vector at each node on those at the remaining ones under a group penalty. We show that the proposed estimator can be computed by a fast convex optimization algorithm. We show that as the sa… ▽ More

    Submitted 16 September, 2017; originally announced September 2017.

    Comments: 30 pages, 17 figures, 3 tables

  25. Bayesian Non-parametric Simultaneous Quantile Regression for Complete and Grid Data

    Authors: Priyam Das, Subhashis Ghosal

    Abstract: In this paper, we consider Bayesian methods for non-parametric quantile regressions with multiple continuous predictors ranging values in the unit interval. In the first method, the quantile function is assumed to be smooth over the explanatory variable and is expanded in tensor product of B-spline basis functions. While in the second method, the distribution function is assumed to be smooth over… ▽ More

    Submitted 30 November, 2016; originally announced December 2016.

    Comments: 25 pages

  26. Analyzing Ozone Concentration by Bayesian Spatio-temporal Quantile Regression

    Authors: Priyam Das, Subhashis Ghosal

    Abstract: Ground level Ozone is one of the six common air-pollutants on which the EPA has set national air quality standards. In order to capture the spatio-temporal trend of 1-hour and 8-hour average ozone concentration in the US, we develop a method for spatio-temporal simultaneous quantile regression. Unlike existing procedures, in the proposed method, smoothing across the sites is incorporated within mo… ▽ More

    Submitted 5 December, 2016; v1 submitted 15 September, 2016; originally announced September 2016.

  27. arXiv:1608.03913  [pdf, other

    math.ST stat.ME

    Bayesian mode and maximum estimation and accelerated rates of contraction

    Authors: William Weimin Yoo, Subhashis Ghosal

    Abstract: We study the problem of estimating the mode and maximum of an unknown regression function in the presence of noise. We adopt the Bayesian approach by using tensor-product B-splines and endowing the coefficients with Gaussian priors. In the usual fixed-in-advanced sampling plan, we establish posterior contraction rates for mode and maximum and show that they coincide with the minimax rates for this… ▽ More

    Submitted 15 March, 2018; v1 submitted 12 August, 2016; originally announced August 2016.

    Comments: 34 pages, 4 figures

    MSC Class: Primary 62G05; 62L12; secondary 62G08; 62G15; 62L05

  28. arXiv:1508.05847  [pdf, other

    math.ST stat.ME

    Bayesian Detection of Image Boundaries

    Authors: Meng Li, Subhashis Ghosal

    Abstract: Detecting boundary of an image based on noisy observations is a fundamental problem of image processing and image segmentation. For a $d$-dimensional image ($d = 2, 3, \ldots$), the boundary can often be described by a closed smooth $(d - 1)$-dimensional manifold. In this paper, we propose a nonparametric Bayesian approach based on priors indexed by $\mathbb{S}^{d - 1}$, the unit sphere in… ▽ More

    Submitted 24 May, 2016; v1 submitted 24 August, 2015; originally announced August 2015.

  29. arXiv:1403.2695  [pdf, ps, other

    math.ST stat.ME

    Adaptive Bayesian density regression for high-dimensional data

    Authors: Weining Shen, Subhashis Ghosal

    Abstract: Density regression provides a flexible strategy for modeling the distribution of a response variable $Y$ given predictors $\mathbf{X}=(X_1,\ldots,X_p)$ by letting that the conditional density of $Y$ given $\mathbf{X}$ as a completely unknown function and allowing its shape to change with the value of $\mathbf{X}$. The number of predictors $p$ may be very large, possibly much larger than the number… ▽ More

    Submitted 6 January, 2016; v1 submitted 11 March, 2014; originally announced March 2014.

    Comments: Published at http://dx.doi.org/10.3150/14-BEJ663 in the Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm)

    Report number: IMS-BEJ-BEJ663

    Journal ref: Bernoulli 2016, Vol. 22, No. 1, 396-420

  30. arXiv:1403.0625  [pdf, ps, other

    math.ST stat.ME

    Adaptive Bayesian procedures using random series priors

    Authors: Weining Shen, Subhashis Ghosal

    Abstract: We consider a prior for nonparametric Bayesian estimation which uses finite random series with a random number of terms. The prior is constructed through distributions on the number of basis functions and the associated coefficients. We derive a general result on adaptive posterior convergence rates for all smoothness levels of the function in the true model by constructing an appropriate "sieve"… ▽ More

    Submitted 7 February, 2015; v1 submitted 3 March, 2014; originally announced March 2014.

    Comments: arXiv admin note: substantial text overlap with arXiv:1204.4238

  31. arXiv:1309.1754  [pdf, other

    math.ST stat.CO

    Bayesian estimation of a sparse precision matrix

    Authors: Sayantan Banerjee, Subhashis Ghosal

    Abstract: We consider the problem of estimating a sparse precision matrix of a multivariate Gaussian distribution, including the case where the dimension $p$ is large. Gaussian graphical models provide an important tool in describing conditional independence through presence or absence of the edges in the underlying graph. A popular non-Bayesian method of estimating a graphical structure is given by the gra… ▽ More

    Submitted 6 April, 2014; v1 submitted 6 September, 2013; originally announced September 2013.

  32. J. K. Ghosh's contribution to statistics: A brief outline

    Authors: Bertrand Clarke, Subhashis Ghosal

    Abstract: Professor Jayanta Kumar Ghosh has contributed massively to various areas of Statistics over the last five decades. Here, we survey some of his most important contributions. In roughly chronological order, we discuss his major results in the areas of sequential analysis, foundations, asymptotics, and Bayesian inference. It is seen that he progressed from thinking about data points, to thinking ab… ▽ More

    Submitted 20 May, 2008; originally announced May 2008.

    Comments: Published in at http://dx.doi.org/10.1214/074921708000000011 the IMS Collections (http://www.imstat.org/publications/imscollections.htm) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-COLL3-IMSCOLL301 MSC Class: 62 (Primary) 62 (Secondary)

    Journal ref: IMS Collections 2008, Vol. 3, 1-18