Showing 1–33 of 33 results for author: Mallick, B K

  1. arXiv:2404.03152  [pdf, other]

    stat.ME

    Orthogonal calibration via posterior projections with applications to the Schwarzschild model

    Authors: Antik Chakraborty, Jonelle B. Walsh, Louis Strigari, Bani K. Mallick, Anirban Bhattacharya

    Abstract: The orbital superposition method originally developed by Schwarzschild (1979) is used to study the dynamics of growth of a black hole and its host galaxy, and has uncovered new relationships between the galaxy's global characteristics. Scientists are specifically interested in finding optimal parameter choices for this model that best match physical measurements along with quantifying the uncertai…

    Submitted 11 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

  2. arXiv:2309.06349  [pdf, other]

    stat.ML cs.LG eess.SY math.OC math.ST

    Generalized Regret Analysis of Thompson Sampling using Fractional Posteriors

    Authors: Prateek Jaiswal, Debdeep Pati, Anirban Bhattacharya, Bani K. Mallick

    Abstract: Thompson sampling (TS) is one of the most popular and earliest algorithms to solve stochastic multi-armed bandit problems. We consider a variant of TS, named $α$-TS, where we use a fractional or $α$-posterior ($α\in(0,1)$) instead of the standard posterior distribution. To compute an $α$-posterior, the likelihood in the definition of the standard posterior is tempered with a factor $α$. For $α$-TS…

    Submitted 12 September, 2023; originally announced September 2023.
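
The tempering step described in this abstract can be sketched for a toy case. For a Bernoulli reward with a Beta$(a, b)$ prior, raising the likelihood to the power $α$ gives the $α$-posterior Beta$(a + αs, b + αf)$ after $s$ successes and $f$ failures. The Bernoulli reward model, the uniform Beta(1, 1) prior, and the function name `alpha_thompson_bernoulli` are assumptions made for illustration and are not taken from the paper.

```python
import random


def alpha_thompson_bernoulli(true_means, horizon, alpha=0.5, seed=0):
    """Toy alpha-TS on a Bernoulli bandit (illustrative assumptions only).

    Returns the number of pulls per arm and the total reward collected.
    """
    rng = random.Random(seed)
    k = len(true_means)
    successes = [0] * k
    failures = [0] * k
    pulls = [0] * k
    total_reward = 0

    for _ in range(horizon):
        # Beta(1, 1) prior; tempering the Bernoulli likelihood by alpha
        # turns the usual Beta(1 + s, 1 + f) into Beta(1 + alpha*s, 1 + alpha*f).
        draws = [
            rng.betavariate(1 + alpha * successes[i], 1 + alpha * failures[i])
            for i in range(k)
        ]
        # Play the arm whose sampled mean is largest.
        arm = max(range(k), key=draws.__getitem__)
        reward = 1 if rng.random() < true_means[arm] else 0

        pulls[arm] += 1
        total_reward += reward
        if reward:
            successes[arm] += 1
        else:
            failures[arm] += 1

    return pulls, total_reward
```

In this toy setting, passing `alpha=1.0` recovers standard Thompson sampling, while smaller values of `alpha` discount the data and keep the sampled means more dispersed.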

  3. Covariate-Assisted Bayesian Graph Learning for Heterogeneous Data

    Authors: Yabo Niu, Yang Ni, Debdeep Pati, Bani K. Mallick

    Abstract: In a traditional Gaussian graphical model, data homogeneity is routinely assumed with no extra variables affecting the conditional independence. In modern genomic datasets, there is an abundance of auxiliary information, which often gets under-utilized in determining the joint dependency structure. In this article, we consider a Bayesian approach to model undirected graphs underlying heterogeneous…

    Submitted 15 August, 2023; originally announced August 2023.

    Comments: 58 pages, 12 figures, accepted by Journal of the American Statistical Association

  4. arXiv:2305.08239  [pdf, other]

    stat.AP

    Bayesian Flexible Modelling of Spatially Resolved Transcriptomic Data

    Authors: Arhit Chakrabarti, Yang Ni, Bani K. Mallick

    Abstract: Single-cell RNA-sequencing technologies may provide valuable insights to the understanding of the composition of different cell types and their functions within a tissue. Recent technologies such as spatial transcriptomics, enable the measurement of gene expressions at the single cell level along with the spatial locations of these cells in the tissue. Dimension-reduction and spatial clustering ar…

    Submitted 14 May, 2023; originally announced May 2023.

  5. arXiv:2304.09945  [pdf, other]

    stat.CO

    Blocked Gibbs sampler for hierarchical Dirichlet processes

    Authors: Snigdha Das, Yabo Niu, Yang Ni, Bani K. Mallick, Debdeep Pati

    Abstract: Posterior computation in hierarchical Dirichlet process (HDP) mixture models is an active area of research in nonparametric Bayes inference of grouped data. Existing literature almost exclusively focuses on the Chinese restaurant franchise (CRF) analogy of the marginal distribution of the parameters, which can mix poorly and is known to have a linear complexity with the sample size. A recently dev…

    Submitted 19 April, 2023; originally announced April 2023.

  6. arXiv:2303.08979  [pdf, other]

    stat.ME stat.CO

    An Approximate Bayesian Approach to Covariate-dependent Graphical Modeling

    Authors: Sutanoy Dasgupta, Peng Zhao, Jacob Helwig, Prasenjit Ghosh, Debdeep Pati, Bani K. Mallick

    Abstract: Gaussian graphical models typically assume a homogeneous structure across all subjects, which is often restrictive in applications. In this article, we propose a weighted pseudo-likelihood approach for graphical modeling which allows different subjects to have different graphical structures depending on extraneous covariates. The pseudo-likelihood approach replaces the joint distribution by a prod…

    Submitted 15 March, 2023; originally announced March 2023.

  7. arXiv:2302.09111  [pdf, other]

    stat.ME stat.ML

    Graphical Dirichlet Process for Clustering Non-Exchangeable Grouped Data

    Authors: Arhit Chakrabarti, Yang Ni, Ellen Ruth A. Morris, Michael L. Salinas, Robert S. Chapkin, Bani K. Mallick

    Abstract: We consider the problem of clustering grouped data with possibly non-exchangeable groups whose dependencies can be characterized by a known directed acyclic graph. To allow the sharing of clusters among the non-exchangeable groups, we propose a Bayesian nonparametric approach, termed graphical Dirichlet process, that jointly models the dependent group-specific random measures by assuming each rand…

    Submitted 31 July, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

  8. arXiv:2210.00091  [pdf, other]

    stat.ME stat.ML

    Factorized Fusion Shrinkage for Dynamic Relational Data

    Authors: Peng Zhao, Anirban Bhattacharya, Debdeep Pati, Bani K. Mallick

    Abstract: Modern data science applications often involve complex relational data with dynamic structures. An abrupt change in such dynamic relational data is typically observed in systems that undergo regime changes due to interventions. In such a case, we consider a factorized fusion shrinkage model in which all decomposed factors are dynamically shrunk towards group-wise fusion structures, where the shrin…

    Submitted 18 April, 2023; v1 submitted 30 September, 2022; originally announced October 2022.

  9. arXiv:2209.15117  [pdf, other]

    stat.ML math.ST stat.CO

    Structured Optimal Variational Inference for Dynamic Latent Space Models

    Authors: Peng Zhao, Anirban Bhattacharya, Debdeep Pati, Bani K. Mallick

    Abstract: We consider a latent space model for dynamic networks, where our objective is to estimate the pairwise inner products of the latent positions. To balance posterior inference and computational scalability, we present a structured mean-field variational inference framework, where the time-dependent properties of the dynamic networks are exploited to facilitate computation and inference. Additionally…

    Submitted 29 September, 2022; originally announced September 2022.

  10. arXiv:2207.03242  [pdf, ps, other]

    stat.ME

    A Bayesian Survival Tree Partition Model Using Latent Gaussian Processes

    Authors: Richard D. Payne, Nilabja Guha, Bani K. Mallick

    Abstract: Survival models are used to analyze time-to-event data in a variety of disciplines. Proportional hazard models provide interpretable parameter estimates, but proportional hazards assumptions are not always appropriate. Non-parametric models are more flexible but often lack a clear inferential framework. We propose a Bayesian tree partition model which is both flexible and inferential. Inference is…

    Submitted 7 July, 2022; originally announced July 2022.

  11. arXiv:2202.03979  [pdf, other]

    stat.CO

    Adaptive Bayesian Variable Clustering via Structural Learning of Breast Cancer Data

    Authors: Riddhi Pratim Ghosh, Arnab Kumar Maity, Mohsen Pourahmadi, Bani K. Mallick

    Abstract: Clustering of proteins is of interest in cancer cell biology. This article proposes a hierarchical Bayesian model for protein (variable) clustering hinging on correlation structure. Starting from a multivariate normal likelihood, we enforce the clustering through prior modeling using angle based unconstrained reparameterization of correlations and assume a truncated Poisson distribution (to penali…

    Submitted 8 February, 2022; originally announced February 2022.

  12. Bayesian Structural Equation Modeling in Multiple Omics Data Integration with Application to Circadian Genes

    Authors: Arnab Kumar Maity, Sang Chan Lee, Bani K. Mallick, Tapasree Roy Sarkar

    Abstract: It is well known that the integration among different data-sources is reliable because of its potential of unveiling new functionalities of the genomic expressions which might be dormant in a single source analysis. Moreover, different studies have justified the more powerful analyses of multi-platform data. Toward this, in this study, we consider the circadian genes' omics profile such as copy nu…

    Submitted 6 December, 2021; originally announced December 2021.

    Journal ref: Bioinformatics, 36(13), 3951-3958 (2020)

  13. arXiv:2010.14638  [pdf, ps, other]

    stat.ME math.ST

    Bayesian Variable Selection in Multivariate Nonlinear Regression with Graph Structures

    Authors: Yabo Niu, Nilabja Guha, Debkumar De, Anindya Bhadra, Veerabhadran Baladandayuthapani, Bani K. Mallick

    Abstract: Gaussian graphical models (GGMs) are well-established tools for probabilistic exploration of dependence structures using precision matrices. We develop a Bayesian method to incorporate covariate information in this GGMs setup in a nonlinear seemingly unrelated regression framework. We propose a joint predictor and graph selection model and develop an efficient collapsed Gibbs sampler algorithm to…

    Submitted 30 July, 2021; v1 submitted 27 October, 2020; originally announced October 2020.

  14. arXiv:2007.02192  [pdf, other]

    math.ST stat.AP stat.CO stat.ME stat.ML

    Tail-adaptive Bayesian shrinkage

    Authors: Se Yoon Lee, Peng Zhao, Debdeep Pati, Bani K. Mallick

    Abstract: Robust Bayesian methods for high-dimensional regression problems under diverse sparse regimes are studied. Traditional shrinkage priors are primarily designed to detect a handful of signals from tens of thousands of predictors in the so-called ultra-sparsity domain. However, they may not perform desirably when the degree of sparsity is moderate. In this paper, we propose a robust sparse estimation…

    Submitted 19 February, 2024; v1 submitted 4 July, 2020; originally announced July 2020.

  15. Estimation of COVID-19 spread curves integrating global data and borrowing information

    Authors: Se Yoon Lee, Bowen Lei, Bani K. Mallick

    Abstract: Currently, novel coronavirus disease 2019 (COVID-19) is a big threat to global health. The rapid spread of the virus has created pandemic, and countries all over the world are struggling with a surge in COVID-19 infected cases. There are no drugs or other therapeutics approved by the US Food and Drug Administration to prevent or treat COVID-19: information on the disease is very limited and scatte…

    Submitted 10 July, 2020; v1 submitted 1 May, 2020; originally announced May 2020.

    Journal ref: PLOS ONE 15 (2020) 1-17

  16. arXiv:2003.07494  [pdf, other]

    stat.ME q-bio.GN stat.AP stat.ML

    Directionally Dependent Multi-View Clustering Using Copula Model

    Authors: Kahkashan Afrin, Ashif S. Iquebal, Mostafa Karimi, Allyson Souris, Se Yoon Lee, Bani K. Mallick

    Abstract: In recent biomedical scientific problems, it is a fundamental issue to integratively cluster a set of objects from multiple sources of datasets. Such problems are mostly encountered in genomics, where data is collected from various sources, and typically represent distinct yet complementary information. Integrating these data sources for multi-source clustering is challenging due to their complex…

    Submitted 22 August, 2020; v1 submitted 16 March, 2020; originally announced March 2020.

  17. arXiv:1912.05084  [pdf, other]

    stat.ME

    Bayesian Copula Density Deconvolution for Zero-Inflated Data in Nutritional Epidemiology

    Authors: Abhra Sarkar, Debdeep Pati, Bani K. Mallick, Raymond J. Carroll

    Abstract: Estimating the marginal and joint densities of the long-term average intakes of different dietary components is an important problem in nutritional epidemiology. Since these variables cannot be directly measured, data are usually collected in the form of 24-hour recalls of the intakes, which show marked patterns of conditional heteroscedasticity. Significantly compounding the challenges, the recal…

    Submitted 10 December, 2019; originally announced December 2019.

  18. arXiv:1703.06978  [pdf, other]

    stat.ME

    A Conditional Density Estimation Partition Model Using Logistic Gaussian Processes

    Authors: Richard D. Payne, Nilabja Guha, Yu Ding, Bani K. Mallick

    Abstract: Conditional density estimation (density regression) estimates the distribution of a response variable y conditional on covariates x. Utilizing a partition model framework, a conditional density estimation method is proposed using logistic Gaussian processes. The partition is created using a Voronoi tessellation and is learned from the data using a reversible jump Markov chain Monte Carlo algorithm…

    Submitted 20 March, 2017; originally announced March 2017.
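
As a rough illustration of how a Voronoi tessellation partitions covariate space, the sketch below assigns each covariate vector to the cell of its nearest center. The use of plain Euclidean distance, fixed centers, and the helper name `voronoi_assign` are assumptions for illustration; the abstract does not specify the distance, and in the paper the partition is learned from the data rather than fixed.

```python
def voronoi_assign(points, centers):
    """Return, for each point, the index of its nearest center.

    The set of points sharing an index forms one Voronoi cell.
    Ties go to the lowest-indexed center.
    """
    def sq_dist(p, c):
        # Squared Euclidean distance; the ordering matches true distance.
        return sum((pi - ci) ** 2 for pi, ci in zip(p, c))

    return [
        min(range(len(centers)), key=lambda j: sq_dist(p, centers[j]))
        for p in points
    ]


# Three covariate vectors, two hypothetical centers.
cells = voronoi_assign([(0, 0), (1, 1), (0.9, 0.2)], [(0, 0), (1, 0)])
print(cells)  # [0, 1, 1]
```

A separate density model could then be fitted to the responses within each cell, which is the general idea of a partition model.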

  19. arXiv:1612.00877  [pdf, ps, other]

    stat.ME math.ST

    Bayesian sparse multiple regression for simultaneous rank reduction and variable selection

    Authors: Antik Chakraborty, Anirban Bhattacharya, Bani K. Mallick

    Abstract: We develop a Bayesian methodology aimed at simultaneously estimating low-rank and row-sparse matrices in a high-dimensional multiple-response linear regression model. We consider a carefully devised shrinkage prior on the matrix of regression coefficients which obviates the need to specify a prior on the rank, and shrinks the regression matrix towards low-rank and row-sparse structures. We provide…

    Submitted 8 April, 2019; v1 submitted 2 December, 2016; originally announced December 2016.

  20. arXiv:1611.02480  [pdf, other]

    stat.ME

    Quantile Graphical Models: Bayesian Approaches

    Authors: Nilabja Guha, Veera Baladandayuthapani, Bani K. Mallick

    Abstract: Graphical models are ubiquitous tools to describe the interdependence between variables measured simultaneously such as large-scale gene or protein expression data. Gaussian graphical models (GGMs) are well-established tools for probabilistic exploration of dependence structures using precision matrices and they are generated under a multivariate normal joint distribution. However, they suffer fro…

    Submitted 8 January, 2020; v1 submitted 8 November, 2016; originally announced November 2016.

  21. Bayesian and Variational Bayesian approaches for flows in heterogenous random media

    Authors: Keren Yang, Nilabja Guha, Yalchin Efendiev, Bani K. Mallick

    Abstract: In this paper, we study porous media flows in heterogeneous stochastic media. We propose an efficient forward simulation technique that is tailored for variational Bayesian inversion. As a starting point, the proposed forward simulation technique decomposes the solution into the sum of separable functions (with respect to randomness and the space), where each term is calculated based on a variatio…

    Submitted 8 February, 2018; v1 submitted 3 November, 2016; originally announced November 2016.

  22. arXiv:1508.02803  [pdf, other]

    stat.ME

    Bayesian Variable Selection with Structure Learning: Applications in Integrative Genomics

    Authors: Suprateek Kundu, Minsuk Shin, Yichen Cheng, Ganiraju Manyam, Bani K. Mallick, Veera Baladandayuthapani

    Abstract: Significant advances in biotechnology have allowed for simultaneous measurement of molecular data points across multiple genomic and transcriptomic levels from a single tumor/cancer sample. This has motivated systematic approaches to integrate multi-dimensional structured datasets since cancer development and progression is driven by numerous co-ordinated molecular alterations and the interactions…

    Submitted 11 August, 2015; originally announced August 2015.

  23. arXiv:1506.04778  [pdf, other]

    stat.CO

    Fast sampling with Gaussian scale-mixture priors in high-dimensional regression

    Authors: Anirban Bhattacharya, Antik Chakraborty, Bani K. Mallick

    Abstract: We propose an efficient way to sample from a class of structured multivariate Gaussian distributions which routinely arise as conditional posteriors of model parameters that are assigned a conditionally Gaussian prior. The proposed algorithm only requires matrix operations in the form of matrix multiplications and linear system solutions. We exhibit that the computational complexity of the propose…

    Submitted 27 June, 2016; v1 submitted 15 June, 2015; originally announced June 2015.

  24. arXiv:1411.5653  [pdf, ps, other]

    stat.ME

    Two-Stage Metropolis-Hastings for Tall Data

    Authors: Richard D. Payne, Bani K. Mallick

    Abstract: This paper discusses the challenges presented by tall data problems associated with Bayesian classification (specifically binary classification) and the existing methods to handle them. Current methods include parallelizing the likelihood, subsampling, and consensus Monte Carlo. A new method based on the two-stage Metropolis-Hastings algorithm is also proposed. The purpose of this algorithm is to…

    Submitted 20 March, 2017; v1 submitted 20 November, 2014; originally announced November 2014.

    Comments: To appear in Journal of Classification, Volume 35, Issue 1 (2018)

  25. arXiv:1404.6462  [pdf, other]

    stat.ME

    Bayesian Semiparametric Multivariate Density Deconvolution

    Authors: Abhra Sarkar, Debdeep Pati, Bani K. Mallick, Raymond J. Carroll

    Abstract: We consider the problem of multivariate density deconvolution when the interest lies in estimating the distribution of a vector-valued random variable but precise measurements of the variable of interest are not available, observations being contaminated with additive measurement errors. The existing sparse literature on the problem assumes the density of the measurement errors to be completely kn…

    Submitted 5 December, 2016; v1 submitted 25 April, 2014; originally announced April 2014.

  26. Bayesian sparse graphical models for classification with application to protein expression data

    Authors: Veerabhadran Baladandayuthapani, Rajesh Talluri, Yuan Ji, Kevin R. Coombes, Yiling Lu, Bryan T. Hennessy, Michael A. Davies, Bani K. Mallick

    Abstract: Reverse-phase protein array (RPPA) analysis is a powerful, relatively new platform that allows for high-throughput, quantitative analysis of protein networks. One of the challenges that currently limit the potential of this technology is the lack of methods that allow for accurate data modeling and identification of related networks and samples. Such models may improve the accuracy of biological s…

    Submitted 21 November, 2014; v1 submitted 29 March, 2014; originally announced March 2014.

    Comments: Published in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org); available at http://dx.doi.org/10.1214/14-AOAS722

    Report number: IMS-AOAS-AOAS722

    Journal ref: Annals of Applied Statistics 2014, Vol. 8, No. 3, 1443-1468

  27. Bayesian object classification of gold nanoparticles

    Authors: Bledar A. Konomi, Soma S. Dhavala, Jianhua Z. Huang, Subrata Kundu, David Huitink, Hong Liang, Yu Ding, Bani K. Mallick

    Abstract: The properties of materials synthesized with nanoparticles (NPs) are highly correlated to the sizes and shapes of the nanoparticles. The transmission electron microscopy (TEM) imaging technique can be used to measure the morphological characteristics of NPs, which can be simple circles or more complex irregular polygons with varying degrees of scales and sizes. A major difficulty in analyzing the…

    Submitted 5 December, 2013; originally announced December 2013.

    Comments: Published in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org); available at http://dx.doi.org/10.1214/12-AOAS616

    Report number: IMS-AOAS-AOAS616

    Journal ref: Annals of Applied Statistics 2013, Vol. 7, No. 2, 640-668

  28. arXiv:1310.4195  [pdf, ps, other]

    stat.ME

    Bayesian Low Rank and Sparse Covariance Matrix Decomposition

    Authors: Lin Zhang, Abhra Sarkar, Bani K. Mallick

    Abstract: We consider the problem of estimating high-dimensional covariance matrices of a particular structure, which is a summation of low rank and sparse matrices. This covariance structure has a wide range of applications including factor analysis and random effects models. We propose a Bayesian method of estimating the covariance matrices by representing the covariance model in the form of a factor mode…

    Submitted 15 October, 2013; originally announced October 2013.
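
The link between a factor representation and a "low rank plus sparse" covariance can be written out in a generic form. The notation below ($\Lambda$, $f_i$, $S$, $k$, $p$) is assumed for illustration and may differ from the paper's: with a $p \times k$ loading matrix $\Lambda$ ($k < p$), latent factors independent of the errors, and a sparse error covariance $S$,

```latex
\begin{align*}
y_i &= \Lambda f_i + \varepsilon_i, \qquad f_i \sim N(0, I_k), \quad \varepsilon_i \sim N(0, S), \\
\operatorname{Cov}(y_i) &= \Lambda \Lambda^{\top} + S .
\end{align*}
```

Here $\Lambda \Lambda^{\top}$ has rank at most $k$, so it plays the role of the low-rank part, and $S$ is the sparse part.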

  29. arXiv:1310.1127  [pdf, other]

    stat.ME

    Bayesian sparse graphical models and their mixtures using lasso selection priors

    Authors: Rajesh Talluri, Veerabhadran Baladandayuthapani, Bani K. Mallick

    Abstract: We propose Bayesian methods for Gaussian graphical models that lead to sparse and adaptively shrunk estimators of the precision (inverse covariance) matrix. Our methods are based on lasso-type regularization priors leading to parsimonious parameterization of the precision matrix, which is essential in several applications involving learning relationships among the variables. In this context, we in…

    Submitted 3 October, 2013; originally announced October 2013.

    Comments: under revision

  30. arXiv:1308.3915  [pdf, other]

    stat.ME

    Bayes Regularized Graphical Model Estimation in High Dimensions

    Authors: Suprateek Kundu, Veera Baladandayuthapani, Bani K. Mallick

    Abstract: There has been an intense development of Bayes graphical model estimation approaches over the past decade - however, most of the existing methods are restricted to moderate dimensions. We propose a novel approach suitable for high dimensional settings, by decoupling model fitting and covariance selection. First, a full model based on a complete graph is fit under novel class of continuous shrinkag…

    Submitted 18 August, 2013; originally announced August 2013.

    Comments: 42 Pages, 4 figures, 5 tables

  31. Investigating international new product diffusion speed: A semiparametric approach

    Authors: Brian M. Hartman, Bani K. Mallick, Debabrata Talukdar

    Abstract: Global marketing managers are interested in understanding the speed of the new product diffusion process and how the speed has changed in our ever more technologically advanced and global marketplace. Understanding the process allows firms to forecast the expected rate of return on their new products and develop effective marketing strategies. The most recent major study on this topic [Marketing S…

    Submitted 28 June, 2012; originally announced June 2012.

    Comments: Published in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org); available at http://dx.doi.org/10.1214/11-AOAS519

    Report number: IMS-AOAS-AOAS519

    Journal ref: Annals of Applied Statistics 2012, Vol. 6, No. 2, 625-651

  32. arXiv:1205.1839  [pdf, other]

    stat.ME

    Nonparametric Bayesian Approaches to Non-homogeneous Hidden Markov Models

    Authors: Abhra Sarkar, Anindya Bhadra, Bani K. Mallick

    Abstract: In this article a flexible Bayesian non-parametric model is proposed for non-homogeneous hidden Markov models. The model is developed through the amalgamation of the ideas of hidden Markov models and predictor dependent stick-breaking processes. Computation is carried out using auxiliary variable representation of the model which enable us to perform exact MCMC sampling from the posterior. Further…

    Submitted 8 May, 2012; originally announced May 2012.

  33. A generalized linear mixed model for longitudinal binary data with a marginal logit link function

    Authors: Michael Parzen, Souparno Ghosh, Stuart Lipsitz, Debajyoti Sinha, Garrett M. Fitzmaurice, Bani K. Mallick, Joseph G. Ibrahim

    Abstract: Longitudinal studies of a binary outcome are common in the health, social, and behavioral sciences. In general, a feature of random effects logistic regression models for longitudinal binary data is that the marginal functional form, when integrated over the distribution of the random effects, is no longer of logistic form. Recently, Wang and Louis [Biometrika 90 (2003) 765--775] proposed a random…

    Submitted 18 April, 2011; originally announced April 2011.

    Comments: Published in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org); available at http://dx.doi.org/10.1214/10-AOAS390

    Report number: IMS-AOAS-AOAS390

    Journal ref: Annals of Applied Statistics 2011, Vol. 5, No. 1, 449-467