Scalable Bayesian inference for the generalized linear mixed model

Berchuck, Samuel I.; Medeiros, Felipe A.; Mukherjee, Sayan; Agazzi, Andrea

Statistics > Computation

arXiv:2403.03007 (stat)

[Submitted on 5 Mar 2024 (v1), last revised 16 Apr 2024 (this version, v2)]

Title:Scalable Bayesian inference for the generalized linear mixed model

Authors:Samuel I. Berchuck, Felipe A. Medeiros, Sayan Mukherjee, Andrea Agazzi

View PDF HTML (experimental)

Abstract:The generalized linear mixed model (GLMM) is a popular statistical approach for handling correlated data, and is used extensively in applications areas where big data is common, including biomedical data settings. The focus of this paper is scalable statistical inference for the GLMM, where we define statistical inference as: (i) estimation of population parameters, and (ii) evaluation of scientific hypotheses in the presence of uncertainty. Artificial intelligence (AI) learning algorithms excel at scalable statistical estimation, but rarely include uncertainty quantification. In contrast, Bayesian inference provides full statistical inference, since uncertainty quantification results automatically from the posterior distribution. Unfortunately, Bayesian inference algorithms, including Markov Chain Monte Carlo (MCMC), become computationally intractable in big data settings. In this paper, we introduce a statistical inference algorithm at the intersection of AI and Bayesian inference, that leverages the scalability of modern AI algorithms with guaranteed uncertainty quantification that accompanies Bayesian inference. Our algorithm is an extension of stochastic gradient MCMC with novel contributions that address the treatment of correlated data (i.e., intractable marginal likelihood) and proper posterior variance estimation. Through theoretical and empirical results we establish our algorithm's statistical inference properties, and apply the method in a large electronic health records database.

Comments:	42 pages, 13 figures, 2 tables
Subjects:	Computation (stat.CO); Methodology (stat.ME); Machine Learning (stat.ML)
Cite as:	arXiv:2403.03007 [stat.CO]
	(or arXiv:2403.03007v2 [stat.CO] for this version)
	https://doi.org/10.48550/arXiv.2403.03007

Submission history

From: Andrea Agazzi [view email]
[v1] Tue, 5 Mar 2024 14:35:34 UTC (455 KB)
[v2] Tue, 16 Apr 2024 17:47:39 UTC (450 KB)

Statistics > Computation

Title:Scalable Bayesian inference for the generalized linear mixed model

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Computation

Title:Scalable Bayesian inference for the generalized linear mixed model

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators