Bagging Improves Generalization Exponentially

Qian, Huajie; Ying, Donghao; Lam, Henry; Yin, Wotao

Mathematics > Optimization and Control

arXiv:2405.14741 (math)

[Submitted on 23 May 2024 (v1), last revised 29 May 2024 (this version, v2)]

Title:Bagging Improves Generalization Exponentially

Authors:Huajie Qian, Donghao Ying, Henry Lam, Wotao Yin

View PDF HTML (experimental)

Abstract:Bagging is a popular ensemble technique to improve the accuracy of machine learning models. It hinges on the well-established rationale that, by repeatedly retraining on resampled data, the aggregated model exhibits lower variance and hence higher stability, especially for discontinuous base learners. In this paper, we provide a new perspective on bagging: By suitably aggregating the base learners at the parametrization instead of the output level, bagging improves generalization performances exponentially, a strength that is significantly more powerful than variance reduction. More precisely, we show that for general stochastic optimization problems that suffer from slowly (i.e., polynomially) decaying generalization errors, bagging can effectively reduce these errors to an exponential decay. Moreover, this power of bagging is agnostic to the solution schemes, including common empirical risk minimization, distributionally robust optimization, and various regularizations. We demonstrate how bagging can substantially improve generalization performances in a range of examples involving heavy-tailed data that suffer from intrinsically slow rates.

Comments:	Correct author list typo
Subjects:	Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2405.14741 [math.OC]
	(or arXiv:2405.14741v2 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.2405.14741

Submission history

From: Donghao Ying [view email]
[v1] Thu, 23 May 2024 16:05:10 UTC (1,892 KB)
[v2] Wed, 29 May 2024 05:27:04 UTC (1,892 KB)

Mathematics > Optimization and Control

Title:Bagging Improves Generalization Exponentially

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:Bagging Improves Generalization Exponentially

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators