Sequential Bayesian Neural Subnetwork Ensembles

Jantre, Sanket; Bhattacharya, Shrijita; Urban, Nathan M.; Yoon, Byung-Jun; Maiti, Tapabrata; Balaprakash, Prasanna; Madireddy, Sandeep

Statistics > Machine Learning

arXiv:2206.00794 (stat)

[Submitted on 1 Jun 2022 (v1), last revised 19 Aug 2024 (this version, v2)]

Title:Sequential Bayesian Neural Subnetwork Ensembles

Authors:Sanket Jantre, Shrijita Bhattacharya, Nathan M. Urban, Byung-Jun Yoon, Tapabrata Maiti, Prasanna Balaprakash, Sandeep Madireddy

View PDF HTML (experimental)

Abstract:Deep ensembles have emerged as a powerful technique for improving predictive performance and enhancing model robustness across various applications by leveraging model diversity. However, traditional deep ensemble methods are often computationally expensive and rely on deterministic models, which may limit their flexibility. Additionally, while sparse subnetworks of dense models have shown promise in matching the performance of their dense counterparts and even enhancing robustness, existing methods for inducing sparsity typically incur training costs comparable to those of training a single dense model, as they either gradually prune the network during training or apply thresholding post-training. In light of these challenges, we propose an approach for sequential ensembling of dynamic Bayesian neural subnetworks that consistently maintains reduced model complexity throughout the training process while generating diverse ensembles in a single forward pass. Our approach involves an initial exploration phase to identify high-performing regions within the parameter space, followed by multiple exploitation phases that take advantage of the compactness of the sparse model. These exploitation phases quickly converge to different minima in the energy landscape, corresponding to high-performing subnetworks that together form a diverse and robust ensemble. We empirically demonstrate that our proposed approach outperforms traditional dense and sparse deterministic and Bayesian ensemble models in terms of prediction accuracy, uncertainty estimation, out-of-distribution detection, and adversarial robustness.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
Cite as:	arXiv:2206.00794 [stat.ML]
	(or arXiv:2206.00794v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2206.00794

Submission history

From: Sanket Jantre [view email]
[v1] Wed, 1 Jun 2022 22:57:52 UTC (1,150 KB)
[v2] Mon, 19 Aug 2024 22:20:16 UTC (204 KB)

Statistics > Machine Learning

Title:Sequential Bayesian Neural Subnetwork Ensembles

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Sequential Bayesian Neural Subnetwork Ensembles

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators