CR-SAM: Curvature Regularized Sharpness-Aware Minimization

Wu, Tao; Luo, Tie; Wunsch, Donald C.

Computer Science > Machine Learning

arXiv:2312.13555 (cs)

[Submitted on 21 Dec 2023 (v1), last revised 23 Dec 2023 (this version, v2)]

Abstract: The capacity to generalize to future unseen data stands as one of the most crucial attributes of deep neural networks. Sharpness-Aware Minimization (SAM) aims to enhance generalizability by minimizing the worst-case loss, using one-step gradient ascent as an approximation. However, as training progresses, the non-linearity of the loss landscape increases, rendering one-step gradient ascent less effective. On the other hand, multi-step gradient ascent incurs a higher training cost. In this paper, we introduce a normalized Hessian trace to accurately measure the curvature of the loss landscape on both training and test sets. In particular, to counter excessive non-linearity of the loss landscape, we propose Curvature Regularized SAM (CR-SAM), which integrates the normalized Hessian trace as a SAM regularizer. Additionally, we present an efficient way to compute the trace via finite differences with parallelism. Our theoretical analysis based on PAC-Bayes bounds establishes the regularizer's efficacy in reducing generalization error. Empirical evaluation on CIFAR and ImageNet datasets shows that CR-SAM consistently enhances classification performance for ResNet and Vision Transformer (ViT) models. Our code is available on GitHub.
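To make the two ingredients named in the abstract concrete, the following is a minimal NumPy sketch of (a) a SAM-style gradient taken at a one-step gradient-ascent point and (b) a Hessian-trace estimate built from finite differences of the loss. It is not the authors' implementation: the estimator shown is the generic Hutchinson-style central second difference, the paper's normalization of the trace is not reproduced, and every name here (`sam_gradient`, `hessian_trace_fd`, `toy_loss`, `toy_grad`, `rho`, `step`, `num_probes`) is a hypothetical choice made for illustration.

```python
import numpy as np


def sam_gradient(grad_fn, w, rho=0.05, eps=1e-12):
    """Return the gradient evaluated at the one-step ascent point.

    The perturbed point is w + rho * g / ||g||, where g is the gradient at w.
    `rho` (perturbation radius) and `eps` (division guard) are assumed values.
    """
    g = grad_fn(w)
    w_adv = w + rho * g / (np.linalg.norm(g) + eps)
    return grad_fn(w_adv)


def hessian_trace_fd(loss_fn, w, num_probes=8, step=1e-2, seed=0):
    """Estimate tr(H) using only loss evaluations.

    For a random sign vector v, the central second difference
    (L(w + step*v) - 2 L(w) + L(w - step*v)) / step**2 approximates v^T H v,
    and averaging v^T H v over random v estimates the trace (Hutchinson).
    This is a generic estimator, not necessarily the one used in the paper.
    """
    rng = np.random.default_rng(seed)
    base = loss_fn(w)
    estimates = []
    for _ in range(num_probes):
        v = rng.choice([-1.0, 1.0], size=w.shape)  # Rademacher probe
        # The two perturbed evaluations are independent of each other,
        # so they could in principle be computed in parallel.
        second_diff = loss_fn(w + step * v) - 2.0 * base + loss_fn(w - step * v)
        estimates.append(second_diff / step**2)
    return float(np.mean(estimates))


# Toy quadratic loss L(w) = 0.5 * w^T A w, whose Hessian is exactly A.
A = np.diag([1.0, 3.0])


def toy_loss(w):
    return 0.5 * w @ A @ w


def toy_grad(w):
    return A @ w


if __name__ == "__main__":
    w0 = np.array([1.0, -2.0])
    print("SAM gradient:", sam_gradient(toy_grad, w0))
    print("trace estimate:", hessian_trace_fd(toy_loss, w0), "true:", np.trace(A))
```

On this toy quadratic the second difference is exact up to floating-point error, so the estimate can be checked directly against `np.trace(A)`. On a real network the loss is evaluated on mini-batches and the estimate is noisy, so `step` and `num_probes` would need tuning.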

Comments:	AAAI 2024, main track. Code available on GitHub. The appendix is also included in this updated version.
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2312.13555 [cs.LG]
	(or arXiv:2312.13555v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2312.13555

Submission history

From: Tie Luo
[v1] Thu, 21 Dec 2023 03:46:29 UTC (540 KB)
[v2] Sat, 23 Dec 2023 07:15:23 UTC (542 KB)
