A Progressive Batching L-BFGS Method for Machine Learning

Bollapragada, Raghu; Mudigere, Dheevatsa; Nocedal, Jorge; Shi, Hao-Jun Michael; Tang, Ping Tak Peter

Mathematics > Optimization and Control

arXiv:1802.05374 (math)

[Submitted on 15 Feb 2018 (v1), last revised 30 May 2018 (this version, v2)]

Title:A Progressive Batching L-BFGS Method for Machine Learning

Authors:Raghu Bollapragada, Dheevatsa Mudigere, Jorge Nocedal, Hao-Jun Michael Shi, Ping Tak Peter Tang

View PDF

Abstract:The standard L-BFGS method relies on gradient approximations that are not dominated by noise, so that search directions are descent directions, the line search is reliable, and quasi-Newton updating yields useful quadratic models of the objective function. All of this appears to call for a full batch approach, but since small batch sizes give rise to faster algorithms with better generalization properties, L-BFGS is currently not considered an algorithm of choice for large-scale machine learning applications. One need not, however, choose between the two extremes represented by the full batch or highly stochastic regimes, and may instead follow a progressive batching approach in which the sample size increases during the course of the optimization. In this paper, we present a new version of the L-BFGS algorithm that combines three basic components - progressive batching, a stochastic line search, and stable quasi-Newton updating - and that performs well on training logistic regression and deep neural networks. We provide supporting convergence theory for the method.

Comments:	ICML 2018. 25 pages, 17 figures, 2 tables
Subjects:	Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1802.05374 [math.OC]
	(or arXiv:1802.05374v2 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.1802.05374

Submission history

From: Hao-Jun Shi [view email]
[v1] Thu, 15 Feb 2018 01:02:36 UTC (2,718 KB)
[v2] Wed, 30 May 2018 04:38:48 UTC (3,233 KB)

Mathematics > Optimization and Control

Title:A Progressive Batching L-BFGS Method for Machine Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:A Progressive Batching L-BFGS Method for Machine Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators