A Novel Frank-Wolfe Algorithm. Analysis and Applications to Large-Scale SVM Training

Allende, Hector; Frandi, Emanuele; Nanculef, Ricardo; Sartori, Claudio

Computer Science > Computer Vision and Pattern Recognition

arXiv:1304.1014 (cs)

[Submitted on 3 Apr 2013 (v1), last revised 13 Oct 2013 (this version, v2)]

Title:A Novel Frank-Wolfe Algorithm. Analysis and Applications to Large-Scale SVM Training

Authors:Hector Allende, Emanuele Frandi, Ricardo Nanculef, Claudio Sartori

View PDF

Abstract:Recently, there has been a renewed interest in the machine learning community for variants of a sparse greedy approximation procedure for concave optimization known as {the Frank-Wolfe (FW) method}. In particular, this procedure has been successfully applied to train large-scale instances of non-linear Support Vector Machines (SVMs). Specializing FW to SVM training has allowed to obtain efficient algorithms but also important theoretical results, including convergence analysis of training algorithms and new characterizations of model sparsity.
In this paper, we present and analyze a novel variant of the FW method based on a new way to perform away steps, a classic strategy used to accelerate the convergence of the basic FW procedure. Our formulation and analysis is focused on a general concave maximization problem on the simplex. However, the specialization of our algorithm to quadratic forms is strongly related to some classic methods in computational geometry, namely the Gilbert and MDM algorithms.
On the theoretical side, we demonstrate that the method matches the guarantees in terms of convergence rate and number of iterations obtained by using classic away steps. In particular, the method enjoys a linear rate of convergence, a result that has been recently proved for MDM on quadratic forms.
On the practical side, we provide experiments on several classification datasets, and evaluate the results using statistical tests. Experiments show that our method is faster than the FW method with classic away steps, and works well even in the cases in which classic away steps slow down the algorithm. Furthermore, these improvements are obtained without sacrificing the predictive accuracy of the obtained SVM model.

Comments:	REVISED VERSION (October 2013) -- Title and abstract have been revised. Section 5 was added. Some proofs have been summarized (full-length proofs available in the previous version)
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:1304.1014 [cs.CV]
	(or arXiv:1304.1014v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1304.1014
Journal reference:	Information Sciences 285, 66-99, 2014

Submission history

From: Emanuele Frandi [view email]
[v1] Wed, 3 Apr 2013 17:15:43 UTC (4,578 KB)
[v2] Sun, 13 Oct 2013 09:50:26 UTC (1,629 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:A Novel Frank-Wolfe Algorithm. Analysis and Applications to Large-Scale SVM Training

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:A Novel Frank-Wolfe Algorithm. Analysis and Applications to Large-Scale SVM Training

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators