Efficiently Bounding Optimal Solutions after Small Data Modification in Large-Scale Empirical Risk Minimization

Hanada, Hiroyuki; Shibagaki, Atsushi; Sakuma, Jun; Takeuchi, Ichiro

arXiv:1606.00136 (stat)

[Submitted on 1 Jun 2016]

Abstract: We study large-scale classification problems in changing environments, where a small part of the dataset is modified and the effect of that modification must be quickly incorporated into the classifier. When the entire dataset is large, the computational cost of re-training the classifier would be prohibitively large even if the amount of modified data is fairly small. In this paper, we propose a novel method for efficiently incorporating the effect of such a data modification into the classifier without actually re-training it. The proposed method provides bounds on the unknown optimal classifier at a cost proportional only to the size of the data modification. We demonstrate through numerical experiments that the proposed method provides sufficiently tight bounds with negligible computational cost, especially when a small part of the dataset is modified in a large-scale classification problem.
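To make the idea of bounding the new optimum without re-training concrete, the following is a minimal sketch based on a generic strong-convexity argument. It is not claimed to be the exact bound derived in the paper. The assumptions are ours: an L2-regularized logistic-regression objective P(w) = (1/n) sum_i loss_i(w) + (lam/2)||w||^2, an exact minimizer w_old of the old objective, and a modification that replaces instances so that n is unchanged. Under these assumptions the gradient g of the new objective at w_old depends only on the removed and added instances, and the new minimizer lies in the ball with center w_old - g/(2*lam) and radius ||g||/(2*lam). All function names are hypothetical.

```python
import math

def logistic_loss_grad(w, x, y):
    """Gradient of log(1 + exp(-y * <w, x>)) with respect to w."""
    margin = y * sum(wj * xj for wj, xj in zip(w, x))
    # Derivative of log(1 + exp(-m)) in m is -1 / (1 + exp(m));
    # the two branches avoid overflow in exp().
    if margin >= 0:
        e = math.exp(-margin)
        coeff = -e / (1.0 + e)
    else:
        coeff = -1.0 / (1.0 + math.exp(margin))
    return [coeff * y * xj for xj in x]

def bound_new_optimum(w_old, removed, added, n, lam):
    """Ball (center, radius) containing the minimizer of the new objective.

    Assumes w_old exactly minimizes the old objective and that
    len(removed) == len(added), so n is unchanged (our simplification).
    Cost is proportional to the number of modified instances.
    """
    g = [0.0] * len(w_old)
    # The old objective has zero gradient at w_old, so only the
    # modified instances contribute to the new gradient there.
    for x, y in added:
        for j, gj in enumerate(logistic_loss_grad(w_old, x, y)):
            g[j] += gj / n
    for x, y in removed:
        for j, gj in enumerate(logistic_loss_grad(w_old, x, y)):
            g[j] -= gj / n
    center = [wj - gj / (2.0 * lam) for wj, gj in zip(w_old, g)]
    radius = math.sqrt(sum(gj * gj for gj in g)) / (2.0 * lam)
    return center, radius

def score_interval(center, radius, x):
    """Lower and upper bounds on <w_new, x> for any w_new in the ball."""
    mid = sum(cj * xj for cj, xj in zip(center, x))
    slack = radius * math.sqrt(sum(xj * xj for xj in x))
    return mid - slack, mid + slack
```

If both ends of the interval returned by `score_interval` have the same sign, the predicted label of that test point under the new (unknown) optimal classifier is determined without re-training. In practice w_old is only approximately optimal, which this sketch does not account for.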

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1606.00136 [stat.ML]
	(or arXiv:1606.00136v1 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1606.00136

Submission history

From: Ichiro Takeuchi Prof.
[v1] Wed, 1 Jun 2016 06:56:17 UTC (730 KB)
