Empirical Risk Minimization with Relative Entropy Regularization

Perlaza, Samir M.; Bisson, Gaetan; Esnaola, Iñaki; Jean-Marie, Alain; Rini, Stefano

doi:10.1109/TIT.2024.3365728

arXiv:2211.06617 (math)

[Submitted on 12 Nov 2022 (v1), last revised 8 Apr 2024 (this version, v5)]

Abstract: The empirical risk minimization (ERM) problem with relative entropy regularization (ERM-RER) is investigated under the assumption that the reference measure is a $\sigma$-finite measure, and not necessarily a probability measure. Under this assumption, which leads to a generalization of the ERM-RER problem allowing a larger degree of flexibility for incorporating prior knowledge, numerous relevant properties are stated. Among these properties, the solution to this problem, if it exists, is shown to be a unique probability measure, mutually absolutely continuous with the reference measure. Such a solution exhibits a probably-approximately-correct guarantee for the ERM problem independently of whether the latter possesses a solution. For a fixed dataset and under a specific condition, the empirical risk is shown to be a sub-Gaussian random variable when the models are sampled from the solution to the ERM-RER problem. The generalization capabilities of the solution to the ERM-RER problem (the Gibbs algorithm) are studied via the sensitivity of the expected empirical risk to deviations from such a solution towards alternative probability measures. Finally, an interesting connection between sensitivity, generalization error, and lautum information is established.
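
As a rough illustration of the setting the abstract describes, the sketch below computes a Gibbs measure on a small finite model set, assuming the familiar form in which the solution's density with respect to the reference measure Q is proportional to exp(-risk/λ). The finite model set, the risk values, the reference weights, and the names `gibbs_measure` and `objective` are all hypothetical choices made for this example and are not taken from the paper. The reference weights deliberately do not sum to one, mimicking a reference measure that is not a probability measure.

```python
import numpy as np


def gibbs_measure(risks, ref_weights, lam):
    """Return P with P(theta) proportional to Q(theta) * exp(-risk(theta) / lam).

    Assumes a finite model set and strictly positive reference weights
    (both are assumptions made for this sketch).
    """
    risks = np.asarray(risks, dtype=float)
    ref_weights = np.asarray(ref_weights, dtype=float)
    # Work in the log domain and shift by the maximum for numerical stability.
    log_w = np.log(ref_weights) - risks / lam
    log_w -= log_w.max()
    w = np.exp(log_w)
    return w / w.sum()


def objective(p, risks, ref_weights, lam):
    """Expected empirical risk under p plus lam times D(p || Q).

    Q may be unnormalized, so the divergence term can be negative.
    """
    p = np.asarray(p, dtype=float)
    risks = np.asarray(risks, dtype=float)
    ref_weights = np.asarray(ref_weights, dtype=float)
    mask = p > 0  # terms with p = 0 contribute nothing
    kl = np.sum(p[mask] * np.log(p[mask] / ref_weights[mask]))
    return float(np.dot(p, risks) + lam * kl)


# Hypothetical empirical risks of four models on a fixed dataset.
risks = [0.9, 0.2, 0.5, 0.1]
# Hypothetical reference measure: positive weights that sum to 6.5, not 1.
ref_weights = [3.0, 1.0, 0.5, 2.0]
lam = 0.5  # regularization factor

p_star = gibbs_measure(risks, ref_weights, lam)
print("Gibbs measure:", p_star)
print("Total mass:", p_star.sum())
print("Regularized objective:", objective(p_star, risks, ref_weights, lam))
```

In this toy setting the output is a probability vector with strictly positive entries even though the reference weights are unnormalized, which mirrors the abstract's statement that the solution is a probability measure mutually absolutely continuous with the reference measure. Shrinking `lam` concentrates the mass on the model with the smallest empirical risk.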

Comments:	Appears in IEEE Transactions on Information Theory: submitted June 2023, revised October 2023, accepted January 2024, camera-ready February 2024. Also available as: Research Report, INRIA, No. RR-9454, Centre Inria d'Université Côte d'Azur, Sophia Antipolis, France, Feb. 2022. Last version: Version 7
Subjects:	Statistics Theory (math.ST); Information Theory (cs.IT); Machine Learning (cs.LG)
Report number:	RR-9454
Cite as:	arXiv:2211.06617 [math.ST]
	(or arXiv:2211.06617v5 [math.ST] for this version)
	https://doi.org/10.48550/arXiv.2211.06617
Related DOI:	https://doi.org/10.1109/TIT.2024.3365728

Submission history

From: Samir M. Perlaza
[v1] Sat, 12 Nov 2022 09:41:02 UTC (1,171 KB)
[v2] Mon, 12 Jun 2023 11:46:01 UTC (409 KB)
[v3] Tue, 21 Nov 2023 09:20:33 UTC (1,447 KB)
[v4] Tue, 27 Feb 2024 07:36:52 UTC (314 KB)
[v5] Mon, 8 Apr 2024 07:44:38 UTC (314 KB)
