Estimation of KL Divergence: Optimal Minimax Rate

Bu, Yuheng; Zou, Shaofeng; Liang, Yingbin; Veeravalli, Venugopal V.

Computer Science > Information Theory

arXiv:1607.02653 (cs)

[Submitted on 9 Jul 2016 (v1), last revised 20 Feb 2018 (this version, v4)]

Title:Estimation of KL Divergence: Optimal Minimax Rate

Authors:Yuheng Bu, Shaofeng Zou, Yingbin Liang, Venugopal V. Veeravalli

View PDF

Abstract:The problem of estimating the Kullback-Leibler divergence $D(P\|Q)$ between two unknown distributions $P$ and $Q$ is studied, under the assumption that the alphabet size $k$ of the distributions can scale to infinity. The estimation is based on $m$ independent samples drawn from $P$ and $n$ independent samples drawn from $Q$. It is first shown that there does not exist any consistent estimator that guarantees asymptotically small worst-case quadratic risk over the set of all pairs of distributions. A restricted set that contains pairs of distributions, with density ratio bounded by a function $f(k)$ is further considered. {An augmented plug-in estimator is proposed, and its worst-case quadratic risk is shown to be within a constant factor of $(\frac{k}{m}+\frac{kf(k)}{n})^2+\frac{\log ^2 f(k)}{m}+\frac{f(k)}{n}$, if $m$ and $n$ exceed a constant factor of $k$ and $kf(k)$, respectively.} Moreover, the minimax quadratic risk is characterized to be within a constant factor of $(\frac{k}{m\log k}+\frac{kf(k)}{n\log k})^2+\frac{\log ^2 f(k)}{m}+\frac{f(k)}{n}$, if $m$ and $n$ exceed a constant factor of $k/\log(k)$ and $kf(k)/\log k$, respectively. The lower bound on the minimax quadratic risk is characterized by employing a generalized Le Cam's method. A minimax optimal estimator is then constructed by employing both the polynomial approximation and the plug-in approaches.

Comments:	IEEE Transactions on Information Theory
Subjects:	Information Theory (cs.IT)
Cite as:	arXiv:1607.02653 [cs.IT]
	(or arXiv:1607.02653v4 [cs.IT] for this version)
	https://doi.org/10.48550/arXiv.1607.02653

Submission history

From: Shaofeng Zou [view email]
[v1] Sat, 9 Jul 2016 20:02:01 UTC (40 KB)
[v2] Wed, 14 Sep 2016 18:30:36 UTC (153 KB)
[v3] Fri, 21 Apr 2017 18:05:19 UTC (160 KB)
[v4] Tue, 20 Feb 2018 23:24:14 UTC (162 KB)

Computer Science > Information Theory

Title:Estimation of KL Divergence: Optimal Minimax Rate

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Theory

Title:Estimation of KL Divergence: Optimal Minimax Rate

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators