On the Importance of Firth Bias Reduction in Few-Shot Classification

Ghaffari, Saba; Saleh, Ehsan; Forsyth, David; Wang, Yu-xiong

Computer Science > Computer Vision and Pattern Recognition

arXiv:2110.02529 (cs)

[Submitted on 6 Oct 2021 (v1), last revised 14 Apr 2022 (this version, v2)]

Title:On the Importance of Firth Bias Reduction in Few-Shot Classification

Authors:Saba Ghaffari, Ehsan Saleh, David Forsyth, Yu-xiong Wang

View PDF

Abstract:Learning accurate classifiers for novel categories from very few examples, known as few-shot image classification, is a challenging task in statistical machine learning and computer vision. The performance in few-shot classification suffers from the bias in the estimation of classifier parameters; however, an effective underlying bias reduction technique that could alleviate this issue in training few-shot classifiers has been overlooked. In this work, we demonstrate the effectiveness of Firth bias reduction in few-shot classification. Theoretically, Firth bias reduction removes the $O(N^{-1})$ first order term from the small-sample bias of the Maximum Likelihood Estimator. Here we show that the general Firth bias reduction technique simplifies to encouraging uniform class assignment probabilities for multinomial logistic classification, and almost has the same effect in cosine classifiers. We derive an easy-to-implement optimization objective for Firth penalized multinomial logistic and cosine classifiers, which is equivalent to penalizing the cross-entropy loss with a KL-divergence between the uniform label distribution and the predictions. Then, we empirically evaluate that it is consistently effective across the board for few-shot image classification, regardless of (1) the feature representations from different backbones, (2) the number of samples per class, and (3) the number of classes. Finally, we show the robustness of Firth bias reduction, in the case of imbalanced data distribution. Our implementation is available at this https URL

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2110.02529 [cs.CV]
	(or arXiv:2110.02529v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2110.02529

Submission history

From: Saba Ghaffari [view email]
[v1] Wed, 6 Oct 2021 06:32:37 UTC (551 KB)
[v2] Thu, 14 Apr 2022 21:54:00 UTC (667 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:On the Importance of Firth Bias Reduction in Few-Shot Classification

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:On the Importance of Firth Bias Reduction in Few-Shot Classification

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators