Detection of data drift and outliers affecting machine learning model performance over time

Ackerman, Samuel; Farchi, Eitan; Raz, Orna; Zalmanovici, Marcel; Dube, Parijat

Statistics > Applications

arXiv:2012.09258 (stat)

[Submitted on 16 Dec 2020 (v1), last revised 6 Sep 2022 (this version, v3)]

Title:Detection of data drift and outliers affecting machine learning model performance over time

Authors:Samuel Ackerman, Eitan Farchi, Orna Raz, Marcel Zalmanovici, Parijat Dube

View PDF

Abstract:A trained ML model is deployed on another `test' dataset where target feature values (labels) are unknown. Drift is distribution change between the training and deployment data, which is concerning if model performance changes. For a cat/dog image classifier, for instance, drift during deployment could be rabbit images (new class) or cat/dog images with changed characteristics (change in distribution). We wish to detect these changes but can't measure accuracy without deployment data labels. We instead detect drift indirectly by nonparametrically testing the distribution of model prediction confidence for changes. This generalizes our method and sidesteps domain-specific feature representation.
We address important statistical issues, particularly Type-1 error control in sequential testing, using Change Point Models (CPMs; see Adams and Ross 2012). We also use nonparametric outlier methods to show the user suspicious observations for model diagnosis, since the before/after change confidence distributions overlap significantly. In experiments to demonstrate robustness, we train on a subset of MNIST digit classes, then insert drift (e.g., unseen digit class) in deployment data in various settings (gradual/sudden changes in the drift proportion). A novel loss function is introduced to compare the performance (detection delay, Type-1 and 2 errors) of a drift detector under different levels of drift class contamination.

Comments:	In: JSM Proceedings, Nonparametric Statistics Section, 20202. Philadelphia, PA: American Statistical Association. 144--160
Subjects:	Applications (stat.AP); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2012.09258 [stat.AP]
	(or arXiv:2012.09258v3 [stat.AP] for this version)
	https://doi.org/10.48550/arXiv.2012.09258

Submission history

From: Samuel Ackerman [view email]
[v1] Wed, 16 Dec 2020 20:50:12 UTC (135 KB)
[v2] Wed, 20 Jan 2021 09:31:46 UTC (135 KB)
[v3] Tue, 6 Sep 2022 07:23:55 UTC (134 KB)

Statistics > Applications

Title:Detection of data drift and outliers affecting machine learning model performance over time

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Applications

Title:Detection of data drift and outliers affecting machine learning model performance over time

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators