Privacy Aware K-Means Clustering with High Utility.

TD Nguyen, S Gupta, S Rana, S Venkatesh - Advances in Knowledge …, 2016 - Springer

Advances in Knowledge Discovery and Data Mining: 20th Pacific-Asia Conference …, 2016, Springer

Abstract

Privacy-preserving data mining aims to keep data safe yet useful, but algorithms that provide strong guarantees often end up with low utility. We propose a novel privacy-preserving framework that prevents an adversary from inferring an unknown data point by ensuring that the estimation error is almost invariant to the inclusion or exclusion of that data point. By focusing directly on the estimation error of the data point, our framework significantly lowers the perturbation required. We use this framework to propose a new privacy-aware K-means clustering algorithm. Using both synthetic and real datasets, we demonstrate that the utility of this algorithm is almost equal to that of unperturbed K-means and, at strict privacy levels, almost twice as good as that of its differential privacy counterpart.
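For orientation, the "differential privacy counterpart" mentioned in the abstract is typically a K-means variant that perturbs each centroid update. The abstract does not specify either algorithm, so the following is only a minimal sketch of one common Laplace-mechanism baseline, not the authors' method. The function name `noisy_kmeans_step`, the assumption that every coordinate lies in [-1, 1], and the even split of a per-iteration budget `epsilon` between counts and sums are all illustrative assumptions.

```python
import numpy as np

def noisy_kmeans_step(X, centroids, epsilon, rng):
    """One Lloyd update with Laplace noise on per-cluster counts and sums.

    Assumption: every coordinate of X lies in [-1, 1], so adding or removing
    one point changes a cluster count by at most 1 and a cluster sum by at
    most d in L1 norm. Half of epsilon is spent on counts, half on sums.
    """
    n, d = X.shape
    k = centroids.shape[0]
    # Assign each point to its nearest centroid (squared Euclidean distance).
    dists = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    labels = dists.argmin(axis=1)
    new_centroids = np.empty_like(centroids, dtype=float)
    for j in range(k):
        members = X[labels == j]
        # Laplace scale = sensitivity / (epsilon / 2).
        noisy_count = len(members) + rng.laplace(scale=2.0 / epsilon)
        noisy_sum = members.sum(axis=0) + rng.laplace(scale=2.0 * d / epsilon, size=d)
        # Guard against tiny or negative noisy counts, then clip to the data range.
        new_centroids[j] = np.clip(noisy_sum / max(noisy_count, 1.0), -1.0, 1.0)
    return new_centroids, labels

# Usage on hypothetical data: two well-separated blobs inside [-1, 1]^2.
rng = np.random.default_rng(0)
X = np.clip(np.vstack([rng.normal(-0.5, 0.1, (200, 2)),
                       rng.normal(0.5, 0.1, (200, 2))]), -1.0, 1.0)
init = np.array([[-0.4, -0.4], [0.4, 0.4]])
private_centroids, labels = noisy_kmeans_step(X, init, epsilon=1.0, rng=rng)
```

Smaller values of `epsilon` inject more noise into every update, which is the utility loss the abstract attributes to strong guarantees.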
