Federated Forest

Liu, Yang; Liu, Yingting; Liu, Zhijie; Zhang, Junbo; Meng, Chuishi; Zheng, Yu

doi:10.1109/TBDATA.2020.2992755

Computer Science > Machine Learning

arXiv:1905.10053 (cs)

[Submitted on 24 May 2019]

Title:Federated Forest

Authors:Yang Liu, Yingting Liu, Zhijie Liu, Junbo Zhang, Chuishi Meng, Yu Zheng

View PDF

Abstract:Most real-world data are scattered across different companies or government organizations, and cannot be easily integrated under data privacy and related regulations such as the European Union's General Data Protection Regulation (GDPR) and China' Cyber Security Law. Such data islands situation and data privacy & security are two major challenges for applications of artificial intelligence. In this paper, we tackle these challenges and propose a privacy-preserving machine learning model, called Federated Forest, which is a lossless learning model of the traditional random forest method, i.e., achieving the same level of accuracy as the non-privacy-preserving approach. Based on it, we developed a secure cross-regional machine learning system that allows a learning process to be jointly trained over different regions' clients with the same user samples but different attribute sets, processing the data stored in each of them without exchanging their raw data. A novel prediction algorithm was also proposed which could largely reduce the communication overhead. Experiments on both real-world and UCI data sets demonstrate the performance of the Federated Forest is as accurate as the non-federated version. The efficiency and robustness of our proposed system had been verified. Overall, our model is practical, scalable and extensible for real-life tasks.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1905.10053 [cs.LG]
	(or arXiv:1905.10053v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1905.10053
Related DOI:	https://doi.org/10.1109/TBDATA.2020.2992755

Submission history

From: Yang Liu [view email]
[v1] Fri, 24 May 2019 06:38:01 UTC (1,510 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2019-05

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Yang Liu
Yingting Liu
Zhijie Liu
Junbo Zhang
Chuishi Meng

…

export BibTeX citation

Computer Science > Machine Learning

Title:Federated Forest

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Federated Forest

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators