Mask-Encoded Sparsification: Mitigating Biased Gradients in Communication-Efficient Split Learning

Zhou, Wenxuan; Qu, Zhihao; Lyu, Shen-Huan; Cai, Miao; Ye, Baoliu

Computer Science > Machine Learning

arXiv:2408.13787 (cs)

[Submitted on 25 Aug 2024 (v1), last revised 27 Sep 2024 (this version, v3)]

Title:Mask-Encoded Sparsification: Mitigating Biased Gradients in Communication-Efficient Split Learning

Authors:Wenxuan Zhou, Zhihao Qu, Shen-Huan Lyu, Miao Cai, Baoliu Ye

View PDF

Abstract:This paper introduces a novel framework designed to achieve a high compression ratio in Split Learning (SL) scenarios where resource-constrained devices are involved in large-scale model training. Our investigations demonstrate that compressing feature maps within SL leads to biased gradients that can negatively impact the convergence rates and diminish the generalization capabilities of the resulting models. Our theoretical analysis provides insights into how compression errors critically hinder SL performance, which previous methodologies underestimate. To address these challenges, we employ a narrow bit-width encoded mask to compensate for the sparsification error without increasing the order of time complexity. Supported by rigorous theoretical analysis, our framework significantly reduces compression errors and accelerates the convergence. Extensive experiments also verify that our method outperforms existing solutions regarding training efficiency and communication complexity.

Subjects:	Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC)
Cite as:	arXiv:2408.13787 [cs.LG]
	(or arXiv:2408.13787v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2408.13787
Journal reference:	Proceedings of the 27th European Conference on Artificial Intelligence, 2024

Submission history

From: Wenxuan Zhou [view email]
[v1] Sun, 25 Aug 2024 09:30:34 UTC (296 KB)
[v2] Wed, 18 Sep 2024 06:44:48 UTC (296 KB)
[v3] Fri, 27 Sep 2024 03:07:05 UTC (296 KB)

Computer Science > Machine Learning

Title:Mask-Encoded Sparsification: Mitigating Biased Gradients in Communication-Efficient Split Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Mask-Encoded Sparsification: Mitigating Biased Gradients in Communication-Efficient Split Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators