SparseDM: Toward Sparse Efficient Diffusion Models

Wang, Kafeng; Chen, Jianfei; Li, He; Mi, Zhenpeng; Zhu, Jun

arXiv:2404.10445 (cs)

[Submitted on 16 Apr 2024 (v1), last revised 20 Nov 2024 (this version, v3)]

Abstract: Diffusion models have been extensively used in data generation tasks and are recognized as one of the best generative models. However, their time-consuming deployment, long inference time, and large memory requirements limit their application on mobile devices. In this paper, we propose a method based on an improved Straight-Through Estimator to improve the deployment efficiency of diffusion models. Specifically, we add sparse masks to the Convolution and Linear layers in a pre-trained diffusion model, then design a progressive sparsity schedule for model training in the fine-tuning stage, and switch the inference mask on and off, which supports a flexible choice of sparsity during inference according to the FID and MACs requirements. Experiments on four datasets conducted on a state-of-the-art Transformer-based diffusion model demonstrate that our method reduces MACs by $50\%$ while increasing FID by only 1.5 on average. Under other MACs conditions, our FID is also lower than that of other methods by 1$\sim$137.
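
The masking idea described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: it uses a single linear unit in pure Python, a plain (not improved) straight-through estimator, a magnitude-based mask, and a linear sparsity ramp, all of which are assumptions made for illustration. The function names `magnitude_mask`, `forward`, `ste_step`, and `progressive_sparsity` are hypothetical.

```python
def magnitude_mask(weights, sparsity):
    """Return a 0/1 mask that zeroes the smallest-magnitude fraction of weights.
    Magnitude-based selection is an assumption, not taken from the abstract."""
    n_prune = int(len(weights) * sparsity)
    # Indices sorted by |w| ascending; the first n_prune are masked out.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned = set(order[:n_prune])
    return [0.0 if i in pruned else 1.0 for i in range(len(weights))]


def forward(weights, mask, x, use_mask=True):
    """Output of one linear unit, with the sparse mask switched on or off."""
    if use_mask:
        return sum(w * m * xi for w, m, xi in zip(weights, mask, x))
    return sum(w * xi for w, xi in zip(weights, x))


def ste_step(weights, mask, x, target, lr=0.1):
    """One SGD step on 0.5 * (output - target)^2 with a plain STE:
    the forward pass uses masked weights, but the gradient with respect to
    the masked weights is passed unchanged to every dense weight."""
    err = forward(weights, mask, x) - target
    return [w - lr * err * xi for w, xi in zip(weights, x)]


def progressive_sparsity(step, ramp_steps, final_sparsity):
    """Linearly ramp sparsity from 0 to final_sparsity (hypothetical schedule)."""
    return final_sparsity * min(1.0, step / ramp_steps)


# Toy fine-tuning loop: sparsity grows gradually while the dense weights adapt.
w = [0.9, -0.05, 0.4, 0.01]
x = [1.0, 2.0, -1.0, 0.5]
for step in range(1, 21):
    s = progressive_sparsity(step, ramp_steps=10, final_sparsity=0.5)
    mask = magnitude_mask(w, s)
    w = ste_step(w, mask, x, target=1.0)

# At inference, the same weights can be evaluated with the mask on or off.
sparse_out = forward(w, magnitude_mask(w, 0.5), x, use_mask=True)
dense_out = forward(w, magnitude_mask(w, 0.5), x, use_mask=False)
```

Because the dense weights keep receiving gradients even while masked, a weight that is pruned at one step can regain magnitude and re-enter the mask later, which is the usual motivation for training sparse masks with a straight-through estimator.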

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2404.10445 [cs.LG]
	(or arXiv:2404.10445v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2404.10445

Submission history

From: Kafeng Wang
[v1] Tue, 16 Apr 2024 10:31:06 UTC (1,773 KB)
[v2] Fri, 31 May 2024 02:56:14 UTC (1,404 KB)
[v3] Wed, 20 Nov 2024 04:51:59 UTC (1,915 KB)
