HTML conversions sometimes display errors due to content that did not convert correctly from the source. This paper uses the following packages that are not yet supported by the HTML conversion tool. Feedback on these issues are not necessary; they are known and are being worked on.

  • failed: utfsym

Authors: achieve the best HTML results from your LaTeX submissions by following these best practices.

License: arXiv.org perpetual non-exclusive license
arXiv:2403.04430v1 [cs.LG] 07 Mar 2024

On-demand Quantization for Green Federated Generative Diffusion in Mobile Edge Networks

Bingkun Lai1, Jiayi He1, Jiawen Kang1,Gaolei Li2,Minrui Xu3,Tao zhang4,Shengli Xie1 This work was supported by the National Natural Science Foundation of China (NSFC) under Grants No. 62102099, No. U22A2054, the Pearl River Talent Recruitment Program under Grant 2021QN02S643, the Talent Fund of Beijing Jiaotong University under Grant 2023XKRC050, the National Funded Postdoctoral Research Program under Grant GZC20230223, and Guangzhou Basic Research Program under Grant 2023A04J1699, and is also supported by Energy Research Test-Bed and Industry Partnership Funding Initiative, Energy Grid (EG) 2.0 programme, DesCartes and MOE Tier 1 (RG87/22). corresponding author: Jiawen Kang (e-mail: [email protected]) 1School of Automation, Guangdong University of Technology, Guangzhou, China
2School of Electronics Information and Electrical Engineering, Shanghai Jiao Tong University, Shanghai, China 3School of Computer Science and Engineering, Nanyang Technological University, Singapore
4 School of Software Engineering, Beijing Jiaotong University, Beijing, China
Abstract

Generative Artificial Intelligence (GAI) shows remarkable productivity and creativity in Mobile Edge Networks, such as the metaverse and the Industrial Internet of Things. Federated learning is a promising technique for effectively training GAI models in mobile edge networks due to its data distribution. However, there is a notable issue with communication consumption when training large GAI models like generative diffusion models in mobile edge networks. Additionally, the substantial energy consumption associated with training diffusion-based models, along with the limited resources of edge devices and complexities of network environments, pose challenges for improving the training efficiency of GAI models. To address this challenge, we propose an on-demand quantized energy-efficient federated diffusion approach for mobile edge networks. Specifically, we first design a dynamic quantized federated diffusion training scheme considering various demands from the edge devices. Then, we study an energy efficiency problem based on specific quantization requirements. Numerical results show that our proposed method significantly reduces system energy consumption and transmitted model size compared to both baseline federated diffusion and fixed quantized federated diffusion methods while effectively maintaining reasonable quality and diversity of generated data.

Index Terms:
Federated Diffusion, Energy Efficient, Generative AI, Generative Diffusion, On-demand Quantization.

I Introduction

As the carrier of content flow, mobile edge networks become essential fundamentals of next-generation applications like Metaverse [1] and the Industrial Internet of Things. The generative models like GAN [2] have demonstrated excellent performance in trajectory prediction [3], education [4], healthcare [5], and other scenarios involving the Internet of Things and the Internet of Vehicles. Therefore, more creative generative diffusion models are expected to be deployed in mobile edge networks for next-generation application scenarios such as the 6G communication networks [6] and vehicular metaverses [7][8]. Towards deploying the generative diffusion models in mobile edge networks, distributed training schemes based on federated learning called federated diffusions [9, 10]are proposed. These innovative models enable mobile edge networks to achieve higher productivity and efficiency in next-generation application scenarios.

During the training phase of federated diffusions, the model needs to be transferred between the server and edge devices at each training step to update the global model [11]. This is not a problem when training traditional AI models using federated learning due to their small number of model parameters. However, generative diffusion models are usually large, leading to significant energy expenditure during the federated training process [12]. Therefore, reducing training energy costs is crucial for improving overall operational efficiency in mobile edge networks [13].

Recent research on federated diffusions has primarily focused on improving their task performance [9]. This includes endeavors to elevate the quality and diversity of generated content. However, there is still limited depth and scope in studies that aim to optimize the overall training cost of these models. The authors in [14][15] studied the problem of energy-efficient resource allocation of FL over wireless communication networks. They derived the energy consumption models for FL based on the convergence rate analysis. The authors in [16] explored post-training quantization techniques for diffusion models, allowing direct quantization into 8 bits without significant performance degradation, with no substantial decline in performance. However, existing works fail to take into account the substantial training costs of diffusion models or the trade-off between performance and efficiency in the context of complex generative diffusion models [17].

To explore the deployment of a green generative diffusion model in mobile edge networks, we propose a dynamic quantization scheme for transmitting models during federated diffusion training. Firstly, we compress the diffusion models using a quantization scheme before transmission. We then study an energy consumption optimization problem and its solution. The performance of our proposed scheme is evaluated through simulations on the DDPM [17] model. Our main contributions can be summarized as follows:

  • We design a new and environmentally friendly federated generative diffusion framework that utilizes a dynamic method for parameter quantization and training in mobile edge networks.

  • We formulate an optimization problem for resource allocation in dynamic quantized federated diffusion, aiming to minimize total energy consumption while maintaining commendable performance.

  • Numerical results demonstrate the effectiveness of our proposed method compared to other baseline methods, particularly in terms of energy efficiency and sample quality.

The structure of the paper is organized as follows. The system model and the proposed on-demand quantized federated diffusion framework are introduced in section  II. Next, we study the energy efficiency optimization problem in section  III. Finally, We show the simulation results in section  IV and discuss the conclusion and future work in section  V.

II System Model

As shown in Fig. 1, we consider a mobile edge network scenario where a central server and k𝑘kitalic_k edge devices collaborate to train a diffusion model using federated learning. Given the inherent characteristics of large diffusion models, training them in federated learning scenarios can be exceptionally energy-intensive. A promising model compression method named stochastic quantization [18] is implemented prior to the transmission of model parameters from edge devices to the edge server for aggregation, this is done to mitigate the transmission costs of the training process. Additionally, we take into account the variable quantization level needs of edge devices, ensuring the flexibility of quantization to accommodate different device requirements. Furthermore, considering the heterogeneous nature of edge devices and their varying resource capacities, an energy optimization problem is formulated to further minimize the energy consumption during federated diffusion training. After the efficient training is done, the server could utilize the final global diffusion model for efficient and high-quality content generation. The learning process for each round of iteration is as follows:

  • Step 1: Given different quantization requirements, the central server determines the optimal strategy for each edge device to balance computing and communication resources based on the resource status of different devices.

  • Step 2: The edge devices then perform a local diffusion computation and transmission according to the optimal strategy.

  • Step 3: After receiving all local diffusions from edge devices, the central server uses an aggregation scheme (such as Fedavg [19]) to unite the local diffusion into a new global diffusion and send it back to the edge devices for next round of training.

Hereinafter, we introduce the concept of quantization, which is a promising method for compressing neural networks. Stochastic quantization can be efficiently used in the federated learning process to significantly reduce energy consumption while maintaining minimal impact on model performance. To minimize the cost of transmitting a comparatively large model, as in Fig. 1, we propose quantizing the local diffusion model before uploading it to the server, since the resources of edge devices are often limited. To train the federated diffusion model with a quantization scheme, we first define the stochastic quantization function as Q()𝑄Q(\cdot)italic_Q ( ⋅ ). Given the local diffusion weight 𝒘ksubscript𝒘𝑘{\boldsymbol{w}}_{k}bold_italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, the quantized weight can be expressed as 𝒘^k=Q(𝒘k)subscript^𝒘𝑘𝑄subscript𝒘𝑘\hat{\boldsymbol{w}}_{k}=Q({\boldsymbol{w}}_{k})over^ start_ARG bold_italic_w end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_Q ( bold_italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ). Let |𝒘k[n]|subscriptsuperscript𝒘delimited-[]𝑛𝑘|{\boldsymbol{w}}^{[n]}_{k}|| bold_italic_w start_POSTSUPERSCRIPT [ italic_n ] end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | denotes the absolute value of element in 𝒘ksubscript𝒘𝑘{\boldsymbol{w}}_{k}bold_italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, The stochastic quantization function is defined as

Refer to caption
Figure 1: On-demand quantized federated diffusion framework
Q(𝒘k)=asign(𝒘k){ql+1w.p. |𝒘k[n]|aqla(ql+1ql)qlw.p. aql+1|𝒘k[n]|a(ql+1ql)𝑄subscript𝒘𝑘𝑎𝑠𝑖𝑔𝑛subscript𝒘𝑘casessuperscript𝑞𝑙1w.p. |𝒘k[n]|aqla(ql+1ql)superscript𝑞𝑙w.p. aql+1|𝒘k[n]|a(ql+1ql)Q({\boldsymbol{w}}_{k})=a\cdot sign({\boldsymbol{w}}_{k})\cdot\begin{cases}q^{% l+1}&\text{w.p. $\frac{|{\boldsymbol{w}}^{[n]}_{k}|-aq^{l}}{a(q^{l+1}-q^{l})}$% }\\ q^{l}&\text{w.p. $\frac{aq^{l+1}-{|{\boldsymbol{w}}^{[n]}_{k}|}}{a(q^{l+1}-q^{% l})}$}\\ \end{cases}italic_Q ( bold_italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = italic_a ⋅ italic_s italic_i italic_g italic_n ( bold_italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ⋅ { start_ROW start_CELL italic_q start_POSTSUPERSCRIPT italic_l + 1 end_POSTSUPERSCRIPT end_CELL start_CELL w.p. divide start_ARG | bold_italic_w start_POSTSUPERSCRIPT [ italic_n ] end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | - italic_a italic_q start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT end_ARG start_ARG italic_a ( italic_q start_POSTSUPERSCRIPT italic_l + 1 end_POSTSUPERSCRIPT - italic_q start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT ) end_ARG end_CELL end_ROW start_ROW start_CELL italic_q start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT end_CELL start_CELL w.p. divide start_ARG italic_a italic_q start_POSTSUPERSCRIPT italic_l + 1 end_POSTSUPERSCRIPT - | bold_italic_w start_POSTSUPERSCRIPT [ italic_n ] end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | end_ARG start_ARG italic_a ( italic_q start_POSTSUPERSCRIPT italic_l + 1 end_POSTSUPERSCRIPT - italic_q start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT ) end_ARG end_CELL end_ROW (1)

Here, a𝑎aitalic_a is the scale factor and sgn()𝑠𝑔𝑛sgn(\cdot)italic_s italic_g italic_n ( ⋅ ) denotes the sign function which represents the sign of 𝒘ksubscript𝒘𝑘{\boldsymbol{w}}_{k}bold_italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. Moreover, [ql,ql+1]superscript𝑞𝑙superscript𝑞𝑙1[q^{l},q^{l+1}][ italic_q start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT , italic_q start_POSTSUPERSCRIPT italic_l + 1 end_POSTSUPERSCRIPT ] is the quantization interval such that for any |𝒘k[n]|subscriptsuperscript𝒘delimited-[]𝑛𝑘{|\boldsymbol{w}}^{[n]}_{k}|| bold_italic_w start_POSTSUPERSCRIPT [ italic_n ] end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | there exists |𝒘k[n]|a[ql,ql+1]subscriptsuperscript𝒘delimited-[]𝑛𝑘𝑎superscript𝑞𝑙superscript𝑞𝑙1\frac{|{\boldsymbol{w}}^{[n]}_{k}|}{a}\in[q^{l},q^{l+1}]divide start_ARG | bold_italic_w start_POSTSUPERSCRIPT [ italic_n ] end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | end_ARG start_ARG italic_a end_ARG ∈ [ italic_q start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT , italic_q start_POSTSUPERSCRIPT italic_l + 1 end_POSTSUPERSCRIPT ]. With the given quantization level Lksubscript𝐿𝑘L_{k}italic_L start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, qlsuperscript𝑞𝑙q^{l}italic_q start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT can be calculated as

ql=l(𝒘kmax𝒘kmin)a(ql+1ql)+𝒘kminasuperscript𝑞𝑙𝑙subscriptsuperscript𝒘𝑚𝑎𝑥𝑘subscriptsuperscript𝒘𝑚𝑖𝑛𝑘𝑎superscript𝑞𝑙1superscript𝑞𝑙subscriptsuperscript𝒘𝑚𝑖𝑛𝑘𝑎\begin{split}q^{l}=\frac{l(\boldsymbol{w}^{max}_{k}-\boldsymbol{w}^{min}_{k})}% {a(q^{l+1}-q^{l})}+\frac{\boldsymbol{w}^{min}_{k}}{a}\end{split}start_ROW start_CELL italic_q start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT = divide start_ARG italic_l ( bold_italic_w start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - bold_italic_w start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) end_ARG start_ARG italic_a ( italic_q start_POSTSUPERSCRIPT italic_l + 1 end_POSTSUPERSCRIPT - italic_q start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT ) end_ARG + divide start_ARG bold_italic_w start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG italic_a end_ARG end_CELL end_ROW (2)

Where 𝒘kmaxsubscriptsuperscript𝒘𝑚𝑎𝑥𝑘\boldsymbol{w}^{max}_{k}bold_italic_w start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and 𝒘kminsubscriptsuperscript𝒘𝑚𝑖𝑛𝑘\boldsymbol{w}^{min}_{k}bold_italic_w start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT represent the maximum and minimum value of non-zero element |𝒘k[n]|subscriptsuperscript𝒘delimited-[]𝑛𝑘|{\boldsymbol{w}}^{[n]}_{k}|| bold_italic_w start_POSTSUPERSCRIPT [ italic_n ] end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | , respectively.

Next, we formulate the computation and communication models of the proposed scheme. Let fksubscript𝑓𝑘f_{k}italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT represent the computation frequency of local client k𝑘kitalic_k, and Dksubscript𝐷𝑘D_{k}italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT denotes the data size of the local dataset. The computation time for training diffusion model is expressed by

Tkcmp=IkDkCfksubscriptsuperscript𝑇𝑐𝑚𝑝𝑘subscript𝐼𝑘subscript𝐷𝑘𝐶subscript𝑓𝑘\displaystyle\begin{split}{T^{cmp}_{k}}=\frac{I_{k}{D_{k}}{C}}{f_{k}}\end{split}start_ROW start_CELL italic_T start_POSTSUPERSCRIPT italic_c italic_m italic_p end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = divide start_ARG italic_I start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_C end_ARG start_ARG italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG end_CELL end_ROW (3)

where Iksubscript𝐼𝑘I_{k}italic_I start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and C𝐶Citalic_C denote the local iteration times in each communication round and the workload of local diffusion training, respectively. Following that, given the energy coefficient τksubscript𝜏𝑘\tau_{k}italic_τ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, the energy consumption of client k𝑘kitalic_k is estimated by

Ekcmp=τkfk2IkDkCsubscriptsuperscript𝐸𝑐𝑚𝑝𝑘subscript𝜏𝑘superscriptsubscript𝑓𝑘2subscript𝐼𝑘subscript𝐷𝑘𝐶\displaystyle\begin{split}{E^{cmp}_{k}}={\tau_{k}}{{f_{k}}^{2}{I_{k}}{D_{k}}{C% }}\end{split}start_ROW start_CELL italic_E start_POSTSUPERSCRIPT italic_c italic_m italic_p end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_τ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_C end_CELL end_ROW (4)

In the distributed diffusion setting, each edge device uploads the local diffusion model in order to generate a better global model. Moreover, the local diffusion model is quantized as 𝒘^ksubscript^𝒘𝑘\hat{\boldsymbol{w}}_{k}over^ start_ARG bold_italic_w end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT for efficiency improvement. To this end, we adopt a frequency division multiple access (FDMA) transmission scheme [20] for quantized local diffusion transmission. Therefore, with the transmission power Pksubscript𝑃𝑘P_{k}italic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, the uplink transmission rate of client k𝑘kitalic_k is deduced by

rk=Blog2(1+|h|2dηPkBN0)subscript𝑟𝑘𝐵subscript21superscript2superscript𝑑𝜂subscript𝑃𝑘𝐵subscript𝑁0\displaystyle\begin{split}{r_{k}}={B\log_{2}{(1+\frac{|h|^{2}d^{-\eta}P_{k}}{% BN_{0}})}}\end{split}start_ROW start_CELL italic_r start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_B roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + divide start_ARG | italic_h | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d start_POSTSUPERSCRIPT - italic_η end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG italic_B italic_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG ) end_CELL end_ROW (5)

Here, B𝐵Bitalic_B and N0subscript𝑁0N_{0}italic_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT denote the bandwidth and noise power-spectral-density each while d𝑑ditalic_d corresponds to the distance between the client and server. Meanwhile, hhitalic_h and η𝜂\etaitalic_η represent the Rayleigh channel coefficient and pathloss exponent, separately. Subsequently, given the updated model size Mksubscript𝑀𝑘M_{k}italic_M start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, and quantization level Lksubscript𝐿𝑘L_{k}italic_L start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, the time spent by client k𝑘kitalic_k to transmit the local model to the server is

Tkcom=MkLkrksubscriptsuperscript𝑇𝑐𝑜𝑚𝑘subscript𝑀𝑘subscript𝐿𝑘subscript𝑟𝑘\displaystyle\begin{split}{T^{com}_{k}}=\frac{M_{k}L_{k}}{r_{k}}\end{split}start_ROW start_CELL italic_T start_POSTSUPERSCRIPT italic_c italic_o italic_m end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = divide start_ARG italic_M start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG italic_r start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG end_CELL end_ROW (6)

Thus, the corresponding energy consumption is calculated by

Ekcom=PkTkcomsubscriptsuperscript𝐸𝑐𝑜𝑚𝑘subscript𝑃𝑘subscriptsuperscript𝑇𝑐𝑜𝑚𝑘\displaystyle\begin{split}{E^{com}_{k}}=P_{k}{T^{com}_{k}}\end{split}start_ROW start_CELL italic_E start_POSTSUPERSCRIPT italic_c italic_o italic_m end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_T start_POSTSUPERSCRIPT italic_c italic_o italic_m end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_CELL end_ROW (7)

III Energy Efficiency Optimization

Before the discussion of the energy efficiency problem of the proposed methods started, similar to the work in [15], we first present the following assumption and theorem. Where δksubscript𝛿𝑘\delta_{k}italic_δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT represents the unique bound demand of various edge devices, a smaller δksubscript𝛿𝑘\delta_{k}italic_δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT indicates that the resource of edge device k𝑘kitalic_k is relatively insufficient, which leads to a lower quantization level strategy:

Assumption 1.

the expectation of the square norm of the local weight uploaded by edge devices is bounded: for any uploaded weight, 𝔼𝐰k2δk𝔼superscriptnormsubscript𝐰𝑘2subscript𝛿𝑘{\mathbb{E}}\|{\boldsymbol{w}}_{k}\|^{2}\leq\delta_{k}blackboard_E ∥ bold_italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ italic_δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT.

Theorem 1.

Based on Assumptions 1, the square of local weight quantization error Δksubscriptnormal-Δ𝑘{\Delta}_{k}roman_Δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is bounded by:

Δk=𝔼𝒘k𝒘^k2δk2Lk2.subscriptΔ𝑘𝔼superscriptdelimited-∥∥subscript𝒘𝑘subscript^𝒘𝑘2subscript𝛿𝑘2subscriptsuperscript𝐿2𝑘\displaystyle\begin{split}{\Delta}_{k}={\mathbb{E}}\|{\boldsymbol{w}}_{k}-\hat% {\boldsymbol{w}}_{k}\|^{2}\leq\frac{\delta_{k}}{2L^{2}_{k}}.\end{split}start_ROW start_CELL roman_Δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = blackboard_E ∥ bold_italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - over^ start_ARG bold_italic_w end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ divide start_ARG italic_δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG . end_CELL end_ROW (8)

Through this theorem, we can easily obtain the corresponding quantization levels for each heterogeneous edge device at different demands, thereby further constructing our energy consumption optimization model, which we will discuss in detail in the next section.

III-A Problem Formulation

As a consequence of variations in resource capabilities among edge devices, variability exists in the required quantization level demands. In simpler terms, each device has its distinct upper bound for quantization error. Leveraging Theorem 1, we establish the energy minimization problem within the confines of this quantization error constraint as follows:

(P1)P1\displaystyle({\text{P1}})( P1 ) minPk,fk,Lk(Ekcmp\displaystyle\min\limits_{P_{k},f_{k},L_{k}}(E^{cmp}_{k}roman_min start_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_L start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_E start_POSTSUPERSCRIPT italic_c italic_m italic_p end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT +Ekcom)\displaystyle+E^{com}_{k})\quad+ italic_E start_POSTSUPERSCRIPT italic_c italic_o italic_m end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) (III-A)
subject to: Tkcmp+Tkcomsuperscriptsubscript𝑇𝑘𝑐𝑚𝑝superscriptsubscript𝑇𝑘𝑐𝑜𝑚absent\displaystyle T_{k}^{cmp}+T_{k}^{com}\leqitalic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_c italic_m italic_p end_POSTSUPERSCRIPT + italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_c italic_o italic_m end_POSTSUPERSCRIPT ≤ Tkmax,ksuperscriptsubscript𝑇𝑘𝑚𝑎𝑥for-all𝑘\displaystyle\;{T_{k}^{max}},\forall kitalic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT , ∀ italic_k (9a)
𝔼𝒘k𝒘^k2𝔼superscriptnormsubscript𝒘𝑘subscript^𝒘𝑘2absent\displaystyle{\mathbb{E}}\|{\boldsymbol{w}}_{k}-\hat{\boldsymbol{w}}_{k}\|^{2}\leqblackboard_E ∥ bold_italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - over^ start_ARG bold_italic_w end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ δk2Lk2,ksubscript𝛿𝑘2subscriptsuperscript𝐿2𝑘for-all𝑘\displaystyle\;\frac{\delta_{k}}{2L^{2}_{k}},\forall kdivide start_ARG italic_δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG , ∀ italic_k (9b)
PkminPksubscriptsuperscript𝑃𝑚𝑖𝑛𝑘subscript𝑃𝑘absent\displaystyle P^{min}_{k}\leq P_{k}\leqitalic_P start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≤ italic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≤ Pkmax,ksubscriptsuperscript𝑃𝑚𝑎𝑥𝑘for-all𝑘\displaystyle\;P^{max}_{k},\forall kitalic_P start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , ∀ italic_k (9c)
fkminfksuperscriptsubscript𝑓𝑘𝑚𝑖𝑛subscript𝑓𝑘absent\displaystyle f_{k}^{min}\leq f_{k}\leqitalic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT ≤ italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≤ fkmax,ksuperscriptsubscript𝑓𝑘𝑚𝑎𝑥for-all𝑘\displaystyle\;f_{k}^{max},\forall kitalic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT , ∀ italic_k (9d)

With the unique δksubscript𝛿𝑘{\delta}_{k}italic_δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT given by the different bound requirements of local edge devices, we can always obtain the optimal solution Lk*=δk2Δksubscriptsuperscript𝐿𝑘subscript𝛿𝑘2subscriptΔ𝑘L^{*}_{k}=\sqrt{\frac{{\delta}_{k}}{2{\Delta}_{k}}}italic_L start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = square-root start_ARG divide start_ARG italic_δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG 2 roman_Δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG end_ARG for the optimization problem. As a result, we simplify P1 as

(P2)P2\displaystyle({\text{P2}})( P2 ) minPk,fk(Ekcmp\displaystyle\min\limits_{P_{k},f_{k}}(E^{cmp}_{k}roman_min start_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_E start_POSTSUPERSCRIPT italic_c italic_m italic_p end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT +Ekcom)\displaystyle+E^{com}_{k})\quad+ italic_E start_POSTSUPERSCRIPT italic_c italic_o italic_m end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) (III-A)
subject to: Tkcmp+Tkcomsuperscriptsubscript𝑇𝑘𝑐𝑚𝑝superscriptsubscript𝑇𝑘𝑐𝑜𝑚absent\displaystyle T_{k}^{cmp}+T_{k}^{com}\leqitalic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_c italic_m italic_p end_POSTSUPERSCRIPT + italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_c italic_o italic_m end_POSTSUPERSCRIPT ≤ Tkmax,ksuperscriptsubscript𝑇𝑘𝑚𝑎𝑥for-all𝑘\displaystyle\;{T_{k}^{max}},\forall kitalic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT , ∀ italic_k (10a)
PkminPksubscriptsuperscript𝑃𝑚𝑖𝑛𝑘subscript𝑃𝑘absent\displaystyle P^{min}_{k}\leq P_{k}\leqitalic_P start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≤ italic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≤ Pkmax,ksubscriptsuperscript𝑃𝑚𝑎𝑥𝑘for-all𝑘\displaystyle\;P^{max}_{k},\forall kitalic_P start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , ∀ italic_k (10b)
fkminfksuperscriptsubscript𝑓𝑘𝑚𝑖𝑛subscript𝑓𝑘absent\displaystyle f_{k}^{min}\leq f_{k}\leqitalic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT ≤ italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≤ fkmax,ksuperscriptsubscript𝑓𝑘𝑚𝑎𝑥for-all𝑘\displaystyle\;f_{k}^{max},\forall kitalic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT , ∀ italic_k (10c)

Following that, we transform P2 into a more tractable form by introducing two intermediate variables θk>0subscript𝜃𝑘0\theta_{k}>0italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT > 0 and πk>0subscript𝜋𝑘0\pi_{k}>0italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT > 0. Moreover, we let θksubscript𝜃𝑘\theta_{k}italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and πksubscript𝜋𝑘\pi_{k}italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT represent the weight factors of maximum time budget for client k𝑘kitalic_k such that

θkTkcmp=IkDkCfk,πkTkcom=Mklog2(Lk)rksubscript𝜃𝑘superscriptsubscript𝑇𝑘𝑐𝑚𝑝subscript𝐼𝑘subscript𝐷𝑘𝐶subscript𝑓𝑘,subscript𝜋𝑘superscriptsubscript𝑇𝑘𝑐𝑜𝑚subscript𝑀𝑘subscript2subscript𝐿𝑘subscript𝑟𝑘\displaystyle\begin{split}{\theta_{k}}{T_{k}^{cmp}}=\frac{I_{k}{D_{k}}{C}}{f_{% k}}\text{,}\\ {\pi_{k}}{T_{k}^{com}}=\frac{M_{k}\log_{2}{(L_{k})}}{r_{k}}\end{split}start_ROW start_CELL italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_c italic_m italic_p end_POSTSUPERSCRIPT = divide start_ARG italic_I start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_C end_ARG start_ARG italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG , end_CELL end_ROW start_ROW start_CELL italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_c italic_o italic_m end_POSTSUPERSCRIPT = divide start_ARG italic_M start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_L start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) end_ARG start_ARG italic_r start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG end_CELL end_ROW (11)

Here, the lower bound of θksubscript𝜃𝑘\theta_{k}italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and πksubscript𝜋𝑘\pi_{k}italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT can be easily acquired given the optimal Lksubscript𝐿𝑘L_{k}italic_L start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT

θkmin=IkDkCfkmaxTkmax,πkmin=Mklog2(Lk*)BTkmaxlog2(1+|h|2dηPkmaxBN0)superscriptsubscript𝜃𝑘𝑚𝑖𝑛subscript𝐼𝑘subscript𝐷𝑘𝐶superscriptsubscript𝑓𝑘𝑚𝑎𝑥superscriptsubscript𝑇𝑘𝑚𝑎𝑥,subscriptsuperscript𝜋𝑚𝑖𝑛𝑘subscript𝑀𝑘subscript2subscriptsuperscript𝐿𝑘𝐵superscriptsubscript𝑇𝑘𝑚𝑎𝑥subscript21superscript2superscript𝑑𝜂subscriptsuperscript𝑃𝑚𝑎𝑥𝑘𝐵subscript𝑁0\displaystyle\begin{split}{\theta_{k}^{min}}=\frac{I_{k}{D_{k}}{C}}{f_{k}^{max% }T_{k}^{max}}\text{,}\\ {\pi^{min}_{k}}=\frac{M_{k}\log_{2}{(L^{*}_{k})}}{{BT_{k}^{max}\log_{2}{(1+% \frac{|h|^{2}d^{-\eta}P^{max}_{k}}{BN_{0}})}}}\end{split}start_ROW start_CELL italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT = divide start_ARG italic_I start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_C end_ARG start_ARG italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT end_ARG , end_CELL end_ROW start_ROW start_CELL italic_π start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = divide start_ARG italic_M start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_L start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) end_ARG start_ARG italic_B italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + divide start_ARG | italic_h | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d start_POSTSUPERSCRIPT - italic_η end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG italic_B italic_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG ) end_ARG end_CELL end_ROW (12)

Furthermore, the total energy consumption of client k𝑘kitalic_k during the fine-tuning process can be rewritten in the following form

Ek=Ekcmp+Ekcom=τIk3Dk3Ck3θk2(Tkmax)2+N0BTmax|h|2dη(2Mklog2(Lk*)πkBTkmax1)subscript𝐸𝑘subscriptsuperscript𝐸𝑐𝑚𝑝𝑘subscriptsuperscript𝐸𝑐𝑜𝑚𝑘𝜏subscriptsuperscript𝐼3𝑘subscriptsuperscript𝐷3𝑘subscriptsuperscript𝐶3𝑘superscriptsubscript𝜃𝑘2superscriptsuperscriptsubscript𝑇𝑘𝑚𝑎𝑥2subscript𝑁0𝐵superscript𝑇𝑚𝑎𝑥superscript2superscript𝑑𝜂superscript2subscript𝑀𝑘subscript2subscriptsuperscript𝐿𝑘subscript𝜋𝑘𝐵superscriptsubscript𝑇𝑘𝑚𝑎𝑥1\displaystyle\begin{split}{E_{k}}&=E^{cmp}_{k}+E^{com}_{k}\\ &=\frac{\tau I^{3}_{k}D^{3}_{k}C^{3}_{k}}{\theta_{k}^{2}(T_{k}^{max})^{2}}+% \frac{N_{0}BT^{max}}{\lvert h\rvert^{2}d^{-\eta}}\big{(}2^{\frac{M_{k}\log_{2}% {(L^{*}_{k})}}{\pi_{k}BT_{k}^{max}}}-1\big{)}\end{split}start_ROW start_CELL italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_CELL start_CELL = italic_E start_POSTSUPERSCRIPT italic_c italic_m italic_p end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_E start_POSTSUPERSCRIPT italic_c italic_o italic_m end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = divide start_ARG italic_τ italic_I start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_D start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_C start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG + divide start_ARG italic_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_B italic_T start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT end_ARG start_ARG | italic_h | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d start_POSTSUPERSCRIPT - italic_η end_POSTSUPERSCRIPT end_ARG ( 2 start_POSTSUPERSCRIPT divide start_ARG italic_M start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_L start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) end_ARG start_ARG italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_B italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT end_ARG end_POSTSUPERSCRIPT - 1 ) end_CELL end_ROW (13)

Thus, we convert problem P3 into the following form

(P3)P3\displaystyle(\text{P3})( P3 ) minθk,πkEksubscriptsubscript𝜃𝑘subscript𝜋𝑘subscript𝐸𝑘\displaystyle\min\limits_{\theta_{k},\pi_{k}}E_{k}\quad\thinspaceroman_min start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT (III-A)
subject to: θk+πk=1,ksubscript𝜃𝑘subscript𝜋𝑘1for-all𝑘\displaystyle\theta_{k}+\pi_{k}=1,\forall kitalic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = 1 , ∀ italic_k (14a)
θkminθk,ksuperscriptsubscript𝜃𝑘𝑚𝑖𝑛subscript𝜃𝑘for-all𝑘\displaystyle\;\theta_{k}^{min}\leq\theta_{k},\forall kitalic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT ≤ italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , ∀ italic_k (14b)
πkminπk,ksuperscriptsubscript𝜋𝑘𝑚𝑖𝑛subscript𝜋𝑘for-all𝑘\displaystyle\;\pi_{k}^{min}\leq\pi_{k},\forall kitalic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT ≤ italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , ∀ italic_k (14c)

Through this basic form, we can readily acquire the numerical solution for the original problem. In the next subsection, we will present the solution to address the current matter.

III-B Solution

It can be easily proved that problem P3 is a convex problem, which can be effectively solved by applying the Karush-Kuhn-Tucker (KKT) conditions [21]. With the optimal energy optimization solution, we can decide the final resource allocation scheme for the federated diffusion. The Lagrange function of P3 is as follows:

𝑳(Pk,fk,νk,ζkθ,ζkπ)=τIk3Dk3Ck3θk2(Tkmax)2+N0BTkmax|h|2dη(2Mklog2(Lk*)πkBTkmax1)+νk(θk+πk1)+ζkθ(θkminθk)+ζkπ(πkminπk)𝑳subscript𝑃𝑘subscript𝑓𝑘subscript𝜈𝑘superscriptsubscript𝜁𝑘𝜃superscriptsubscript𝜁𝑘𝜋𝜏subscriptsuperscript𝐼3𝑘subscriptsuperscript𝐷3𝑘subscriptsuperscript𝐶3𝑘superscriptsubscript𝜃𝑘2superscriptsuperscriptsubscript𝑇𝑘𝑚𝑎𝑥2subscript𝑁0𝐵superscriptsubscript𝑇𝑘𝑚𝑎𝑥superscript2superscript𝑑𝜂superscript2subscript𝑀𝑘subscript2subscriptsuperscript𝐿𝑘subscript𝜋𝑘𝐵superscriptsubscript𝑇𝑘𝑚𝑎𝑥1subscript𝜈𝑘subscript𝜃𝑘subscript𝜋𝑘1superscriptsubscript𝜁𝑘𝜃superscriptsubscript𝜃𝑘𝑚𝑖𝑛subscript𝜃𝑘superscriptsubscript𝜁𝑘𝜋superscriptsubscript𝜋𝑘𝑚𝑖𝑛subscript𝜋𝑘\displaystyle\begin{split}{\boldsymbol{L}}(P_{k},f_{k},{\nu}_{k},\zeta_{k}^{% \theta},\zeta_{k}^{\pi})=\frac{\tau I^{3}_{k}D^{3}_{k}C^{3}_{k}}{\theta_{k}^{2% }(T_{k}^{max})^{2}}\\ +\frac{N_{0}BT_{k}^{max}}{\lvert h\rvert^{2}d^{-\eta}}\big{(}2^{\frac{M_{k}% \log_{2}{(L^{*}_{k})}}{\pi_{k}BT_{k}^{max}}}-1\big{)}+{\nu}_{k}(\theta_{k}+\pi% _{k}-1)\\ +\zeta_{k}^{\theta}(\theta_{k}^{min}-\theta_{k})+\zeta_{k}^{\pi}(\pi_{k}^{min}% -\pi_{k})\end{split}start_ROW start_CELL bold_italic_L ( italic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ν start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ζ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_θ end_POSTSUPERSCRIPT , italic_ζ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_π end_POSTSUPERSCRIPT ) = divide start_ARG italic_τ italic_I start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_D start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_C start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG end_CELL end_ROW start_ROW start_CELL + divide start_ARG italic_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_B italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT end_ARG start_ARG | italic_h | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d start_POSTSUPERSCRIPT - italic_η end_POSTSUPERSCRIPT end_ARG ( 2 start_POSTSUPERSCRIPT divide start_ARG italic_M start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_L start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) end_ARG start_ARG italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_B italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT end_ARG end_POSTSUPERSCRIPT - 1 ) + italic_ν start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - 1 ) end_CELL end_ROW start_ROW start_CELL + italic_ζ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_θ end_POSTSUPERSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT - italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) + italic_ζ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_π end_POSTSUPERSCRIPT ( italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT - italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) end_CELL end_ROW (15)

Here, νksubscript𝜈𝑘{\nu}_{k}italic_ν start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is the equality Lagrange multiplier associated with equality constraint (14a), while ζkθsuperscriptsubscript𝜁𝑘𝜃{\zeta}_{k}^{\theta}italic_ζ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_θ end_POSTSUPERSCRIPT and ζkπsuperscriptsubscript𝜁𝑘𝜋\zeta_{k}^{\pi}italic_ζ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_π end_POSTSUPERSCRIPT denote the inequality Lagrange multiplier for constraints (14b) and (14c), respectively. In order to accomplish optimality for problem P3, we derive the necessary equations from the Lagrange function as follows:

{constraint (14a)-(14c)2τIk3Dk3Ck3θk3(Tkmax)2+νkζkθ=0N0BTkmax|h|2dη(2Mklog2(Lk*)πkBTkmax1ln(2)Mklog2(Lk*)πkBTkmax)+νkζkπ=0ζkθ(θkminθk)ζkπ(πkminπk)=0casesconstraint (14a)-(14c)2𝜏subscriptsuperscript𝐼3𝑘subscriptsuperscript𝐷3𝑘subscriptsuperscript𝐶3𝑘superscriptsubscript𝜃𝑘3superscriptsuperscriptsubscript𝑇𝑘𝑚𝑎𝑥2subscript𝜈𝑘superscriptsubscript𝜁𝑘𝜃0subscript𝑁0𝐵superscriptsubscript𝑇𝑘𝑚𝑎𝑥superscript2superscript𝑑𝜂superscript2subscript𝑀𝑘subscript2subscriptsuperscript𝐿𝑘subscript𝜋𝑘𝐵superscriptsubscript𝑇𝑘𝑚𝑎𝑥12subscript𝑀𝑘subscript2subscriptsuperscript𝐿𝑘subscript𝜋𝑘𝐵superscriptsubscript𝑇𝑘𝑚𝑎𝑥subscript𝜈𝑘superscriptsubscript𝜁𝑘𝜋0superscriptsubscript𝜁𝑘𝜃superscriptsubscript𝜃𝑘𝑚𝑖𝑛subscript𝜃𝑘superscriptsubscript𝜁𝑘𝜋superscriptsubscript𝜋𝑘𝑚𝑖𝑛subscript𝜋𝑘0\left\{\begin{array}[]{c}{\text{constraint (14a)-(14c)}}\\ \frac{2\tau I^{3}_{k}D^{3}_{k}C^{3}_{k}}{\theta_{k}^{3}(T_{k}^{max})^{2}}+{\nu% }_{k}-\zeta_{k}^{\theta}=0\\ \frac{N_{0}BT_{k}^{max}}{\lvert h\rvert^{2}d^{-\eta}}\left(2^{\frac{M_{k}\log_% {2}{(L^{*}_{k})}}{\pi_{k}BT_{k}^{max}}}-1-\frac{\ln{(2)}M_{k}\log_{2}{(L^{*}_{% k})}}{\pi_{k}BT_{k}^{max}}\right)+{\nu}_{k}-\zeta_{k}^{\pi}=0\\ \zeta_{k}^{\theta}(\theta_{k}^{min}-\theta_{k})-\zeta_{k}^{\pi}(\pi_{k}^{min}-% \pi_{k})=0\\ \end{array}\right.{ start_ARRAY start_ROW start_CELL constraint (14a)-(14c) end_CELL end_ROW start_ROW start_CELL divide start_ARG 2 italic_τ italic_I start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_D start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_C start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ( italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG + italic_ν start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - italic_ζ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_θ end_POSTSUPERSCRIPT = 0 end_CELL end_ROW start_ROW start_CELL divide start_ARG italic_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_B italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT end_ARG start_ARG | italic_h | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d start_POSTSUPERSCRIPT - italic_η end_POSTSUPERSCRIPT end_ARG ( 2 start_POSTSUPERSCRIPT divide start_ARG italic_M start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_L start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) end_ARG start_ARG italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_B italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT end_ARG end_POSTSUPERSCRIPT - 1 - divide start_ARG roman_ln ( 2 ) italic_M start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_L start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) end_ARG start_ARG italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_B italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT end_ARG ) + italic_ν start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - italic_ζ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_π end_POSTSUPERSCRIPT = 0 end_CELL end_ROW start_ROW start_CELL italic_ζ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_θ end_POSTSUPERSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT - italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) - italic_ζ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_π end_POSTSUPERSCRIPT ( italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT - italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = 0 end_CELL end_ROW end_ARRAY (16)

Based on Eqns 16, there exist two cases that satisfy constrain (14b) concerning variable θksubscript𝜃𝑘\theta_{k}italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. If θk>θkminsubscript𝜃𝑘superscriptsubscript𝜃𝑘𝑚𝑖𝑛\theta_{k}>\theta_{k}^{min}italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT > italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT, the optimal solution of θk*subscriptsuperscript𝜃𝑘\theta^{*}_{k}italic_θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is obtained by

θk*=2τIk3Dk3Ck3νk(Tkmax)23subscriptsuperscript𝜃𝑘32𝜏subscriptsuperscript𝐼3𝑘subscriptsuperscript𝐷3𝑘subscriptsuperscript𝐶3𝑘subscript𝜈𝑘superscriptsuperscriptsubscript𝑇𝑘𝑚𝑎𝑥2\displaystyle\begin{split}\theta^{*}_{k}=\sqrt[3]{\frac{2\tau I^{3}_{k}D^{3}_{% k}C^{3}_{k}}{\nu_{k}(T_{k}^{max})^{2}}}\end{split}start_ROW start_CELL italic_θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = nth-root start_ARG 3 end_ARG start_ARG divide start_ARG 2 italic_τ italic_I start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_D start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_C start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG italic_ν start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG end_ARG end_CELL end_ROW (17)

Otherwise, we always have θk*=θkminsubscriptsuperscript𝜃𝑘superscriptsubscript𝜃𝑘𝑚𝑖𝑛\theta^{*}_{k}=\theta_{k}^{min}italic_θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT. Similarly to θksubscript𝜃𝑘\theta_{k}italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, when πk>πkminsubscript𝜋𝑘superscriptsubscript𝜋𝑘𝑚𝑖𝑛\pi_{k}>\pi_{k}^{min}italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT > italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT, the optimal πk*subscriptsuperscript𝜋𝑘\pi^{*}_{k}italic_π start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is acquired the same way. Given the equality Lagrange multiplier νksubscript𝜈𝑘\nu_{k}italic_ν start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, we have

Φ(πk0)=N0BTkmax|h|2dη(2Mklog2(Lk*)πk0BTkmax1ln(2)Mklog2(Lk*)πk0BTkmax)+νk=0Φsubscriptsuperscript𝜋0𝑘subscript𝑁0𝐵superscriptsubscript𝑇𝑘𝑚𝑎𝑥superscript2superscript𝑑𝜂superscript2subscript𝑀𝑘subscript2subscriptsuperscript𝐿𝑘subscriptsuperscript𝜋0𝑘𝐵superscriptsubscript𝑇𝑘𝑚𝑎𝑥12subscript𝑀𝑘subscript2subscriptsuperscript𝐿𝑘subscriptsuperscript𝜋0𝑘𝐵superscriptsubscript𝑇𝑘𝑚𝑎𝑥subscript𝜈𝑘0\displaystyle\begin{split}\Phi(\pi^{0}_{k})=\frac{N_{0}BT_{k}^{max}}{\lvert h% \rvert^{2}d^{-\eta}}(2^{\frac{M_{k}\log_{2}{(L^{*}_{k})}}{\pi^{0}_{k}BT_{k}^{% max}}}-1-\frac{\ln{(2)}M_{k}\log_{2}{(L^{*}_{k})}}{\pi^{0}_{k}BT_{k}^{max}})\\ +{\nu}_{k}=0\end{split}start_ROW start_CELL roman_Φ ( italic_π start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = divide start_ARG italic_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_B italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT end_ARG start_ARG | italic_h | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d start_POSTSUPERSCRIPT - italic_η end_POSTSUPERSCRIPT end_ARG ( 2 start_POSTSUPERSCRIPT divide start_ARG italic_M start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_L start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) end_ARG start_ARG italic_π start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_B italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT end_ARG end_POSTSUPERSCRIPT - 1 - divide start_ARG roman_ln ( 2 ) italic_M start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_L start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) end_ARG start_ARG italic_π start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_B italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT end_ARG ) end_CELL end_ROW start_ROW start_CELL + italic_ν start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = 0 end_CELL end_ROW (18)

Where πk0subscriptsuperscript𝜋0𝑘\pi^{0}_{k}italic_π start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is the zero point of function Φ(πk)Φsubscript𝜋𝑘\Phi(\pi_{k})roman_Φ ( italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ). In general, the optimal solution of θksubscript𝜃𝑘\theta_{k}italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and πksubscript𝜋𝑘\pi_{k}italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT can be acquired by

θk=max{2τIk3Dk3Ck3νk(Tkmax)23,θkmin},πk=max{πk0,πkmin}formulae-sequencesubscript𝜃𝑘32𝜏subscriptsuperscript𝐼3𝑘subscriptsuperscript𝐷3𝑘subscriptsuperscript𝐶3𝑘subscript𝜈𝑘superscriptsuperscriptsubscript𝑇𝑘𝑚𝑎𝑥2subscriptsuperscript𝜃𝑚𝑖𝑛𝑘subscript𝜋𝑘subscriptsuperscript𝜋0𝑘subscriptsuperscript𝜋𝑚𝑖𝑛𝑘\displaystyle\begin{split}\theta_{k}={\max}\{\sqrt[3]{\frac{2\tau I^{3}_{k}D^{% 3}_{k}C^{3}_{k}}{\nu_{k}(T_{k}^{max})^{2}}},\theta^{min}_{k}\},\pi_{k}={\max}% \{\pi^{0}_{k},\pi^{min}_{k}\}\end{split}start_ROW start_CELL italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = roman_max { nth-root start_ARG 3 end_ARG start_ARG divide start_ARG 2 italic_τ italic_I start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_D start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_C start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG italic_ν start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG end_ARG , italic_θ start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } , italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = roman_max { italic_π start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_π start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } end_CELL end_ROW (19)

It’s worth mentioning that seeking the problem’s optimal solution directly can be rather intricate, which is why we employed binary search to find the optimal strategy for the Lagrange multiplier νksubscript𝜈𝑘\nu_{k}italic_ν start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. Utilizing the most favorable Lagrange multiplier value, the optimal approach for variables θksubscript𝜃𝑘\theta_{k}italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and πksubscript𝜋𝑘\pi_{k}italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT are computed based on (19). To be specific, given the searching range of νksubscript𝜈𝑘\nu_{k}italic_ν start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and error tolerance λ𝜆\lambdaitalic_λ, the optimal νksubscript𝜈𝑘\nu_{k}italic_ν start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is obtained with the constraint (14a). Additional and more detailed information is provided in Algorithm 1. Finally, the overall algorithm of the proposed method is shown as algorithm 2.

Input: νkminsubscriptsuperscript𝜈𝑚𝑖𝑛𝑘\nu^{min}_{k}italic_ν start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT,νkmaxsubscriptsuperscript𝜈𝑚𝑎𝑥𝑘\nu^{max}_{k}italic_ν start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT,πkminsubscriptsuperscript𝜋𝑚𝑖𝑛𝑘\pi^{min}_{k}italic_π start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT,πkmaxsubscriptsuperscript𝜋𝑚𝑎𝑥𝑘\pi^{max}_{k}italic_π start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT,and λ𝜆\lambdaitalic_λ.
Output: The optimal Lagrange multiplier νk*subscriptsuperscript𝜈𝑘\nu^{*}_{k}italic_ν start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT.
1 while |νkmaxνkmin|λsubscriptsuperscript𝜈𝑚𝑎𝑥𝑘subscriptsuperscript𝜈𝑚𝑖𝑛𝑘𝜆|\nu^{max}_{k}-\nu^{min}_{k}|\leq\lambda| italic_ν start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - italic_ν start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | ≤ italic_λ do
2       νk=(νkmax+νkmin)/2subscript𝜈𝑘subscriptsuperscript𝜈𝑚𝑎𝑥𝑘subscriptsuperscript𝜈𝑚𝑖𝑛𝑘2\nu_{k}=(\nu^{max}_{k}+\nu^{min}_{k})/2italic_ν start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = ( italic_ν start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_ν start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) / 2;
3       Calculate θk*subscriptsuperscript𝜃𝑘\theta^{*}_{k}italic_θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT;
4       Search for πk*subscriptsuperscript𝜋𝑘\pi^{*}_{k}italic_π start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT;
5       if θk*+πk*1subscriptsuperscript𝜃𝑘subscriptsuperscript𝜋𝑘1\theta^{*}_{k}+\pi^{*}_{k}\leq 1italic_θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_π start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≤ 1 then νkmax=νk*subscriptsuperscript𝜈𝑚𝑎𝑥𝑘subscriptsuperscript𝜈𝑘\nu^{max}_{k}=\nu^{*}_{k}italic_ν start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_ν start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT else νkmin=νk*subscriptsuperscript𝜈𝑚𝑖𝑛𝑘subscriptsuperscript𝜈𝑘\nu^{min}_{k}=\nu^{*}_{k}italic_ν start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_ν start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ;
6      
7 end while
8return νk*subscriptsuperscript𝜈𝑘\nu^{*}_{k}italic_ν start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT 
9 while |πkmaxπkmin|λsubscriptsuperscript𝜋𝑚𝑎𝑥𝑘subscriptsuperscript𝜋𝑚𝑖𝑛𝑘𝜆|\pi^{max}_{k}-\pi^{min}_{k}|\leq\lambda| italic_π start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - italic_π start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | ≤ italic_λ do
10       πk=(πkmax+πkmin)/2subscript𝜋𝑘subscriptsuperscript𝜋𝑚𝑎𝑥𝑘subscriptsuperscript𝜋𝑚𝑖𝑛𝑘2\pi_{k}=(\pi^{max}_{k}+\pi^{min}_{k})/2italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = ( italic_π start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_π start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) / 2;
11       Calculate Φ(πk)Φsubscript𝜋𝑘\Phi(\pi_{k})roman_Φ ( italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT );
12       if Φ>0Φ0\Phi>0roman_Φ > 0 then πkmax=πk*subscriptsuperscript𝜋𝑚𝑎𝑥𝑘subscriptsuperscript𝜋𝑘\pi^{max}_{k}=\pi^{*}_{k}italic_π start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_π start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT  else πkmin=πk*subscriptsuperscript𝜋𝑚𝑖𝑛𝑘subscriptsuperscript𝜋𝑘\pi^{min}_{k}=\pi^{*}_{k}italic_π start_POSTSUPERSCRIPT italic_m italic_i italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_π start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ;
13      
14 end while
15return πk*subscriptsuperscript𝜋𝑘\pi^{*}_{k}italic_π start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT 
Algorithm 1 Binary Search
Input: pre-trained model 𝒘0superscript𝒘0{\boldsymbol{w}}^{0}bold_italic_w start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT; variance schedule {β}𝛽\{\beta\}{ italic_β }; iteration I𝐼Iitalic_I; sample step T𝑇Titalic_T; error bound δksubscript𝛿𝑘\delta_{k}italic_δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT.
Output: global model 𝒘Isuperscript𝒘𝐼{\boldsymbol{w}}^{I}bold_italic_w start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT.
1 for i=0𝑖0i=0italic_i = 0 to I𝐼Iitalic_I do
2       K𝐾absentK~{}\leftarrowitalic_K ← Select K𝐾Kitalic_K devices from edge devices pool;
3       Calculate the optimal resource allocation strategy  based on Algorithm 1;
4       for k𝑘kitalic_k in K𝐾Kitalic_K parallel do
5             Initialize local model 𝒘kisubscriptsuperscript𝒘𝑖𝑘{\boldsymbol{w}}^{i}_{k}bold_italic_w start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT by 𝒘isuperscript𝒘𝑖{\boldsymbol{w}}^{i}bold_italic_w start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT;
6             A mini-batch original images x0subscript𝑥0x_{0}italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT in local dataset Dksubscript𝐷𝑘D_{k}italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT;
7             tUniform({1,,T})similar-to𝑡𝑈𝑛𝑖𝑓𝑜𝑟𝑚1𝑇t\sim Uniform(\{1,...,T\})italic_t ∼ italic_U italic_n italic_i italic_f italic_o italic_r italic_m ( { 1 , … , italic_T } );
8             ϵ𝒩(0,I)similar-toitalic-ϵ𝒩0𝐼\epsilon\sim\mathcal{N}(0,I)italic_ϵ ∼ caligraphic_N ( 0 , italic_I );
9             Diffuse x0subscript𝑥0x_{0}italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT to xtϵsubscript𝑥𝑡italic-ϵx_{t}\approx\epsilonitalic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ≈ italic_ϵ by: xt=α¯tx0+1α¯tϵsubscript𝑥𝑡subscript¯𝛼𝑡subscript𝑥01subscript¯𝛼𝑡italic-ϵx_{t}=\sqrt{\overline{\alpha}_{t}}x_{0}+\sqrt{1-\overline{\alpha}_{t}}\epsilonitalic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = square-root start_ARG over¯ start_ARG italic_α end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + square-root start_ARG 1 - over¯ start_ARG italic_α end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG italic_ϵ;
10             Take the gradient decent step by minimizing: ϵF𝒘ki(xt,t)2superscriptnormitalic-ϵsubscript𝐹subscriptsuperscript𝒘𝑖𝑘subscript𝑥𝑡𝑡2||\epsilon-F_{{\boldsymbol{w}}^{i}_{k}}(x_{t},t)||^{2}| | italic_ϵ - italic_F start_POSTSUBSCRIPT bold_italic_w start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_t ) | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT;
11             Quantized 𝒘kisubscriptsuperscript𝒘𝑖𝑘{\boldsymbol{w}}^{i}_{k}bold_italic_w start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT based on stochastic quantization;
12             Upload the quantized model 𝒘^ki+1subscriptsuperscript^𝒘𝑖1𝑘\hat{\boldsymbol{w}}^{i+1}_{k}over^ start_ARG bold_italic_w end_ARG start_POSTSUPERSCRIPT italic_i + 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT to the server.
13       end for
14      
15 end for
Algorithm 2 Quantized Federated Diffusion

IV Numerical Results

IV-A Simulation Settings

To simulate the practical case of federated diffusion in mobile edge networks, we fine-tuned the pre-trained DDPM [17] on CIFAR10 [22] using the GTSRB [23] dataset. The dataset is divided into 10 subsets for 10 edge devices to perform federated learning. We fine-tuned the federated model with 1000 epochs for performance evaluations. The sample steps are set as 1000 for image generation. For computation and communication hyper-parameters {Ik,Dk,C,fkmax,τksubscript𝐼𝑘subscript𝐷𝑘𝐶superscriptsubscript𝑓𝑘𝑚𝑎𝑥subscript𝜏𝑘{I_{k}},{D_{k}},{C},{f_{k}^{max}},{\tau_{k}}italic_I start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_C , italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_a italic_x end_POSTSUPERSCRIPT , italic_τ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT} and {B,|h|2,d,η,N0,Mk𝐵superscript2𝑑𝜂subscript𝑁0subscript𝑀𝑘B,{|h|^{2}},d,{\eta},{N_{0}},{M_{k}}italic_B , | italic_h | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_d , italic_η , italic_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_M start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT}, the default settings are {1,512,3.2515123.251,512,3.251 , 512 , 3.25MCycles,109,1026,10^{9},10^{-26}, 10 start_POSTSUPERSCRIPT 9 end_POSTSUPERSCRIPT , 10 start_POSTSUPERSCRIPT - 26 end_POSTSUPERSCRIPT} and {50505050MHz,0.001,0.001, 0.001W,45,45, 45m,3.76,95,3.76,-95, 3.76 , - 95dbm/MHz,37,37, 37M}.

IV-B Performance Evaluations

Fig. 2 illustrates the performance and energy consumption of the proposed algorithm. We employed the Fréchet Inception Distance (FID) [24] as an evaluative metric for assessing the quality of the images generated by the model. A lower FID value indicates a higher degree of similarity between the distribution of the generated dataset and that of the original dataset. To enhance the precision of our evaluation concerning the quality of the generated dataset, we ensured that the number of generated datasets equaled the number of original images. Remarkably, our proposed methodology takes into consideration distinct quantization error constraints customized for heterogeneous edge devices. Subsequently, an energy minimization problem is optimized. The range of quantization levels spans from 6 bits to 8 bits, a range normally associated with a substantial reduction in energy consumption while concurrently upholding commendable performance. We conduct a comparative analysis between the baseline method, Fedavg, in addition to the fixed quantization methods employing 6-bits, 7-bits, and 8-bits quantization levels. It is evident from our results that our approach surpasses the more economical 8-bits quantization scheme in terms of both performance and cost-efficiency. It is essential to note that the compared methods did not specifically optimize for energy consumption, utilizing only 50%percent5050\%50 % time budget for computation and another 50%percent5050\%50 % for communication by default.

Refer to caption
Figure 2: FID performance and energy consumption of different schemes

Fig. 3 illustrates the successful convergence achieved by our proposed binary search algorithm in addressing the energy optimization problem we have established. Specifically, provided various quantization level requirements, the optimal solution for minimizing the energy cost is determined after about 20 searching iterations. Furthermore, a study was conducted to examine the impacts of two different hyperparameters, the time budget and the distance of communication. It can be clearly seen that the proposed method can converge well under different settings. As the allotted time budget reduces, the requisite energy consumption by the system escalates. In parallel, with an augmentation in the distance of communication between edge devices and central servers, there is a concurrent amplification in energy expenditure. Fig. 4 presents a comparative analysis of our method alongside other approaches across various time budget ranges. As observed, with an increase in the allotted time budget ranging from 13s to 18s, the system’s energy costs diminish, and our solution consistently outperforms the baseline approach. This indicates that our method can adapt to parameter settings within certain ranges.

Refer to caption
Figure 3: Convergence of proposed binary search algorithm
Refer to caption
Figure 4: Energy cost vs. Time budget

V Conclusion and Future Work

In this paper, we first design a dynamic quantized federated diffusion training considering each edge device’s demand. Subsequently, our study turns towards addressing the challenge of energy efficiency, taking into account the unique constraint imposed by quantization demand. Our simulation results demonstrate that our proposed method outperforms both the baseline federated diffusion approach and fixed quantized federated diffusion in substantially reducing system energy consumption and transmitted model size. Remarkably, this reduction is achieved without compromising the reasonable quality and diversity of the generated data, underscoring the effectiveness of our approach.

To achieve the benefits of efficient federated generative diffusion, there still exist several open and challenging issues. For distributed diffusion models, the matter of proficient sampling remains an unsolved problem, primarily due to the distinctive characteristics inherent to diffusion itself. Diverging from conventional AI models, the interference phase of diffusion entails a substantial energy outlay, particularly in the denoising sampling steps. This heightened energy consumption may be deemed unmanageable for certain edge devices. Consequently, further study is imperative to enhance the efficiency of sampling within the context of diffusion, particularly in the domain of distributed edge intelligence scenarios.

References

  • [1] B. Mao, Y. Liu, J. Liu, and N. Kato, “AI-assisted edge caching for metaverse of connected and automated vehicles: Proposal, challenges, and future perspectives,” IEEE Vehicular Technology Magazine, vol. 18, no. 4, pp. 66–74, 2023.
  • [2] I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, and Y. Bengio, “Generative adversarial networks,” Communications of the ACM, vol. 63, no. 11, pp. 139–144, 2020.
  • [3] S. Choi, J. Kim, and H. Yeo, “Trajgail: Generating urban vehicle trajectories using generative adversarial imitation learning,” Transportation Research Part C: Emerging Technologies, vol. 128, p. 103091, 2021.
  • [4] G. Cooper, “Examining science education in chatgpt: An exploratory study of generative artificial intelligence,” Journal of Science Education and Technology, vol. 32, no. 3, pp. 444–452, 2023.
  • [5] J. Kang, J. Wen, D. Ye, B. Lai, T. Wu, Z. Xiong, J. Nie, D. Niyato, Y. Zhang, and S. Xie, “Blockchain-empowered federated learning for healthcare metaverses: User-centric incentive mechanism with optimal data freshness,” IEEE Transactions on Cognitive Communications and Networking, 2023.
  • [6] B. Mao, X. Zhou, J. Liu, and N. Kato, “Digital twin satellite networks towards 6G: Motivations, challenges, and future perspectives,” IEEE Network, pp. 1–1, 2023.
  • [7] J. Kang, J. He, H. Du, Z. Xiong, Z. Yang, X. Huang, and S. Xie, “Adversarial attacks and defenses for semantic communication in vehicular metaverses,” IEEE Wireless Communications, vol. 30, no. 4, pp. 48–55, 2023.
  • [8] X. Luo, J. Wen, J. Kang, J. Nie, Z. Xiong, Y. Zhang, Z. Yang, and S. Xie, “Privacy attacks and defenses for digital twin migrations in vehicular metaverses,” IEEE Network, 2023.
  • [9] F. V. S. Jothiraj and A. Mashhadi, “Phoenix: A federated generative diffusion model,” arXiv preprint arXiv:2306.04098, 2023.
  • [10] M. de Goede, “Training diffusion models with federated learning: A communication-efficient model for cross-silo federated image generation,” 2023.
  • [11] B. Mao, J. Liu, Y. Wu, and N. Kato, “Security and privacy on 6g network edge: A survey,” IEEE Communications Surveys & Tutorials, vol. 25, no. 2, pp. 1095–1127, 2023.
  • [12] X. Huang, P. Li, H. Du, J. Kang, D. Niyato, D. I. Kim, and Y. Wu, “Federated learning-empowered AI-generated content in wireless networks,” arXiv preprint arXiv:2307.07146, 2023.
  • [13] J. Wen, J. Kang, M. Xu, H. Du, Z. Xiong, Y. Zhang, and D. Niyato, “Freshness-aware incentive mechanism for mobile AI-generated content (aigc) networks,” in 2023 IEEE/CIC International Conference on Communications in China (ICCC), pp. 1–6, 2023.
  • [14] Z. Yang, M. Chen, W. Saad, C. S. Hong, and M. Shikh-Bahaei, “Energy efficient federated learning over wireless communication networks,” IEEE Transactions on Wireless Communications, vol. 20, no. 3, pp. 1935–1949, 2020.
  • [15] P. Li, G. Cheng, X. Huang, J. Kang, R. Yu, Y. Wu, M. Pan, and D. Niyato, “Snowball: Energy efficient and accurate federated learning with coarse-to-fine compression over heterogeneous wireless edge devices,” IEEE Transactions on Wireless Communications, 2023.
  • [16] X. Meng and Y. Kabashima, “Quantized compressed sensing with score-based generative models,” in The Eleventh International Conference on Learning Representations, 2022.
  • [17] J. Ho, A. Jain, and P. Abbeel, “Denoising diffusion probabilistic models,” Advances in neural information processing systems, vol. 33, pp. 6840–6851, 2020.
  • [18] R. Chen, L. Li, K. Xue, C. Zhang, M. Pan, and Y. Fang, “Energy efficient federated learning over heterogeneous mobile devices via joint design of weight quantization and wireless transmission,” IEEE Transactions on Mobile Computing, 2022.
  • [19] B. McMahan, E. Moore, D. Ramage, S. Hampson, and B. A. y Arcas, “Communication-efficient learning of deep networks from decentralized data,” in Artificial intelligence and statistics, pp. 1273–1282, PMLR, 2017.
  • [20] H. G. Myung, J. Lim, and D. J. Goodman, “Single carrier fdma for uplink wireless transmission,” IEEE vehicular technology magazine, vol. 1, no. 3, pp. 30–38, 2006.
  • [21] S. P. Boyd and L. Vandenberghe, Convex optimization. Cambridge university press, 2004.
  • [22] A. Krizhevsky, G. Hinton, et al., “Learning multiple layers of features from tiny images,” 2009.
  • [23] S. Houben, J. Stallkamp, J. Salmen, M. Schlipsing, and C. Igel, “Detection of traffic signs in real-world images: The German Traffic Sign Detection Benchmark,” in International Joint Conference on Neural Networks, no. 1288, 2013.
  • [24] R. Rombach, A. Blattmann, D. Lorenz, P. Esser, and B. Ommer, “High-resolution image synthesis with latent diffusion models,” in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 10684–10695, 2022.