\cormark

[1] \creditConceptualization, Modeling, Methodology, Software, Writing

\credit

Methodology, Proof

\credit

Modeling, Revision

\credit

Supervision, Revision, Funding Support

\cortext

[1]Corresponding author

Long-Term Energy Management for Microgrid with Hybrid Hydrogen-Battery Energy Storage: A Prediction-Free Coordinated Optimization Framework

Ning Qi [email protected]    Kaidi Huang    Zhiyuan Fan    Bolun Xu Department of Earth and Environmental Engineering, Columbia University, New York, NY 10027, USA Department of Electrical Engineering, Tsinghua University, Beijing 100084, China
Abstract

This paper studies the long-term energy management of a microgrid coordinating hybrid hydrogen-battery energy storage. We develop an approximate semi-empirical hydrogen storage model to accurately capture the power-dependent efficiency of hydrogen storage. We introduce a prediction-free two-stage coordinated optimization framework, which generates the annual state-of-charge (SoC) reference for hydrogen storage offline. During online operation, it updates the SoC reference online using kernel regression and makes operation decisions based on the proposed adaptive virtual-queue-based online convex optimization (OCO) algorithm. We innovatively incorporate penalty terms for long-term pattern tracking and expert-tracking for step size updates. We provide theoretical proof to show that the proposed OCO algorithm achieves a sublinear bound of dynamic regret without using prediction information. Numerical studies based on the Elia and North China datasets show that the proposed framework significantly outperforms the existing online optimization approaches by reducing the operational costs and loss of load by around 30% and 80%, respectively. These benefits can be further enhanced with optimized settings for the penalty coefficient and step size of OCO, as well as more historical references.

keywords:
Long-Term Energy Management \sepHydrogen \sepHybrid Energy Storage \sepOnline Convex Optimization \sepMicrogrid
{highlights}

Long-term energy management of microgrid considering seasonal uncertainties and seasonal storage

A prediction-free two-stage coordinated optimization framework

SoC reference of hydrogen storage generated from kernel regression and historical and AI-generated scenarios

A virtual-queue-based online convex optimization algorithm with expert-tracking

Numerical studies on Elia and North China with ground-truth datasets spanning 10 years

1 Introduction

1.1 Background and motivation

A microgrid is a self-contained electrical network with resources including energy storage (ES), renewable energy sources (RES), and controllable loads, operated in either grid-connected or island mode [5, 30]. Microgrids enhance energy resilience, promote decarbonization, and reduce transmission system investments, but the volatility of RES poses challenges to short-term supply-demand balances [23, 21]. Besides, seasonal variations in RES availability [13] and extreme weather events [42] have highlighted the significance of the long-term energy management of microgrids.

Hybrid energy storage system (HESS) [14, 24] offers a promising way to guarantee both the short-term and long-term supply-demand balance of microgrids. HESS is composed of two or more ES units with different but complementing characteristics, such as duration and efficiency. In day-ahead or intra-day operations, batteries can effectively address the uncertainties introduced by RES and demand. For long-term operation, hydrogen storage consisting of electrolyzer and fuel cell can provide efficient solutions to seasonal energy shifting [16]. In this paper, we focus on a typical application: hybrid hydrogen-battery energy storage (H-BES). Given the differences in storage properties and unanticipated seasonal uncertainties, designing an effective long-term energy management framework for microgrids with H-BES is significant but challenging.

1.2 Literature review

Previous research mainly focuses on the short-term energy management of microgrids with H-BES. Two-stage robust optimization is proposed in [9] for the market operation of H-BES, where the uncertainties from RES are modeled by uncertainty sets. A two-stage distributionally robust optimization-based coordinated scheduling of an integrated energy system with H-BES is introduced in [26], where an ambiguity set is employed to model the uncertainties from RES and integrated energy loads. Two-stage stochastic energy management of H-BES is proposed in [7], where the uncertainties from RES, load, and prices are modeled by typical scenarios. However, these works rely solely on offline optimization methods with predefined uncertainty modeling, which may face optimality or feasibility issues in real-time operation. This motivates the research on real-time energy management with online optimization methods, such as the rolling-horizon method. Model predictive control (MPC) is the widely used rolling-horizon method and multi-level MPC controllers are developed for microgrids with hydrogen or H-BES in [31, 13]. An actor-critic deep reinforcement learning method is proposed in [15] to address multi-timescale coordinated dispatch of microgrid with hybrid battery and supercapacitor. MPC and approximate dynamic programming approach are jointly utilized for multi-stage coordinated dispatch [19], which achieves robust real-time performance through continuously updated forecasts. However, the limitations in the aforementioned works mainly lie in (i) The short-term energy management methods may face infeasibility issues in the long-term operation when considering seasonal variations RES and load. (ii) The performance of these techniques strongly depends on the accuracy of the prediction of uncertainties. However, the predictions are practically

unavailable or unreliable for microgrid operators.

To address the first limitation, recent studies have started to explore the long-term energy management of microgrids, which aims to solve the multi-time-period dispatch with non-anticipativity. Stochastic dynamic programming is technically sound, which can decompose the multi-period dispatch problem into sequential single-period dispatch problems through value function. And it is applied in [4] by learning the value function of H-BES. However, it becomes computationally intractable to train the value function if the storage duration spans multiple months. A continuous spectrum splitting approach is proposed in [10] to assign low-frequency uncertainty scenarios to hydrogen and high-frequency uncertainty scenarios to batteries for power balance, but this approach is designed for the planning of H-BES. A data-driven coordinated dispatch framework is proposed in [13], where the state of charge (SoC) reference for hydrogen storage is generated based on historical simulations. This reference is then updated and embedded into MPC for real-time operation. However, the use of MPC makes the entire framework dependent on forecasting.

Additionally, prediction-free online optimization methods are gaining increased attention. Lyapunov optimization and online convex optimization (OCO) are effective representatives [33]. Lyapunov optimization adopts a “1-lookahead” pattern, where uncertainties are observed first, followed by solving the Lyapunov drift problem [28]. It has wide applications in demand response [40], electric vehicle charging [34], microgrid [1], etc. The long-term operational cost minimization of hydrogen-based building energy systems is transformed into several single-slot subproblems using Lyapunov optimization [37]. A joint energy scheduling and trading algorithm based on Lyapunov optimization and a double-auction mechanism is designed in [43] to optimize the long-term energy cost of each microgrid. However, in some cases, the uncertainties can not be observed before decision-making and Lyapunov optimization becomes inapplicable. For instance, storage participants bid with unknown future prices, and the prices are cleared by the market after the bidding process [41]. Instead, OCO adopts a “0-lookahead” pattern, where the decision is made before the observation of uncertainties. And OCO has been utilized in demand side management [17] and ancillary services [39] due to its completely prediction-free and fast response nature. However, to the best of our knowledge, no research has addressed the long-term energy management of microgrids with H-BES within the OCO framework. The application of OCO in the focused topic may face the following challenges: (i) OCO is problem-dependent without a predefined mathematical formulation, and there is no prior experience available as a reference for designing OCO for microgrids with H-BES. (ii) Although recent works [22, 20, 36, 6] have embedded inter-temporal constraints into the OCO framework, OCO still risks falling into local optima due to its myopic nature. (iii) OCO aims to achieve regret (Reg) that grows sublinearly with time horizon T𝑇Titalic_T. However, most of the existing OCO algorithms fail to address the sublinear bounds for dynamic Reg [20, 36, 6] or require prediction information to improve the performance [22]. Please see Table 1 for a comprehensive comparison.

1.3 Research gap

Existing literature is summarized in Table 1. Although some works achieve good results in the long-term energy management of microgrids with H-BES, there are still several research gaps that have not been adequately addressed.

(1) Most existing studies employ a simplified operational model for hydrogen storage, using a constant energy conversion efficiency regardless of whether the storage operates at full capacity. However, the efficiency of hydrogen storage varies with the charge/discharge power and follows a nonlinear function [32]. Using a simplified model can result in sub-optimal or even infeasible solutions [3]. Therefore, it is crucial to incorporate this nonlinearity into the microgrid energy management with H-BES.

(2) Current microgrid energy management approaches either employ offline optimization methods (e.g., robust optimization [9], frequency-domain method [10]) or prediction-dependent online optimization methods (e.g., MPC [13], stochastic dynamic programming [4]). However, the distribution and prediction information is often inaccurate or unavailable in practical microgrid operations. Thus, designing a prediction-free optimization framework for microgrid energy management with H-BES is necessary.

(3) OCO is a promising “0-lookahead” online optimization method originating from the fields of machine learning and control [36]-[6]. However, OCO lacks a global view of long-term operations and adaptability to the high volatility of microgrids. Hence, it is important to extend traditional OCO methods to incorporate long-term operational patterns and time-varying properties.

Table 1: Comparison of existing literature on long-term and short-term energy management of H-BES.
Reference Storage Type & Model Long-term Optimization Short-term Optimization Prediction-Free
[9] H-BES-Constant X Robust Optimization \checkmark(Offline)
[26] H-BES-Constant X Distributionally Robust Optimization \checkmark(Offline)
[7] H-BES-Constant X Stochastic Optimization \checkmark(Offline)
[31] H-BES-Electrochemical X MPC X
[15] Battery+Supercapacitor-Constant X Deep Reinforcement Learning X
[19] Battery+Thermal Storage-Constant X MPC+Dynamic Programming X
[4] H-BES-Constant Stochastic Dynamic Programming X
[10] H-BES-Constant Spectrum Splitting Approach \checkmark(Offline)
[13] H-BES-Constant Historical Reference MPC X
[37] H-BES-Constant Lyapunov Optimization \checkmark(1-lookahead)
[43] Hydrogen Full Cell-Constant Lyapunov Optimization \checkmark(1-lookahead)
[22] Not Given X OCO: Dynamic Reg 𝒪(Tmax{1ac,c})𝒪superscript𝑇1𝑎𝑐𝑐\mathcal{O}(T^{\max\{1-a-c,c\}})caligraphic_O ( italic_T start_POSTSUPERSCRIPT roman_max { 1 - italic_a - italic_c , italic_c } end_POSTSUPERSCRIPT ), 0<a,c<1 \checkmark(0-lookahead)
[20] Not Given X OCO: Dynamic Reg 𝒪(TcPxc)𝒪superscript𝑇𝑐superscriptsubscript𝑃𝑥𝑐\mathcal{O}(T^{c}P_{x}^{c})caligraphic_O ( italic_T start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ), 0<c<1 \checkmark(0-lookahead)
[36] Not Given X OCO: Static Reg 𝒪(Tmax{1c,c})𝒪superscript𝑇1𝑐𝑐\mathcal{O}(T^{\max\{1-c,c\}})caligraphic_O ( italic_T start_POSTSUPERSCRIPT roman_max { 1 - italic_c , italic_c } end_POSTSUPERSCRIPT ), 0<c<1 \checkmark(0-lookahead)
[6] Not Given X OCO: Dynamic Reg 𝒪(max(TcPx,T1c))𝒪superscript𝑇𝑐subscript𝑃𝑥superscript𝑇1𝑐\mathcal{O}(\max(T^{c}P_{x},T^{1-c}))caligraphic_O ( roman_max ( italic_T start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT , italic_T start_POSTSUPERSCRIPT 1 - italic_c end_POSTSUPERSCRIPT ) ), 0<c<1 \checkmark(0-lookahead)
This Paper H-BES-Semi-Empirical Historical&AI-Generated Reference OCO: Dynamic Reg 𝒪(Tc(1+Px)1κ+T1c(1+Px)κ)𝒪superscript𝑇𝑐superscript1subscript𝑃𝑥1𝜅superscript𝑇1𝑐superscript1subscript𝑃𝑥𝜅\mathcal{O}(T^{c}(1+P_{x})^{1-\kappa}+T^{1-c}(1+P_{x})^{\kappa})caligraphic_O ( italic_T start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ( 1 + italic_P start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 1 - italic_κ end_POSTSUPERSCRIPT + italic_T start_POSTSUPERSCRIPT 1 - italic_c end_POSTSUPERSCRIPT ( 1 + italic_P start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_κ end_POSTSUPERSCRIPT ), 0<κ<c<10𝜅𝑐10<\kappa<c<10 < italic_κ < italic_c < 1 \checkmark(0-lookahead)
  • a

    Depending on the baselines used, Reg is divided into static Reg, with the baseline being a single-period optimal solution, and dynamic Reg, with the baseline being the global optimal solution.

  • b

    Pxsubscript𝑃𝑥P_{x}italic_P start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT: path-length, i.e., the accumulated variation of optimal decisions; Pgsubscript𝑃𝑔P_{g}italic_P start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT: function variation, i.e., the accumulated variation of constraints.

1.4 Contributions

Motivated by the research gaps, this paper proposes a prediction-free coordinated optimization framework for long-term energy management of microgrid with H-BES while incorporating the nonlinearity of hydrogen storage and seasonal uncertainties from RES and load. Specifically, our contributions are threefold:

(1) Modeling: We propose an approximate semi-empirical hydrogen storage model using piecewise linear relaxation, which accurately captures the power-dependent efficiency of hydrogen storage. Simulations demonstrate that, compared to the constant efficiency model, the proposed approximation model avoids both overly optimistic and overly conservative strategies. This results in a reduction of the practical yearly operational cost by 10% or 36%, and a decrease in yearly loss of load by 1.94 MWh or 3.85 MWh.

(2) Solution Methodology: We introduce a prediction-free two-stage coordinated optimization framework. In the offline stage, the ex-post SoC references for hydrogen storage are generated by deterministic mixed-integer linear programming with historical and AI-generated data on RES and load. These references help to avoid myopic online decision-making and are incrementally updated by kernel regression with newly observed data. Subsequently, we develop an adaptive virtual-queue-based OCO algorithm for prediction-free online decision-making. Compared to the traditional OCO algorithm [22, 20, 36, 6], the proposed method innovatively incorporates a penalty term for long-term pattern tracking and expert-tracking for step size updates. The proposed OCO algorithm is proven to achieve a sublinear bound for dynamic regret.

(3) Numerical Study: We demonstrate the effectiveness of the proposed framework using ground-truth data from Elia [8] and North China [12]. Simulations show that introducing the reference significantly reduces operational costs and loss of load by 40%-57% and 60%-90%, respectively. Furthermore, compared to the prediction-dependent MPC method, the prediction-free OCO method further decreases operational costs and loss of load by 24%-29% and 73%-89%, respectively. These benefits can be further enhanced with optimized settings for the penalty coefficient and step size of OCO, as well as more historical references.

1.5 Paper Organization

We organize the remainder of the paper as follows. Section 2 presents an approximate semi-empirical modeling of hydrogen storage. Section 3 provides the problem formulation for long-term energy management of the microgrid with H-BES. Section 4 introduces the prediction-free two-stage coordinated optimization framework and the proof of OCO performance. Section 5 describes numerical case studies to verify the effectiveness of the proposed framework. Finally, we conclude this paper in Section 6.

2 Approximate semi-empirical hydrogen energy
storage model

2.1 Structure of hydrogen storage system

A hydrogen storage system is composed of several key components, such as electrolyzers, hydrogen storage tanks, fuel cells, compressors, and other auxiliary equipment, as illustrated in Fig. 1. Electrolyzers convert electrical energy into chemical energy by producing hydrogen and oxygen. This paper considers the most mature and commonly used alkaline water electrolyzer. Hydrogen storage tanks are used to store the produced hydrogen. Fuel cells convert the stored hydrogen back into electricity, and we consider the typical type, proton exchange membrane fuel cell (PEMFC). Other auxiliary equipment, including the compressor, cooling system, and control system, is excluded from the modeling.

Refer to caption
Figure 1: Schematic diagram of hydrogen storage system.

2.2 Alkaline water electrolyzer model

(1) Polarization curve

The polarization curve describes the electrochemical behavior of an electrolyzer, modeling the relationship between current and voltage. To account for the impact of temperature and pressure on the thermodynamics and electrochemical process within the electrolyzer, we combine the most used model proposed by Ulleberg [32] and the modified model proposed by Sanchez [27]:

UcellE=Urev+[(r1+d1)+r2θ+d2P]iA+slog[(t1+t2θ+t3θ2)iA+1]superscriptsubscript𝑈cellEsubscript𝑈revdelimited-[]subscript𝑟1subscript𝑑1subscript𝑟2𝜃subscript𝑑2𝑃𝑖𝐴𝑠subscript𝑡1subscript𝑡2𝜃subscript𝑡3superscript𝜃2𝑖𝐴1\begin{split}U_{\text{cell}}^{\text{E}}&=U_{\mathrm{rev}}+\left[\left(r_{1}+d_% {1}\right)+r_{2}\cdot\theta+d_{2}\cdot P\right]\cdot\dfrac{i}{A}\\ &+s\cdot\log\left[\left(t_{1}+\frac{t_{2}}{\theta}+\frac{t_{3}}{\theta^{2}}% \right)\cdot\dfrac{i}{A}+1\right]\end{split}start_ROW start_CELL italic_U start_POSTSUBSCRIPT cell end_POSTSUBSCRIPT start_POSTSUPERSCRIPT E end_POSTSUPERSCRIPT end_CELL start_CELL = italic_U start_POSTSUBSCRIPT roman_rev end_POSTSUBSCRIPT + [ ( italic_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) + italic_r start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ⋅ italic_θ + italic_d start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ⋅ italic_P ] ⋅ divide start_ARG italic_i end_ARG start_ARG italic_A end_ARG end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + italic_s ⋅ roman_log [ ( italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + divide start_ARG italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG italic_θ end_ARG + divide start_ARG italic_t start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG start_ARG italic_θ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) ⋅ divide start_ARG italic_i end_ARG start_ARG italic_A end_ARG + 1 ] end_CELL end_ROW (1)

where the reversible voltage and cell voltage of the electrolyzer are defined as Urevsubscript𝑈revU_{\mathrm{rev}}italic_U start_POSTSUBSCRIPT roman_rev end_POSTSUBSCRIPT and UcellEsuperscriptsubscript𝑈cellEU_{\text{cell}}^{\text{E}}italic_U start_POSTSUBSCRIPT cell end_POSTSUBSCRIPT start_POSTSUPERSCRIPT E end_POSTSUPERSCRIPT. Temperature and pressure are given by θ𝜃\thetaitalic_θ and P𝑃Pitalic_P. The current and effective area of the electrode is defined as i𝑖iitalic_i and A𝐴Aitalic_A. Parameters r1subscript𝑟1r_{1}italic_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, r2subscript𝑟2r_{2}italic_r start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, d1subscript𝑑1d_{1}italic_d start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, d2subscript𝑑2d_{2}italic_d start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, t1subscript𝑡1t_{1}italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, t2subscript𝑡2t_{2}italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, t3subscript𝑡3t_{3}italic_t start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT, s𝑠sitalic_s are the constants which can be learned from the experimental data.

(2) Faraday efficiency

Faraday efficiency is defined as the ratio of measured hydrogen production to the theoretical value. For an alkaline electrolyzer, the Faraday efficiency typically ranges from 85% to 95% and is affected by temperature. We adopt the four-parameter Faraday efficiency model as (2).

ηF=((i/A)2f1+f2θ+(i/A)2)(f3+f4θ)subscript𝜂Fsuperscript𝑖𝐴2subscript𝑓1subscript𝑓2𝜃superscript𝑖𝐴2subscript𝑓3subscript𝑓4𝜃\eta_{\text{F}}=\left(\frac{(i/A)^{2}}{f_{1}+f_{2}\cdot\theta+(i/A)^{2}}\right% )\cdot\left(f_{3}+f_{4}\cdot\theta\right)italic_η start_POSTSUBSCRIPT F end_POSTSUBSCRIPT = ( divide start_ARG ( italic_i / italic_A ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ⋅ italic_θ + ( italic_i / italic_A ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) ⋅ ( italic_f start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT + italic_f start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT ⋅ italic_θ ) (2)

where Faraday efficiency is defined as ηFsubscript𝜂F\eta_{\text{F}}italic_η start_POSTSUBSCRIPT F end_POSTSUBSCRIPT. Parameters f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, f2subscript𝑓2f_{2}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, f3subscript𝑓3f_{3}italic_f start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT, f4subscript𝑓4f_{4}italic_f start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT are the constants which can be learned from the experimental data.

(3) Approximate charging efficiency

According to Faraday’s law, the hydrogen production rate is defined as (3a). The charging efficiency is given by (3b).

hc=3600ηFMiN2Fsuperscriptc3600subscript𝜂F𝑀𝑖𝑁2𝐹\displaystyle h^{\text{c}}=3600\cdot\dfrac{\eta_{\text{F}}\cdot M\cdot i\cdot N% }{2F}italic_h start_POSTSUPERSCRIPT c end_POSTSUPERSCRIPT = 3600 ⋅ divide start_ARG italic_η start_POSTSUBSCRIPT F end_POSTSUBSCRIPT ⋅ italic_M ⋅ italic_i ⋅ italic_N end_ARG start_ARG 2 italic_F end_ARG (3a)
ηH,c=hcLHVPStack=3600ηFMLHV2FUcellsuperscript𝜂HcsuperscriptcLHVsubscript𝑃Stack3600subscript𝜂F𝑀LHV2𝐹subscript𝑈cell\displaystyle\eta^{\text{H},{\text{c}}}=\dfrac{h^{\text{c}}\cdot\text{LHV}}{P_% {\text{Stack}}}=3600\cdot\dfrac{\eta_{\text{F}}\cdot M\cdot\text{LHV}}{2F\cdot U% _{\text{cell}}}italic_η start_POSTSUPERSCRIPT H , c end_POSTSUPERSCRIPT = divide start_ARG italic_h start_POSTSUPERSCRIPT c end_POSTSUPERSCRIPT ⋅ LHV end_ARG start_ARG italic_P start_POSTSUBSCRIPT Stack end_POSTSUBSCRIPT end_ARG = 3600 ⋅ divide start_ARG italic_η start_POSTSUBSCRIPT F end_POSTSUBSCRIPT ⋅ italic_M ⋅ LHV end_ARG start_ARG 2 italic_F ⋅ italic_U start_POSTSUBSCRIPT cell end_POSTSUBSCRIPT end_ARG (3b)

where hcsuperscriptch^{\text{c}}italic_h start_POSTSUPERSCRIPT c end_POSTSUPERSCRIPT is the hydrogen production rate of electrolyzer. M𝑀Mitalic_M is the molar mass of hydrogen. F𝐹Fitalic_F is the Faraday’s constant, i.e., 96485 C/mol. N𝑁Nitalic_N is the number of cells of the stack. LHV is the lower heat value of hydrogen, i.e., 33.33 kWh/kg.

As illustrated in Fig. 2, the blue curves from the semi-empirical model are non-linear and power-dependent, including a peak in efficiency at around 20% of the rated power. Therefore, a constant conversion efficiency cannot capture the variations in efficiency. To facilitate dispatch optimization, we adopt a piecewise linear approximation for hydrogen production, depicted by red dashed lines.

Refer to caption
Figure 2: Approximation of alkaline electrolyzer properties at 90Csuperscript90C90\,^{\circ}\mathrm{C}90 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT roman_C, 10 bar: (a) hydrogen production, (b) efficiency.

2.3 PEMFC model

(1) Polarization curve

The polarization curve of PEMFC is typically modeled using the equivalent circuit model proposed by Amphlett [2]. The cell voltage UcellFsuperscriptsubscript𝑈cellFU_{\text{cell}}^{\text{F}}italic_U start_POSTSUBSCRIPT cell end_POSTSUBSCRIPT start_POSTSUPERSCRIPT F end_POSTSUPERSCRIPT is given by (4), which equals the open circuit voltage ENernst subscript𝐸Nernst E_{\text{Nernst }}italic_E start_POSTSUBSCRIPT Nernst end_POSTSUBSCRIPT dropped by three types of irreversible losses: activation losses Uact subscript𝑈act U_{\text{act }}italic_U start_POSTSUBSCRIPT act end_POSTSUBSCRIPT, ohmic losses Uohmic subscript𝑈ohmic U_{\text{ohmic }}italic_U start_POSTSUBSCRIPT ohmic end_POSTSUBSCRIPT, and concentration losses Ucon subscript𝑈con U_{\text{con }}italic_U start_POSTSUBSCRIPT con end_POSTSUBSCRIPT.

UcellF=ENernst Uact Uohmic Uconsuperscriptsubscript𝑈cellFsubscript𝐸Nernst subscript𝑈act subscript𝑈ohmic subscript𝑈con\displaystyle U_{\text{cell}}^{\text{F}}=E_{\text{Nernst }}-U_{\text{act }}-U_% {\text{ohmic }}-U_{\text{con }}italic_U start_POSTSUBSCRIPT cell end_POSTSUBSCRIPT start_POSTSUPERSCRIPT F end_POSTSUPERSCRIPT = italic_E start_POSTSUBSCRIPT Nernst end_POSTSUBSCRIPT - italic_U start_POSTSUBSCRIPT act end_POSTSUBSCRIPT - italic_U start_POSTSUBSCRIPT ohmic end_POSTSUBSCRIPT - italic_U start_POSTSUBSCRIPT con end_POSTSUBSCRIPT (4a)
ENernst =12F[ΔGΔS(θθref )\displaystyle E_{\text{Nernst }}=\frac{1}{2F}\big{[}\Delta G-\Delta S(\theta-% \theta_{\text{ref }})italic_E start_POSTSUBSCRIPT Nernst end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG 2 italic_F end_ARG [ roman_Δ italic_G - roman_Δ italic_S ( italic_θ - italic_θ start_POSTSUBSCRIPT ref end_POSTSUBSCRIPT )
+Rθ(log(PH2)+log(PO2)2)]\displaystyle\hskip 28.45274pt+R\cdot\theta\left(\log(P_{\mathrm{H}_{2}})+% \frac{\log(P_{\mathrm{O}_{2}})}{2}\right)\big{]}+ italic_R ⋅ italic_θ ( roman_log ( italic_P start_POSTSUBSCRIPT roman_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) + divide start_ARG roman_log ( italic_P start_POSTSUBSCRIPT roman_O start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_ARG start_ARG 2 end_ARG ) ] (4b)
Uact =a1+a2θ+a3θlog(CO2)+a4θlog(i)subscript𝑈act subscript𝑎1subscript𝑎2𝜃subscript𝑎3𝜃subscript𝐶subscriptO2subscript𝑎4𝜃𝑖\displaystyle U_{\text{act }}=a_{1}+a_{2}\cdot\theta+a_{3}\cdot\theta\cdot\log% (C_{\mathrm{O}_{2}})+a_{4}\cdot\theta\cdot\log\left(i\right)italic_U start_POSTSUBSCRIPT act end_POSTSUBSCRIPT = italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ⋅ italic_θ + italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ⋅ italic_θ ⋅ roman_log ( italic_C start_POSTSUBSCRIPT roman_O start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) + italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT ⋅ italic_θ ⋅ roman_log ( italic_i ) (4c)
Uohmic =iRohmic =i(rMl/A+Rc)subscript𝑈ohmic 𝑖subscript𝑅ohmic 𝑖subscript𝑟𝑀𝑙𝐴subscript𝑅𝑐\displaystyle U_{\text{ohmic }}=i\cdot R_{\text{ohmic }}=i\left(r_{M}\cdot l/A% +R_{c}\right)italic_U start_POSTSUBSCRIPT ohmic end_POSTSUBSCRIPT = italic_i ⋅ italic_R start_POSTSUBSCRIPT ohmic end_POSTSUBSCRIPT = italic_i ( italic_r start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ⋅ italic_l / italic_A + italic_R start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT ) (4d)
Ucon =Blog(1JJmax)J=iAsubscript𝑈con 𝐵1𝐽subscript𝐽𝐽𝑖𝐴\displaystyle U_{\text{con }}=B\cdot\log\left(1-\dfrac{J}{J_{\max}}\right)% \text{, }J=\dfrac{i}{A}italic_U start_POSTSUBSCRIPT con end_POSTSUBSCRIPT = italic_B ⋅ roman_log ( 1 - divide start_ARG italic_J end_ARG start_ARG italic_J start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT end_ARG ) , italic_J = divide start_ARG italic_i end_ARG start_ARG italic_A end_ARG (4e)

where ΔGΔ𝐺\Delta Groman_Δ italic_G is the Gibbs free energy. ΔSΔ𝑆\Delta Sroman_Δ italic_S is the entropy change. R𝑅Ritalic_R is the gas constant (8.314 J/(K\cdotmol)). PH2subscript𝑃subscriptH2P_{\mathrm{H}_{2}}italic_P start_POSTSUBSCRIPT roman_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT and PO2subscript𝑃subscriptO2P_{\mathrm{O}_{2}}italic_P start_POSTSUBSCRIPT roman_O start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT are the partial pressures of hydrogen and oxygen respectively. Trefsubscript𝑇refT_{\text{ref}}italic_T start_POSTSUBSCRIPT ref end_POSTSUBSCRIPT is the reference temperature (298.15 K). CO2subscript𝐶subscriptO2C_{\mathrm{O}_{2}}italic_C start_POSTSUBSCRIPT roman_O start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT is the oxygen concentration at the surface of the cathode catalyst. rMsubscript𝑟𝑀r_{M}italic_r start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT is the resistivity of the electrolyte membrane. l𝑙litalic_l is the thickness of the electrolyte membrane. B𝐵Bitalic_B is the concentration overpotential coefficient. J𝐽Jitalic_J and Jmaxsubscript𝐽maxJ_{\text{max}}italic_J start_POSTSUBSCRIPT max end_POSTSUBSCRIPT are the current density and its maximum value. a1subscript𝑎1a_{1}italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, a2subscript𝑎2a_{2}italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, a3subscript𝑎3a_{3}italic_a start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT, a4subscript𝑎4a_{4}italic_a start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT are constants that can be learned from the experimental data.

(2) Approximate discharging efficiency

According to Faraday’s law, the hydrogen consumption rate is defined as (5a). The discharging efficiency is given by (5b).

hd=3600MiN2Fsuperscriptd3600𝑀𝑖𝑁2𝐹\displaystyle h^{\text{d}}=3600\cdot\dfrac{M\cdot i\cdot N}{2F}italic_h start_POSTSUPERSCRIPT d end_POSTSUPERSCRIPT = 3600 ⋅ divide start_ARG italic_M ⋅ italic_i ⋅ italic_N end_ARG start_ARG 2 italic_F end_ARG (5a)
ηH,d=PStackhdHHV=2FUcell3600MHHVsuperscript𝜂Hdsubscript𝑃StacksuperscriptdHHV2𝐹subscript𝑈cell3600𝑀HHV\displaystyle\eta^{\text{H},{\text{d}}}=\dfrac{P_{\text{Stack}}}{h^{\text{d}}% \cdot\text{HHV}}=\dfrac{2F\cdot U_{\text{cell}}}{3600M\cdot\text{HHV}}italic_η start_POSTSUPERSCRIPT H , d end_POSTSUPERSCRIPT = divide start_ARG italic_P start_POSTSUBSCRIPT Stack end_POSTSUBSCRIPT end_ARG start_ARG italic_h start_POSTSUPERSCRIPT d end_POSTSUPERSCRIPT ⋅ HHV end_ARG = divide start_ARG 2 italic_F ⋅ italic_U start_POSTSUBSCRIPT cell end_POSTSUBSCRIPT end_ARG start_ARG 3600 italic_M ⋅ HHV end_ARG (5b)

where hdsuperscriptdh^{\text{d}}italic_h start_POSTSUPERSCRIPT d end_POSTSUPERSCRIPT is the hydrogen consumption rate of PEMFC. HHV is the higher heat value of hydrogen, i.e., 39.4 kWh/kg.

The blue curves from the semi-empirical model in Fig. 3 are non-linear and power-dependent. We also adopt a piecewise linear approximation for hydrogen consumption, depicted by red dashed lines.

Refer to caption
Figure 3: Approximation of PEMFC properties at 60Csuperscript60C60\,^{\circ}\mathrm{C}60 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT roman_C, 10 bar: (a) hydrogen consumption, (b) efficiency.

2.4 Equivalent hydrogen storage model

The equivalent hydrogen storage model is presented in (6). Constraint (6a) defines the relationship between SoC, charge power, and discharge power. Constraints (6b) limit the SoC of hydrogen storage within the bounds. Constraint (6c) guarantees ensures a sustainable energy state for hydrogen storage over cycles. Constraints (6d)-(6f) describe the tractable formulation of piecewise linear charging and discharging functions. Constraints (6g) limit hydrogen storage’s charging and discharging power.

Constraints: t𝛀Tp𝛀Pfor-all𝑡subscript𝛀𝑇for-all𝑝subscript𝛀𝑃\forall t\in{{\bm{\Omega}}_{T}}~{}\forall p\in{{\bm{\Omega}}_{{P}}}∀ italic_t ∈ bold_Ω start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT ∀ italic_p ∈ bold_Ω start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT

Et+1H=EtH+Δt(htchtd)EtH,Lsuperscriptsubscript𝐸𝑡1Hsuperscriptsubscript𝐸𝑡HΔ𝑡superscriptsubscript𝑡csuperscriptsubscript𝑡dsuperscriptsubscript𝐸𝑡H,L\displaystyle E_{t+1}^{\text{H}}=E_{t}^{\text{H}}+\Delta t(h_{t}^{\text{c}}-h_% {t}^{\text{d}})-E_{t}^{\text{H,L}}italic_E start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT = italic_E start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT + roman_Δ italic_t ( italic_h start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT c end_POSTSUPERSCRIPT - italic_h start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT d end_POSTSUPERSCRIPT ) - italic_E start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H,L end_POSTSUPERSCRIPT (6a)
E¯HEtHE¯Hsuperscript¯𝐸Hsuperscriptsubscript𝐸𝑡Hsuperscript¯𝐸H\displaystyle\underline{E}^{\text{H}}\leq E_{t}^{\text{H}}\leq\overline{E}^{% \text{H}}under¯ start_ARG italic_E end_ARG start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ≤ italic_E start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ≤ over¯ start_ARG italic_E end_ARG start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT (6b)
ETHE0Hsuperscriptsubscript𝐸𝑇Hsuperscriptsubscript𝐸0H\displaystyle E_{T}^{\text{H}}\geq E_{0}^{\text{H}}italic_E start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ≥ italic_E start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT (6c)
htc=p(ApPp,tH,c+Bpzp,tc)htd=p(CpPp,tH,d+Dpzp,td)superscriptsubscript𝑡csubscript𝑝subscript𝐴𝑝superscriptsubscript𝑃𝑝,𝑡H,csubscript𝐵𝑝superscriptsubscript𝑧𝑝,𝑡csuperscriptsubscript𝑡dsubscript𝑝subscript𝐶𝑝superscriptsubscript𝑃𝑝,𝑡H,dsubscript𝐷𝑝superscriptsubscript𝑧𝑝,𝑡d\displaystyle h_{t}^{\text{c}}=\sum_{p}(A_{p}P_{p\text{,}t}^{\text{H\text{,}c}% }+B_{p}z_{p\text{,}t}^{\text{c}})\text{, }h_{t}^{\text{d}}=\sum_{p}(C_{p}P_{p% \text{,}t}^{\text{H,d}}+D_{p}z_{p\text{,}t}^{\text{d}})italic_h start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT c end_POSTSUPERSCRIPT = ∑ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_A start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_p , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H , c end_POSTSUPERSCRIPT + italic_B start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_p , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT c end_POSTSUPERSCRIPT ) , italic_h start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT d end_POSTSUPERSCRIPT = ∑ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_C start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_p , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H,d end_POSTSUPERSCRIPT + italic_D start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_p , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT d end_POSTSUPERSCRIPT ) (6d)
PtH,c=pPp,tH,cPtH,d=pPp,tH,dsuperscriptsubscript𝑃𝑡H,csubscript𝑝superscriptsubscript𝑃𝑝,𝑡H,csuperscriptsubscript𝑃𝑡H,dsubscript𝑝superscriptsubscript𝑃𝑝,𝑡H,d\displaystyle P_{t}^{\text{H\text{,}c}}=\sum\nolimits_{p}P_{p\text{,}t}^{\text% {H\text{,}c}}\text{, }P_{t}^{\text{H,d}}=\sum\nolimits_{p}P_{p\text{,}t}^{% \text{H,d}}italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H , c end_POSTSUPERSCRIPT = ∑ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_p , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H , c end_POSTSUPERSCRIPT , italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H,d end_POSTSUPERSCRIPT = ∑ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_p , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H,d end_POSTSUPERSCRIPT (6e)
pzp,tc=1pzp,td=1subscript𝑝superscriptsubscript𝑧𝑝,𝑡c1subscript𝑝superscriptsubscript𝑧𝑝,𝑡d1\displaystyle\sum\nolimits_{p}z_{p\text{,}t}^{\text{c}}=1\text{, }\sum% \nolimits_{p}z_{p\text{,}t}^{\text{d}}=1∑ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_p , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT c end_POSTSUPERSCRIPT = 1 , ∑ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_p , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT d end_POSTSUPERSCRIPT = 1 (6f)
P¯pHzp,tcPp,tH,cP¯pHzp,tc0Pp,tH,dP¯pHzp,tdsuperscriptsubscript¯𝑃𝑝Hsuperscriptsubscript𝑧𝑝,𝑡csuperscriptsubscript𝑃𝑝,𝑡H,csuperscriptsubscript¯𝑃𝑝Hsuperscriptsubscript𝑧𝑝,𝑡c0superscriptsubscript𝑃𝑝,𝑡H,dsuperscriptsubscript¯𝑃𝑝Hsuperscriptsubscript𝑧𝑝,𝑡d\displaystyle\underline{P}_{p}^{\text{H}}z_{p\text{,}t}^{\text{c}}\leq P_{p% \text{,}t}^{\text{H\text{,}c}}\leq\overline{P}_{p}^{\text{H}}z_{p\text{,}t}^{% \text{c}}\text{, }0\leq P_{p\text{,}t}^{\text{H,d}}\leq\overline{P}_{p}^{\text% {H}}z_{p\text{,}t}^{\text{d}}under¯ start_ARG italic_P end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT italic_z start_POSTSUBSCRIPT italic_p , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT c end_POSTSUPERSCRIPT ≤ italic_P start_POSTSUBSCRIPT italic_p , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H , c end_POSTSUPERSCRIPT ≤ over¯ start_ARG italic_P end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT italic_z start_POSTSUBSCRIPT italic_p , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT c end_POSTSUPERSCRIPT , 0 ≤ italic_P start_POSTSUBSCRIPT italic_p , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H,d end_POSTSUPERSCRIPT ≤ over¯ start_ARG italic_P end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT italic_z start_POSTSUBSCRIPT italic_p , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT d end_POSTSUPERSCRIPT (6g)

where 𝛀Tsubscript𝛀𝑇{{\bm{\Omega}}_{T}}bold_Ω start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT and 𝛀Ssubscript𝛀𝑆{{\bm{\Omega}}_{S}}bold_Ω start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT are the set of time and parameter segments, respectively. PtH,csuperscriptsubscript𝑃𝑡H,cP_{t}^{\text{H,c}}italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H,c end_POSTSUPERSCRIPT, PtH,dsuperscriptsubscript𝑃𝑡H,dP_{t}^{\text{H,d}}italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H,d end_POSTSUPERSCRIPT, and EtHsuperscriptsubscript𝐸𝑡HE_{t}^{\text{H}}italic_E start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT are decision variables for the charge power, discharge power, and SoC of hydrogen storage. The SoC of hydrogen storage can be measured by the hydrogen mass or as a ratio of the rated capacity. EtH,Lsuperscriptsubscript𝐸𝑡H,LE_{t}^{\text{H,L}}italic_E start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H,L end_POSTSUPERSCRIPT is the hydrogen load for industrial production processes, such as fertilizer manufacturing and steel-making. E¯Hsuperscript¯𝐸H\underline{E}^{\text{H}}under¯ start_ARG italic_E end_ARG start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT and E¯Hsuperscript¯𝐸H\overline{E}^{\text{H}}over¯ start_ARG italic_E end_ARG start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT are the lower and upper bounds of SoC. P¯Hsuperscript¯𝑃H\underline{P}^{\text{H}}under¯ start_ARG italic_P end_ARG start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT and P¯Hsuperscript¯𝑃H\overline{P}^{\text{H}}over¯ start_ARG italic_P end_ARG start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT are the lower and upper bounds of power. The lower charging power bound is set by the minimum operating power of the electrolyzer, typically 15%-20% of the nominal power. Apsubscript𝐴𝑝A_{p}italic_A start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT and Bpsubscript𝐵𝑝B_{p}italic_B start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT are the slope and the intercept of piecewise linear charging segments. Cpsubscript𝐶𝑝C_{p}italic_C start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT and Dpsubscript𝐷𝑝D_{p}italic_D start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT are the slope and the intercept of piecewise linear discharging segments. zp,tcsuperscriptsubscript𝑧𝑝,𝑡cz_{p\text{,}t}^{\text{c}}italic_z start_POSTSUBSCRIPT italic_p , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT c end_POSTSUPERSCRIPT and zp,tdsuperscriptsubscript𝑧𝑝,𝑡dz_{p\text{,}t}^{\text{d}}italic_z start_POSTSUBSCRIPT italic_p , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT d end_POSTSUPERSCRIPT are binary variables for piecewise linear function.

3 Long-term energy management of microgrid

3.1 Microgrid structure

In this paper, we only consider the island mode of the microgrid, and the microgrid structure is illustrated in Fig. 4. The microgrid consists of renewable generators (wind and solar), diesel generators, H-BES and local loads.

Refer to caption
Figure 4: Diagram of microgrid structure.

3.2 Problem formulation

The objective defined in (7) aims to minimize the system cost. This cost comprises the production costs of the diesel generator, penalties for load curtailment (island mode), and operational costs of H-BES. Constraints (8a) and (8b) define the power bounds and ramping bounds of the diesel generator. Constraints (9) define the constraints for battery, which are similar in formulation to those for hydrogen storage (6), as both types of storage involve constraints on charging and discharging rates, SoC, etc. However, it is important to note that the battery efficiency is considered to be constant, there is no minimum charging power limit in (9d), and the self-discharge rate should be considered in (9a). Constraints (10) limit the load curtailment and dispatchable RES. Constraint (11) limits the power import from the main grid. Power balance constraint is defined as (12). The complementary constraints for charging and discharging of battery and hydrogen storage are relaxed and have been removed from the model since sufficient conditions are satisfied [18], i.e., discharging price (“+”) is greater than the charging price (“0”). Moreover, the power flow constraints are overlooked within the dispatch model since the microgrid network is generally designed with high reliability and large redundancy [29].

Objective Function:

min𝒙G(𝒙,𝝃)=t𝛀T(CtL+CtD+CtB+CtH)𝒙𝐺𝒙𝝃subscript𝑡subscript𝛀𝑇superscriptsubscript𝐶𝑡Lsuperscriptsubscript𝐶𝑡Dsuperscriptsubscript𝐶𝑡Bsuperscriptsubscript𝐶𝑡H\displaystyle\underset{\bm{x}}{\min}\ G(\bm{x},{\bm{\xi}})=\sum_{t\in\bm{% \Omega}_{T}}\left(C_{t}^{\text{L}}+C_{t}^{\text{D}}+C_{t}^{\text{B}}+C_{t}^{% \text{H}}\right)underbold_italic_x start_ARG roman_min end_ARG italic_G ( bold_italic_x , bold_italic_ξ ) = ∑ start_POSTSUBSCRIPT italic_t ∈ bold_Ω start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT L end_POSTSUPERSCRIPT + italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT D end_POSTSUPERSCRIPT + italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT B end_POSTSUPERSCRIPT + italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ) (7a)
CtL=cLPtLΔtCtD=cDPtDΔtCtB/H=cB/HPtB/H,dΔtsuperscriptsubscript𝐶𝑡Lsuperscript𝑐Lsuperscriptsubscript𝑃𝑡LΔ𝑡superscriptsubscript𝐶𝑡Dsuperscript𝑐Dsuperscriptsubscript𝑃𝑡DΔ𝑡superscriptsubscript𝐶𝑡B/Hsuperscript𝑐B/Hsuperscriptsubscript𝑃𝑡B/H,dΔ𝑡\displaystyle C_{t}^{\text{L}}=c^{\text{L}}P_{t}^{\text{L}}\Delta t\text{, }C_% {t}^{\text{D}}=c^{\text{D}}P_{t}^{\text{D}}\Delta t\text{, }C_{t}^{\text{B/H}}% =c^{\text{B/H}}P_{t}^{\text{B/H,d}}\Delta titalic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT L end_POSTSUPERSCRIPT = italic_c start_POSTSUPERSCRIPT L end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT L end_POSTSUPERSCRIPT roman_Δ italic_t , italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT D end_POSTSUPERSCRIPT = italic_c start_POSTSUPERSCRIPT D end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT D end_POSTSUPERSCRIPT roman_Δ italic_t , italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT B/H end_POSTSUPERSCRIPT = italic_c start_POSTSUPERSCRIPT B/H end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT B/H,d end_POSTSUPERSCRIPT roman_Δ italic_t (7b)

where cLsuperscript𝑐Lc^{\text{L}}italic_c start_POSTSUPERSCRIPT L end_POSTSUPERSCRIPT and PtLsuperscriptsubscript𝑃𝑡LP_{t}^{\text{L}}italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT L end_POSTSUPERSCRIPT are the load curtailment price and load curtailment power. cDsuperscript𝑐Dc^{\text{D}}italic_c start_POSTSUPERSCRIPT D end_POSTSUPERSCRIPT and PtDsuperscriptsubscript𝑃𝑡DP_{t}^{\text{D}}italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT D end_POSTSUPERSCRIPT are the fuel price and power of diesel generator. cBsuperscript𝑐Bc^{\text{B}}italic_c start_POSTSUPERSCRIPT B end_POSTSUPERSCRIPT and cHsuperscript𝑐Hc^{\text{H}}italic_c start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT are marginal discharge costs of battery and hydrogen storage. ΔtΔ𝑡\Delta troman_Δ italic_t is the time interval.

Constraints: t𝛀Tfor-all𝑡subscript𝛀𝑇\forall t\in{{\bm{\Omega}}_{T}}~{}∀ italic_t ∈ bold_Ω start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT

P¯DPtDP¯Dsuperscript¯𝑃Dsuperscriptsubscript𝑃𝑡Dsuperscript¯𝑃D\displaystyle\underline{P}^{\text{D}}\leq P_{t}^{\text{D}}\leq\overline{P}^{% \text{D}}under¯ start_ARG italic_P end_ARG start_POSTSUPERSCRIPT D end_POSTSUPERSCRIPT ≤ italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT D end_POSTSUPERSCRIPT ≤ over¯ start_ARG italic_P end_ARG start_POSTSUPERSCRIPT D end_POSTSUPERSCRIPT (8a)
RDDPt+1DPtDRUD𝑅superscript𝐷Dsuperscriptsubscript𝑃𝑡1Dsuperscriptsubscript𝑃𝑡D𝑅superscript𝑈D\displaystyle-RD^{\text{D}}\leq P_{t+1}^{\text{D}}-P_{t}^{\text{D}}\leq RU^{% \text{D}}- italic_R italic_D start_POSTSUPERSCRIPT D end_POSTSUPERSCRIPT ≤ italic_P start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT D end_POSTSUPERSCRIPT - italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT D end_POSTSUPERSCRIPT ≤ italic_R italic_U start_POSTSUPERSCRIPT D end_POSTSUPERSCRIPT (8b)
Et+1B=(1εΔt)EtB+Δt(ηB,cPtB,cPtB,d/ηB,d)superscriptsubscript𝐸𝑡1B1𝜀Δ𝑡superscriptsubscript𝐸𝑡BΔ𝑡superscript𝜂B,csuperscriptsubscript𝑃𝑡B,csuperscriptsubscript𝑃𝑡B,dsuperscript𝜂B,d\displaystyle E_{t+1}^{\text{B}}=(1-\varepsilon\Delta t)E_{t}^{\text{B}}+% \Delta t(\eta^{\text{B,c}}P_{t}^{\text{B,c}}-P_{t}^{\text{B,d}}/{\eta^{\text{B% ,d}}})italic_E start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT B end_POSTSUPERSCRIPT = ( 1 - italic_ε roman_Δ italic_t ) italic_E start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT B end_POSTSUPERSCRIPT + roman_Δ italic_t ( italic_η start_POSTSUPERSCRIPT B,c end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT B,c end_POSTSUPERSCRIPT - italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT B,d end_POSTSUPERSCRIPT / italic_η start_POSTSUPERSCRIPT B,d end_POSTSUPERSCRIPT ) (9a)
E¯BEtBE¯Bsuperscript¯𝐸Bsuperscriptsubscript𝐸𝑡Bsuperscript¯𝐸B\displaystyle\underline{E}^{\text{B}}\leq E_{t}^{\text{B}}\leq\overline{E}^{% \text{B}}under¯ start_ARG italic_E end_ARG start_POSTSUPERSCRIPT B end_POSTSUPERSCRIPT ≤ italic_E start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT B end_POSTSUPERSCRIPT ≤ over¯ start_ARG italic_E end_ARG start_POSTSUPERSCRIPT B end_POSTSUPERSCRIPT (9b)
ETBE0Bsuperscriptsubscript𝐸𝑇Bsuperscriptsubscript𝐸0B\displaystyle E_{T}^{\text{B}}\geq E_{0}^{\text{B}}italic_E start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT start_POSTSUPERSCRIPT B end_POSTSUPERSCRIPT ≥ italic_E start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT B end_POSTSUPERSCRIPT (9c)
0PtB,cP¯B0superscriptsubscript𝑃𝑡B,csuperscript¯𝑃B\displaystyle 0\leq P_{t}^{\text{B,c}}\leq\overline{P}^{\text{B}}0 ≤ italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT B,c end_POSTSUPERSCRIPT ≤ over¯ start_ARG italic_P end_ARG start_POSTSUPERSCRIPT B end_POSTSUPERSCRIPT (9d)
0PtB,dP¯B0superscriptsubscript𝑃𝑡B,dsuperscript¯𝑃B\displaystyle 0\leq P_{t}^{\text{B,d}}\leq\overline{P}^{\text{B}}0 ≤ italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT B,d end_POSTSUPERSCRIPT ≤ over¯ start_ARG italic_P end_ARG start_POSTSUPERSCRIPT B end_POSTSUPERSCRIPT (9e)
0PtLξtL0superscriptsubscript𝑃𝑡Lsuperscriptsubscript𝜉𝑡L\displaystyle 0\leq P_{t}^{\text{L}}\leq\xi_{t}^{\text{L}}0 ≤ italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT L end_POSTSUPERSCRIPT ≤ italic_ξ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT L end_POSTSUPERSCRIPT (10a)
0PtRξtR0superscriptsubscript𝑃𝑡Rsuperscriptsubscript𝜉𝑡R\displaystyle 0\leq P_{t}^{\text{R}}\leq\xi_{t}^{\text{R}}0 ≤ italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT R end_POSTSUPERSCRIPT ≤ italic_ξ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT R end_POSTSUPERSCRIPT (10b)
0PtGP¯tG0superscriptsubscript𝑃𝑡Gsuperscriptsubscript¯𝑃𝑡G\displaystyle 0\leq P_{t}^{\text{G}}\leq\overline{P}_{t}^{\text{G}}0 ≤ italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT G end_POSTSUPERSCRIPT ≤ over¯ start_ARG italic_P end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT G end_POSTSUPERSCRIPT (11)
PtG+PtR+(PtB,dPtB,c)+(PtH,dPtH,c)+PtL=ξtLsuperscriptsubscript𝑃𝑡Gsuperscriptsubscript𝑃𝑡Rsuperscriptsubscript𝑃𝑡B,dsuperscriptsubscript𝑃𝑡B,csuperscriptsubscript𝑃𝑡H,dsuperscriptsubscript𝑃𝑡H,csuperscriptsubscript𝑃𝑡Lsuperscriptsubscript𝜉𝑡L\displaystyle P_{t}^{\text{G}}+P_{t}^{\text{R}}+(P_{t}^{\text{B,d}}-P_{t}^{% \text{B,c}})+(P_{t}^{\text{H,d}}-P_{t}^{\text{H,c}})+P_{t}^{\text{L}}=\xi_{t}^% {\text{L}}italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT G end_POSTSUPERSCRIPT + italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT R end_POSTSUPERSCRIPT + ( italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT B,d end_POSTSUPERSCRIPT - italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT B,c end_POSTSUPERSCRIPT ) + ( italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H,d end_POSTSUPERSCRIPT - italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H,c end_POSTSUPERSCRIPT ) + italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT L end_POSTSUPERSCRIPT = italic_ξ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT L end_POSTSUPERSCRIPT (12)

where P¯Dsuperscript¯𝑃D\underline{P}^{\text{D}}under¯ start_ARG italic_P end_ARG start_POSTSUPERSCRIPT D end_POSTSUPERSCRIPT and P¯Dsuperscript¯𝑃D\overline{P}^{\text{D}}over¯ start_ARG italic_P end_ARG start_POSTSUPERSCRIPT D end_POSTSUPERSCRIPT are the lower and upper power bounds of diesel generator. RDD𝑅superscript𝐷DRD^{\text{D}}italic_R italic_D start_POSTSUPERSCRIPT D end_POSTSUPERSCRIPT and RUD𝑅superscript𝑈DRU^{\text{D}}italic_R italic_U start_POSTSUPERSCRIPT D end_POSTSUPERSCRIPT are downward and upward ramping rates of diesel generator. PtB,csuperscriptsubscript𝑃𝑡B,cP_{t}^{\text{B,c}}italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT B,c end_POSTSUPERSCRIPT, PtB,dsuperscriptsubscript𝑃𝑡B,dP_{t}^{\text{B,d}}italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT B,d end_POSTSUPERSCRIPT, and EtBsuperscriptsubscript𝐸𝑡BE_{t}^{\text{B}}italic_E start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT B end_POSTSUPERSCRIPT are decision variables for the charge power, discharge power, and SoC of battery. ηB,csuperscript𝜂B,c\eta^{\text{B,c}}italic_η start_POSTSUPERSCRIPT B,c end_POSTSUPERSCRIPT and ηB,dsuperscript𝜂B,d\eta^{\text{B,d}}italic_η start_POSTSUPERSCRIPT B,d end_POSTSUPERSCRIPT are the charge and discharge efficiency of battery. ε𝜀\varepsilonitalic_ε is the self-discharge rate of battery. E¯Bsuperscript¯𝐸B\underline{E}^{\text{B}}under¯ start_ARG italic_E end_ARG start_POSTSUPERSCRIPT B end_POSTSUPERSCRIPT and E¯Bsuperscript¯𝐸B\overline{E}^{\text{B}}over¯ start_ARG italic_E end_ARG start_POSTSUPERSCRIPT B end_POSTSUPERSCRIPT are the lower and upper SoC bounds of battery. P¯Bsuperscript¯𝑃B\overline{P}^{\text{B}}over¯ start_ARG italic_P end_ARG start_POSTSUPERSCRIPT B end_POSTSUPERSCRIPT is the upper power bound of battery. ξtLsuperscriptsubscript𝜉𝑡L\xi_{t}^{\text{L}}italic_ξ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT L end_POSTSUPERSCRIPT and ξtRsuperscriptsubscript𝜉𝑡R\xi_{t}^{\text{R}}italic_ξ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT R end_POSTSUPERSCRIPT are the load power and available RES power with uncertainties. PtRsuperscriptsubscript𝑃𝑡RP_{t}^{\text{R}}italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT R end_POSTSUPERSCRIPT is the dispatched RES power. The set of stochastic parameters is given by 𝝃={ξtL,ξtR}𝝃superscriptsubscript𝜉𝑡Lsuperscriptsubscript𝜉𝑡R\bm{\xi}=\{\xi_{t}^{\text{L}},\xi_{t}^{\text{R}}\}bold_italic_ξ = { italic_ξ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT L end_POSTSUPERSCRIPT , italic_ξ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT R end_POSTSUPERSCRIPT }. The set of decision variables is given by 𝒙={PtL,PtD,PtR,PtB,c/d,PtH,c/d,EtB,EtH,htc/d,ztc/d}𝒙superscriptsubscript𝑃𝑡Lsuperscriptsubscript𝑃𝑡Dsuperscriptsubscript𝑃𝑡Rsuperscriptsubscript𝑃𝑡B,c/dsuperscriptsubscript𝑃𝑡H,c/dsuperscriptsubscript𝐸𝑡Bsuperscriptsubscript𝐸𝑡Hsuperscriptsubscript𝑡c/dsuperscriptsubscript𝑧𝑡c/d\bm{x}=\{P_{t}^{\text{L}},P_{t}^{\text{D}},P_{t}^{\text{R}},P_{t}^{\text{B,c/d% }},P_{t}^{\text{H,c/d}},E_{t}^{\text{B}},E_{t}^{\text{H}},h_{t}^{\text{c/d}},z% _{t}^{\text{c/d}}\}bold_italic_x = { italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT L end_POSTSUPERSCRIPT , italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT D end_POSTSUPERSCRIPT , italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT R end_POSTSUPERSCRIPT , italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT B,c/d end_POSTSUPERSCRIPT , italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H,c/d end_POSTSUPERSCRIPT , italic_E start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT B end_POSTSUPERSCRIPT , italic_E start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT , italic_h start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT c/d end_POSTSUPERSCRIPT , italic_z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT c/d end_POSTSUPERSCRIPT }.

The multi-time-period economic dispatch of microgrid with H-BES (P1subscriptP1\textbf{P}_{1}P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT) is summarized in (3.2). Next, we present the methodology for solving this problem.

(P1)min𝒙subscriptP1𝒙\displaystyle(\textbf{P}_{1})\hskip 4.0pt\underset{\bm{x}}{\min}( P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) underbold_italic_x start_ARG roman_min end_ARG G(𝒙,𝝃)𝐺𝒙𝝃\displaystyle{\hskip 4.0pt}G(\bm{x},{\bm{\xi}})italic_G ( bold_italic_x , bold_italic_ξ )
s.t. (6)(8)(12)italic-(6italic-)italic-(8italic-)italic-(12italic-)\displaystyle{\hskip 4.0pt}\eqref{hydrogen}\text{, }\eqref{DG}-\eqref{powerbalance}italic_( italic_) , italic_( italic_) - italic_( italic_) (13)

4 Prediction-free coordinated optimization framework

4.1 Motivations

Solving the problem (3.2) has the following challenges:

(1) Non-anticipatively : The long-term energy management of the microgrid typically spans more than one month or one season. Nevertheless, the forecast accuracy is acceptable only for several hours ahead. Hence, the load power and available RES power are unanticipated in the long-term optimization. And online optimization methods should be adopted to decompose the long-term optimization problem into several short-term optimization problems.

(2) Storage Dispatch Priority: Batteries with lower marginal discharge costs will be given priority over hydrogen storage with higher marginal discharge costs. We defer the complete proof to Appendix A. The battery-prioritized strategy is feasible and economical for short-term operation. However, this approach does not account for seasonal variations in RES and load, which will result in a lack of pre-stored hydrogen and load losses in long-term operations. Therefore, it is necessary to design a “reference” with a global view to help guide hydrogen storage actions.

(3) Convexity: The piecewise linearization will introduce nonconvexity to the optimization, which contradicts the overall logic of most convex optimization approaches. However, introducing a global “reference” can mitigate this challenge by pre-determining the efficiency.

4.2 Two-stage coordinated optimization framework

We propose a two-stage coordinated optimization framework as illustrated in Fig 5. The proposed framework consists of both online and offline stage optimization. The offline stage aims to generate the ex-post SoC references for hydrogen storage using historical data on RES and load. These references can help avoid myopic decision-making and will be incrementally updated by kernel regression with newly observed data. Subsequently, online decisions are made using an adaptive virtual-queue-based OCO algorithm.

Refer to caption
Figure 5: Diagram of prediction-free two-stage coordinated optimization framework.

4.3 Offline-stage optimization

Firstly, sequential sequences of scenarios, denoted as 𝝃𝒔={ξs,tL,ξs,tR}subscript𝝃𝒔superscriptsubscript𝜉𝑠𝑡Lsuperscriptsubscript𝜉𝑠𝑡R\bm{\xi_{s}}=\{\xi_{s,t}^{\text{L}},\xi_{s,t}^{\text{R}}\}bold_italic_ξ start_POSTSUBSCRIPT bold_italic_s end_POSTSUBSCRIPT = { italic_ξ start_POSTSUBSCRIPT italic_s , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT L end_POSTSUPERSCRIPT , italic_ξ start_POSTSUBSCRIPT italic_s , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT R end_POSTSUPERSCRIPT },t𝛀T={1,2,,T},s𝛀S={1,2,,N}formulae-sequence𝑡subscript𝛀𝑇12𝑇𝑠subscript𝛀𝑆12𝑁t\in{{\bm{\Omega}}_{T}}=\{1,2,\cdots,T\},s\in{{\bm{\Omega}}_{S}}=\{1,2,\cdots,N\}italic_t ∈ bold_Ω start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT = { 1 , 2 , ⋯ , italic_T } , italic_s ∈ bold_Ω start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT = { 1 , 2 , ⋯ , italic_N }, are generated from historical data of the past few years. Additionally, to account for climate change and enhance the diversity of references, we can also collect references from different months and seasons. For instance, if we focus on a seasonal dispatch problem and have historical data for 5 years, then T=1𝑇1T=1italic_T = 1 season and N=5×4𝑁54N=5\times 4italic_N = 5 × 4. To enhance adaptability to extreme weather conditions, we add extreme scenarios into the historical data using Generative Adversarial Networks [11]. Afterward, we can solve the deterministic mixed-integer linear programming (MILP) as (4.3) to generate the SoC references of hydrogen storage, i.e., 𝑬sH,*={Es,tH,*}superscriptsubscript𝑬𝑠H,*superscriptsubscript𝐸𝑠𝑡H,*\bm{E}_{s}^{\text{H,*}}=\{E_{s,t}^{\text{H,*}}\}bold_italic_E start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H,* end_POSTSUPERSCRIPT = { italic_E start_POSTSUBSCRIPT italic_s , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H,* end_POSTSUPERSCRIPT },t𝛀T,s𝛀Sformulae-sequence𝑡subscript𝛀𝑇𝑠subscript𝛀𝑆t\in{{\bm{\Omega}}_{T}},s\in{{\bm{\Omega}}_{S}}italic_t ∈ bold_Ω start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT , italic_s ∈ bold_Ω start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT.

(P2)min𝒙𝒔subscriptP2subscript𝒙𝒔\displaystyle(\textbf{P}_{2})\hskip 4.0pt\underset{\bm{x_{s}}}{\min}( P start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) start_UNDERACCENT bold_italic_x start_POSTSUBSCRIPT bold_italic_s end_POSTSUBSCRIPT end_UNDERACCENT start_ARG roman_min end_ARG G(𝒙𝒔,𝝃𝒔)𝐺subscript𝒙𝒔subscript𝝃𝒔\displaystyle{\hskip 4.0pt}G(\bm{x_{s}},{\bm{\xi_{s}}})italic_G ( bold_italic_x start_POSTSUBSCRIPT bold_italic_s end_POSTSUBSCRIPT , bold_italic_ξ start_POSTSUBSCRIPT bold_italic_s end_POSTSUBSCRIPT )
s.t. (6)(8)(12)italic-(6italic-)italic-(8italic-)italic-(12italic-)\displaystyle{\hskip 4.0pt}\eqref{hydrogen}\text{, }\eqref{DG}-\eqref{powerbalance}italic_( italic_) , italic_( italic_) - italic_( italic_) (14)

4.4 Online-stage optimization

(1) Data-Driven Reference Tracking

Inspired by [13], we propose a data-driven reference tracking method to combine both the ’lookback’ pattern from historical data and the ’lookahead’ pattern from newly observed data. Firstly, we define 𝝃[t]subscript𝝃delimited-[]𝑡\bm{\xi}_{[t]}bold_italic_ξ start_POSTSUBSCRIPT [ italic_t ] end_POSTSUBSCRIPT as the observed sequence for uncertainties from the first time slot to the current time slot t in (15a). Additionally, 𝝃s,[t]subscript𝝃𝑠delimited-[]𝑡\bm{\xi}_{s,[t]}bold_italic_ξ start_POSTSUBSCRIPT italic_s , [ italic_t ] end_POSTSUBSCRIPT defined in (15b) represents the corresponding historical sequence for uncertainties in scenario s. Subsequently, by checking the similarity between 𝝃[t]subscript𝝃delimited-[]𝑡\bm{\xi}_{[t]}bold_italic_ξ start_POSTSUBSCRIPT [ italic_t ] end_POSTSUBSCRIPT and 𝝃s,[t]subscript𝝃𝑠delimited-[]𝑡\bm{\xi}_{s,[t]}bold_italic_ξ start_POSTSUBSCRIPT italic_s , [ italic_t ] end_POSTSUBSCRIPT, dynamic weights ωs,tsubscript𝜔𝑠𝑡\omega_{s,t}italic_ω start_POSTSUBSCRIPT italic_s , italic_t end_POSTSUBSCRIPT are assigned to each historical scenario based on the Gaussian kernel function and Euclidean distance, as outlined in (15c). To account for the temporal dynamics, the Gaussian kernel function is modified with a scaling factor t. And the optimal bandwidth σ𝜎\sigmaitalic_σ can be found through heuristic methods such as the bisection method. Additionally, the weights are updated in real-time dispatch instead of using average or heuristic values. Finally, the SoC reference of hydrogen storage is updated as (15d). This updated reference also determines the efficiency segment of hydrogen storage, eliminating the nonconvexity issue that arises when using convex optimization approaches.

𝝃[t]={ξ1G,ξ1L,ξ1R,,ξtG,ξtL,ξtR}subscript𝝃delimited-[]𝑡superscriptsubscript𝜉1Gsuperscriptsubscript𝜉1Lsuperscriptsubscript𝜉1Rsuperscriptsubscript𝜉𝑡Gsuperscriptsubscript𝜉𝑡Lsuperscriptsubscript𝜉𝑡R\displaystyle\bm{\xi}_{[t]}=\{\xi_{1}^{\text{G}},\xi_{1}^{\text{L}},\xi_{1}^{% \text{R}},\cdots,\xi_{t}^{\text{G}},\xi_{t}^{\text{L}},\xi_{t}^{\text{R}}\}bold_italic_ξ start_POSTSUBSCRIPT [ italic_t ] end_POSTSUBSCRIPT = { italic_ξ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT G end_POSTSUPERSCRIPT , italic_ξ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT L end_POSTSUPERSCRIPT , italic_ξ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT R end_POSTSUPERSCRIPT , ⋯ , italic_ξ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT G end_POSTSUPERSCRIPT , italic_ξ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT L end_POSTSUPERSCRIPT , italic_ξ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT R end_POSTSUPERSCRIPT } (15a)
𝝃s,[t]={ξs,1G,ξs,1L,ξs,1R,,ξs,tG,ξs,tL,ξs,tR}subscript𝝃𝑠delimited-[]𝑡superscriptsubscript𝜉𝑠1Gsuperscriptsubscript𝜉𝑠1Lsuperscriptsubscript𝜉𝑠1Rsuperscriptsubscript𝜉𝑠𝑡Gsuperscriptsubscript𝜉𝑠𝑡Lsuperscriptsubscript𝜉𝑠𝑡R\displaystyle\bm{\xi}_{s,[t]}=\{\xi_{s,1}^{\text{G}},\xi_{s,1}^{\text{L}},\xi_% {s,1}^{\text{R}},\cdots,\xi_{s,t}^{\text{G}},\xi_{s,t}^{\text{L}},\xi_{s,t}^{% \text{R}}\}bold_italic_ξ start_POSTSUBSCRIPT italic_s , [ italic_t ] end_POSTSUBSCRIPT = { italic_ξ start_POSTSUBSCRIPT italic_s , 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT G end_POSTSUPERSCRIPT , italic_ξ start_POSTSUBSCRIPT italic_s , 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT L end_POSTSUPERSCRIPT , italic_ξ start_POSTSUBSCRIPT italic_s , 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT R end_POSTSUPERSCRIPT , ⋯ , italic_ξ start_POSTSUBSCRIPT italic_s , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT G end_POSTSUPERSCRIPT , italic_ξ start_POSTSUBSCRIPT italic_s , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT L end_POSTSUPERSCRIPT , italic_ξ start_POSTSUBSCRIPT italic_s , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT R end_POSTSUPERSCRIPT } (15b)
ωs,t=Kt(ξ[t],ξs,[t])s=1NKt(ξ[t],ξs,[t]),Kt(x,y)=e(xy2)2tσ2formulae-sequencesubscript𝜔𝑠𝑡subscript𝐾𝑡subscript𝜉delimited-[]𝑡subscript𝜉𝑠delimited-[]𝑡superscriptsubscriptsuperscript𝑠1𝑁subscript𝐾𝑡subscript𝜉delimited-[]𝑡subscript𝜉superscript𝑠delimited-[]𝑡subscript𝐾𝑡𝑥𝑦superscript𝑒superscriptsubscriptnorm𝑥𝑦22𝑡superscript𝜎2\displaystyle\omega_{s,t}=\dfrac{K_{t}(\xi_{[t]},\xi_{s,[t]})}{\sum_{{s^{% \prime}=1}}^{N}K_{t}(\xi_{[t]},\xi_{s^{\prime},{[t]}})},\ K_{t}(x,y)=e^{{-% \frac{(\|x-y\|_{2})^{2}}{t\sigma^{2}}}}italic_ω start_POSTSUBSCRIPT italic_s , italic_t end_POSTSUBSCRIPT = divide start_ARG italic_K start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT [ italic_t ] end_POSTSUBSCRIPT , italic_ξ start_POSTSUBSCRIPT italic_s , [ italic_t ] end_POSTSUBSCRIPT ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT [ italic_t ] end_POSTSUBSCRIPT , italic_ξ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , [ italic_t ] end_POSTSUBSCRIPT ) end_ARG , italic_K start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x , italic_y ) = italic_e start_POSTSUPERSCRIPT - divide start_ARG ( ∥ italic_x - italic_y ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_t italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG end_POSTSUPERSCRIPT (15c)
𝑬[t]H,R=s=1Nωs,t𝑬s,[t]H,*superscriptsubscript𝑬delimited-[]𝑡H,Rsuperscriptsubscript𝑠1𝑁subscript𝜔𝑠𝑡superscriptsubscript𝑬𝑠delimited-[]𝑡H,*\displaystyle\bm{E}_{[t]}^{\text{H,R}}=\sum\nolimits_{{s=1}}^{N}\omega_{s,t}% \bm{E}_{s,[t]}^{\text{H,*}}bold_italic_E start_POSTSUBSCRIPT [ italic_t ] end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H,R end_POSTSUPERSCRIPT = ∑ start_POSTSUBSCRIPT italic_s = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_ω start_POSTSUBSCRIPT italic_s , italic_t end_POSTSUBSCRIPT bold_italic_E start_POSTSUBSCRIPT italic_s , [ italic_t ] end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H,* end_POSTSUPERSCRIPT (15d)

(2) Real-Time Corrective Dispatch

Real-time corrective dispatch (P3subscriptP3\textbf{P}_{3}P start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT) is formulated in (4.4), which aims to minimize the instant operational cost while tracking the SoC reference of hydrogen storage. φ𝜑\varphiitalic_φ is the penalty coefficient to control the SoC deviation from the reference. P3subscriptP3\textbf{P}_{3}P start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT admits a compact form in (17a). ftsubscript𝑓𝑡f_{t}italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and gtsubscript𝑔𝑡g_{t}italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT represent the time-varying objective function and time-varying constraints due to hydrogen storage SoC reference EtH,Rsuperscriptsubscript𝐸𝑡H,R{E}_{t}^{\text{H,R}}italic_E start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H,R end_POSTSUPERSCRIPT and uncertainties 𝝃𝝃\bm{\xi}bold_italic_ξ, respectively. By leveraging the Lagrangian Relaxation, we can obtain the optimum by (17b). λtsubscript𝜆𝑡\lambda_{t}italic_λ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is the dual variables of the constraints gtsubscript𝑔𝑡g_{t}italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT. x,y𝑥𝑦\left\langle x,y\right\rangle⟨ italic_x , italic_y ⟩ denotes the standard inner product. However, without prior knowledge of uncertainties 𝝃𝝃\bm{\xi}bold_italic_ξ, ftsubscript𝑓𝑡f_{t}italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and gtsubscript𝑔𝑡g_{t}italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT are unknown to the online decision-maker. Hence, we next design a VQB-OCO algorithm to solve this issue.

(P3)min𝒙tsubscriptP3subscript𝒙𝑡\displaystyle(\textbf{P}_{3})\hskip 4.0pt\underset{\bm{x}_{t}}{\min}( P start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) start_UNDERACCENT bold_italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_UNDERACCENT start_ARG roman_min end_ARG G(𝒙t,𝝃t)+φ(EtHEtH,R)2𝐺subscript𝒙𝑡subscript𝝃𝑡𝜑superscriptsuperscriptsubscript𝐸𝑡Hsuperscriptsubscript𝐸𝑡H,R2\displaystyle{\hskip 4.0pt}G(\bm{x}_{t},{\bm{\xi}_{t}})+\varphi({E}_{t}^{\text% {H}}-{E}_{t}^{\text{H,R}})^{2}italic_G ( bold_italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_ξ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) + italic_φ ( italic_E start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT - italic_E start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H,R end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT
s.t. (6)(8)(12)italic-(6italic-)italic-(8italic-)italic-(12italic-)\displaystyle{\hskip 4.0pt}\eqref{hydrogen}\text{, }\eqref{DG}-\eqref{powerbalance}italic_( italic_) , italic_( italic_) - italic_( italic_) (16)
min𝒙ft(xt)s.t.gt(xt)0formulae-sequencesubscript𝒙subscript𝑓𝑡subscript𝑥𝑡stsubscript𝑔𝑡subscript𝑥𝑡0\displaystyle\min_{\bm{x}}\ f_{t}(x_{t})\ \mathrm{~{}s.t.~{}}g_{t}(x_{t})\leq 0roman_min start_POSTSUBSCRIPT bold_italic_x end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) roman_s . roman_t . italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ≤ 0 (17a)
xt=argminx{ft(x)+λt,gt(x)}subscript𝑥𝑡subscript𝑥subscript𝑓𝑡𝑥subscript𝜆𝑡subscript𝑔𝑡𝑥\displaystyle x_{t}=\arg\min_{x}\{f_{t}(x)+\left\langle\lambda_{t},\ g_{t}(x)% \right\rangle\}italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = roman_arg roman_min start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT { italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x ) + ⟨ italic_λ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x ) ⟩ } (17b)

(3) VQB-OCO Algorithm

The key idea of VQB-OCO is to use information from past time to approximate the current situation. The virtual queue is employed as the substitution of unknown dual variables. Hence, we design the update policy for the virtual queue, decisions, and weights in (18)-(20). Finally, we can obtain the weighted average value of dispatch decision as (21). The VQB-OCO algorithm and the overall two-stage coordinated optimization framework are summarized in Algorithm 1.

Qi,t1=Qi,t2+βt1[gt1(xt1)]+subscript𝑄𝑖𝑡1subscript𝑄𝑖𝑡2subscript𝛽𝑡1subscriptdelimited-[]subscript𝑔𝑡1subscript𝑥𝑡1Q_{i,t-1}=Q_{i,t-2}+\beta_{t-1}[g_{t-1}(x_{t-1})]_{+}italic_Q start_POSTSUBSCRIPT italic_i , italic_t - 1 end_POSTSUBSCRIPT = italic_Q start_POSTSUBSCRIPT italic_i , italic_t - 2 end_POSTSUBSCRIPT + italic_β start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT [ italic_g start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT ) ] start_POSTSUBSCRIPT + end_POSTSUBSCRIPT (18)
xi,tsubscript𝑥𝑖𝑡\displaystyle x_{i,t}italic_x start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT =argminx{αi,t1ft1(xt1),x+\displaystyle=\arg\min_{x}\{\alpha_{i,t-1}\left\langle\partial f_{t-1}(x_{t-1}% ),\ x\right\rangle+= roman_arg roman_min start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT { italic_α start_POSTSUBSCRIPT italic_i , italic_t - 1 end_POSTSUBSCRIPT ⟨ ∂ italic_f start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT ) , italic_x ⟩ + (19)
αi,t1βi,t1Qi,t1,[gt1(x)]++xxi,t12}\displaystyle\alpha_{i,t-1}\beta_{i,t-1}\left\langle Q_{i,t-1},\ [g_{t-1}(x)]_% {+}\right\rangle+\|x-x_{i,t-1}\|^{2}\}italic_α start_POSTSUBSCRIPT italic_i , italic_t - 1 end_POSTSUBSCRIPT italic_β start_POSTSUBSCRIPT italic_i , italic_t - 1 end_POSTSUBSCRIPT ⟨ italic_Q start_POSTSUBSCRIPT italic_i , italic_t - 1 end_POSTSUBSCRIPT , [ italic_g start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT ( italic_x ) ] start_POSTSUBSCRIPT + end_POSTSUBSCRIPT ⟩ + ∥ italic_x - italic_x start_POSTSUBSCRIPT italic_i , italic_t - 1 end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT }
i,t1=ft1(xt1),xi,t1xt1,ρi,t=ρi,t1eγi,t1i=1Nρi,t1eγi,t1formulae-sequencesubscript𝑖𝑡1subscript𝑓𝑡1subscript𝑥𝑡1subscript𝑥𝑖𝑡1subscript𝑥𝑡1subscript𝜌𝑖𝑡subscript𝜌𝑖𝑡1superscript𝑒𝛾subscript𝑖𝑡1superscriptsubscript𝑖1𝑁subscript𝜌𝑖𝑡1superscript𝑒𝛾subscript𝑖𝑡1\ell_{i,t-1}=\left\langle\partial f_{t-1}(x_{t-1}),\ x_{i,t-1}-x_{t-1}\right% \rangle,\ \rho_{i,t}=\dfrac{\rho_{i,t-1}e^{-\gamma\ell_{i,t-1}}}{\sum_{i=1}^{N% }\rho_{i,t-1}e^{-\gamma\ell_{i,t-1}}}roman_ℓ start_POSTSUBSCRIPT italic_i , italic_t - 1 end_POSTSUBSCRIPT = ⟨ ∂ italic_f start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT ) , italic_x start_POSTSUBSCRIPT italic_i , italic_t - 1 end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT ⟩ , italic_ρ start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT = divide start_ARG italic_ρ start_POSTSUBSCRIPT italic_i , italic_t - 1 end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT - italic_γ roman_ℓ start_POSTSUBSCRIPT italic_i , italic_t - 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_ρ start_POSTSUBSCRIPT italic_i , italic_t - 1 end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT - italic_γ roman_ℓ start_POSTSUBSCRIPT italic_i , italic_t - 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG (20)
xt=t=1Nρi,txi,tsubscript𝑥𝑡superscriptsubscript𝑡1𝑁subscript𝜌𝑖𝑡subscript𝑥𝑖𝑡x_{t}=\sum\nolimits_{t=1}^{N}\rho_{i,t}x_{i,t}italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_ρ start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT (21)
Remark 1 (Approximation)

ft(x)subscript𝑓𝑡𝑥f_{t}(x)italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x ) is approximated using the first-order Taylor expansion ft1(xt1),xsubscript𝑓𝑡1subscript𝑥𝑡1𝑥\left\langle\partial f_{t-1}(x_{t-1}),\ x\right\rangle⟨ ∂ italic_f start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT ) , italic_x ⟩. The term λtsubscript𝜆𝑡\lambda_{t}italic_λ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is substituted by a virtual queue Qi,t1subscript𝑄𝑖𝑡1Q_{i,t-1}italic_Q start_POSTSUBSCRIPT italic_i , italic_t - 1 end_POSTSUBSCRIPT. The constraint function gt(x)subscript𝑔𝑡𝑥g_{t}(x)italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x ) is replaced by the clipped constraint function [gt1(x)]+subscriptdelimited-[]subscript𝑔𝑡1𝑥\left[g_{t-1}(x)\right]_{+}[ italic_g start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT ( italic_x ) ] start_POSTSUBSCRIPT + end_POSTSUBSCRIPT. A regularization term |xxi,t1|2superscript𝑥subscript𝑥𝑖𝑡12\left|x-x_{i,t-1}\right|^{2}| italic_x - italic_x start_POSTSUBSCRIPT italic_i , italic_t - 1 end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT is added to ensure the convexity of the optimization problem and to enhance the convergence of the algorithm.

Remark 2 (Parallel Learning)

Determining the learning rate (step size) is important yet challenging. We assign different learning rates to the first two terms, αi,t1subscript𝛼𝑖𝑡1\alpha_{i,t-1}italic_α start_POSTSUBSCRIPT italic_i , italic_t - 1 end_POSTSUBSCRIPT and βi,t1subscript𝛽𝑖𝑡1\beta_{i,t-1}italic_β start_POSTSUBSCRIPT italic_i , italic_t - 1 end_POSTSUBSCRIPT. Rather than utilizing fixed or adaptive learning rates, we employ the expert-tracking algorithm proposed by [38], which computes xtsubscript𝑥𝑡x_{t}italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT in parallel with various learning rates as described in equation (19). The weights for each expert ρi,tsubscript𝜌𝑖𝑡\rho_{i,t}italic_ρ start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT are updated based on their empirical performance using an exponential function, as shown in equation (20).

Remark 3 (Virtual Queue Updates)

Based on our previous work [25], the dual variables of the long-term constraints remain fixed when the optimum does not reach the constraint bounds. However, when the optimum reaches these bounds, the dual variables increase, representing a penalty. The update of the virtual queue follows the same pattern as described in equation (18) to limit constraint violations.

Stage1: Offline Optimization
Input: Historical scenarios of RES and load 𝝃𝒔subscript𝝃𝒔\bm{\xi_{s}}bold_italic_ξ start_POSTSUBSCRIPT bold_italic_s end_POSTSUBSCRIPT
Output: Historical reference for hydrogen storage 𝑬sH,*superscriptsubscript𝑬𝑠H,*\bm{E}_{s}^{\text{H,*}}bold_italic_E start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H,* end_POSTSUPERSCRIPT
for S=1𝑆1S=1italic_S = 1 to N𝑁Nitalic_N  do
       Solve the deterministic MILP problems (P2subscriptP2\textbf{P}_{2}P start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT) ;
       as (4.3) to generate the SoC references of;
       hydrogen storage. ;
      
end for
Stage2: Online Optimization
Input: Historical reference for hydrogen storage 𝑬sH,*superscriptsubscript𝑬𝑠H,*\bm{E}_{s}^{\text{H,*}}bold_italic_E start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H,* end_POSTSUPERSCRIPT; ;
Real-time observation of RES and load 𝝃𝒕subscript𝝃𝒕\bm{\xi_{t}}bold_italic_ξ start_POSTSUBSCRIPT bold_italic_t end_POSTSUBSCRIPT
. Output: Real-Time Dispatch Decisions xtsubscript𝑥𝑡x_{t}italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT
Step 1 -Initialization
       Set Qi,t=0,xi,1𝑿,x1=i=1Nρi,1xi,1formulae-sequencesubscript𝑄𝑖𝑡0formulae-sequencesubscript𝑥𝑖1𝑿subscript𝑥1superscriptsubscript𝑖1𝑁subscript𝜌𝑖1subscript𝑥𝑖1Q_{i,t}=0,\ x_{i,1}\in\bm{X},\ x_{1}=\sum_{i=1}^{N}\rho_{i,1}x_{i,1}italic_Q start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT = 0 , italic_x start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT ∈ bold_italic_X , italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_ρ start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT,
      ρi,1=(M+1)/[i(i+1)M],i{1,2,,M}.formulae-sequencesubscript𝜌𝑖1𝑀1delimited-[]𝑖𝑖1𝑀for-all𝑖12𝑀\rho_{i,1}=(M+1)/[i(i+1)M],\ \forall i\in\{1,2,\cdotp\cdotp\cdotp,M\}.italic_ρ start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT = ( italic_M + 1 ) / [ italic_i ( italic_i + 1 ) italic_M ] , ∀ italic_i ∈ { 1 , 2 , ⋯ , italic_M } .
Step 2 - Reference Tracking & VQB-OCO
       for t=2𝑡2t=2italic_t = 2 to T𝑇Titalic_T  do
             Update real-time SoC reference as (15); ;
             for i=1𝑖1i=1italic_i = 1 to M𝑀Mitalic_Mparallel do
                   Update virtual queue Qi,tsubscript𝑄𝑖𝑡Q_{i,t}italic_Q start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT as (18); ;
                   Update decisions xi,tsubscript𝑥𝑖𝑡x_{i,t}italic_x start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT as (19); ;
                   Update weights ρi,tsubscript𝜌𝑖𝑡\rho_{i,t}italic_ρ start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT as (20). ;
                  
             end for
            Calculate the dispatch decision xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT as (21).
       end for
      
Algorithm 1 Prediction-Free Two-Stage Online Optimization Algorithm

(4) Performance of VQB-OCO

OCO focuses on the performance of regret (Reg), as defined in (22), where ytsubscript𝑦𝑡y_{t}italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is the global optimum. Various OCO algorithms ensure that Reg is a sublinear function of T by designing parameters and update policy, as it implies that the algorithm performs as well as the global optimum in hindsight as T approaches infinity. Next, we provide parameter settings and a proof to achieve strictly sublinear dynamic regret.

Reg=t=1T[ft(xt)ft(yt)]Regsuperscriptsubscript𝑡1𝑇delimited-[]subscript𝑓𝑡subscript𝑥𝑡subscript𝑓𝑡subscript𝑦𝑡\mathrm{Reg}=\sum\nolimits_{t=1}^{T}[f_{t}(x_{t})-f_{t}(y_{t})]roman_Reg = ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT [ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) - italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ] (22)

Assumption 1. The functions ftsubscript𝑓𝑡f_{t}italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and gtsubscript𝑔𝑡g_{t}italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT are convex. The feasible set 𝑿𝑿\bm{X}bold_italic_X is convex and closed, and it has a bounded diameter d(𝑿)𝑑𝑿d(\bm{X})italic_d ( bold_italic_X ), i.e.,

xyd(𝑿),x,y𝑿formulae-sequencenorm𝑥𝑦𝑑𝑿for-all𝑥𝑦𝑿\parallel x-y\parallel\leq d(\bm{X}),\ \forall x,y\in\bm{X}∥ italic_x - italic_y ∥ ≤ italic_d ( bold_italic_X ) , ∀ italic_x , italic_y ∈ bold_italic_X (23)

Assumption 2. There exists a positive constant F such that

ft(x)ft(y)F,gt(x)F,t𝛀𝑻,x,y𝑿formulae-sequencedelimited-∣∣subscript𝑓𝑡𝑥subscript𝑓𝑡𝑦𝐹formulae-sequencenormsubscript𝑔𝑡𝑥𝐹formulae-sequencefor-all𝑡subscript𝛀𝑻for-all𝑥𝑦𝑿\mid f_{t}(x)-f_{t}(y)\mid\leq F,\ \parallel g_{t}(x)\parallel\leq F,\ \forall t% \in\bm{\Omega_{T}},\ \forall x,y\in\bm{X}∣ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x ) - italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_y ) ∣ ≤ italic_F , ∥ italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x ) ∥ ≤ italic_F , ∀ italic_t ∈ bold_Ω start_POSTSUBSCRIPT bold_italic_T end_POSTSUBSCRIPT , ∀ italic_x , italic_y ∈ bold_italic_X (24)

Assumption 3. The subgradients ft(x)subscript𝑓𝑡𝑥\partial f_{t}(x)∂ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x ) and gt(x)subscript𝑔𝑡𝑥\partial g_{t}(x)∂ italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x ) exist. And there exists a positive constant G such that

ft(x)G,gt(x)G,t𝛀𝑻,x,y𝑿formulae-sequencenormsubscript𝑓𝑡𝑥𝐺formulae-sequencenormsubscript𝑔𝑡𝑥𝐺formulae-sequencefor-all𝑡subscript𝛀𝑻for-all𝑥𝑦𝑿\parallel\partial f_{t}(x)\parallel\leq G,\ \parallel\partial g_{t}(x)% \parallel\leq G,\ \forall t\in\bm{\Omega_{T}},\ \forall x,y\in\bm{X}∥ ∂ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x ) ∥ ≤ italic_G , ∥ ∂ italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x ) ∥ ≤ italic_G , ∀ italic_t ∈ bold_Ω start_POSTSUBSCRIPT bold_italic_T end_POSTSUBSCRIPT , ∀ italic_x , italic_y ∈ bold_italic_X (25)
Theorem 1

Given the assumptions 1–3, and parameters setting as (26), κ[0,c]𝜅0𝑐\kappa\in[0,c]italic_κ ∈ [ 0 , italic_c ], c(0,1)𝑐01c\in(0,1)italic_c ∈ ( 0 , 1 ), α0>0subscript𝛼00\alpha_{0}>0italic_α start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT > 0, β0>0subscript𝛽00\beta_{0}>0italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT > 0, and γ0(0,1/(2G))subscript𝛾0012𝐺\gamma_{0}\in(0,1/(\sqrt{2G}))italic_γ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ ( 0 , 1 / ( square-root start_ARG 2 italic_G end_ARG ) ) are constants. Then, we have the performance of Reg and Vio as (27).

M=κlog2(1+T)+1,αi,t=α02i1tc,βi,t=β0αi,t,γ=γ0Tcformulae-sequence𝑀𝜅subscript21𝑇1formulae-sequencesubscript𝛼𝑖𝑡subscript𝛼0superscript2𝑖1superscript𝑡𝑐formulae-sequencesubscript𝛽𝑖𝑡subscript𝛽0subscript𝛼𝑖𝑡𝛾subscript𝛾0superscript𝑇𝑐\displaystyle M=\lfloor\kappa\log_{2}(1+T)\rfloor+1,\ \alpha_{i,t}=\dfrac{% \alpha_{0}2^{i-1}}{t^{c}},\ \beta_{i,t}=\dfrac{\beta_{0}}{\sqrt{\alpha_{i,t}}}% ,\ \gamma=\dfrac{\gamma_{0}}{T^{c}}italic_M = ⌊ italic_κ roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + italic_T ) ⌋ + 1 , italic_α start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT = divide start_ARG italic_α start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_i - 1 end_POSTSUPERSCRIPT end_ARG start_ARG italic_t start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT end_ARG , italic_β start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT = divide start_ARG italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG square-root start_ARG italic_α start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT end_ARG end_ARG , italic_γ = divide start_ARG italic_γ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_T start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT end_ARG (26)
Reg=𝒪(Tc(1+Px)1κ+T1c(1+Px)κ)Reg𝒪superscript𝑇𝑐superscript1subscript𝑃𝑥1𝜅superscript𝑇1𝑐superscript1subscript𝑃𝑥𝜅\text{Reg}=\mathcal{O}(T^{c}(1+P_{x})^{1-\kappa}+T^{1-c}(1+P_{x})^{\kappa})Reg = caligraphic_O ( italic_T start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ( 1 + italic_P start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 1 - italic_κ end_POSTSUPERSCRIPT + italic_T start_POSTSUPERSCRIPT 1 - italic_c end_POSTSUPERSCRIPT ( 1 + italic_P start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_κ end_POSTSUPERSCRIPT ) (27)

Proof: The performance of the proposed OCO algorithm achieves a similar performance with [22] which achieves 𝒪(Tmax{1ac,c})𝒪superscript𝑇1𝑎𝑐𝑐\mathcal{O}(T^{\max\{1-a-c,c\}})caligraphic_O ( italic_T start_POSTSUPERSCRIPT roman_max { 1 - italic_a - italic_c , italic_c } end_POSTSUPERSCRIPT ) for dynamic regret with the help of prediction data. And it outperforms the performance of [20] and [6], achieving dynamic regret with a linear function of Pxsubscript𝑃𝑥P_{x}italic_P start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT, which is not satisfactory. Moreover, by setting κ=c=0.5𝜅𝑐0.5\kappa=c=0.5italic_κ = italic_c = 0.5, the proposed OCO algorithm achieve the performance of 𝒪(T(1+Px))𝒪𝑇1subscript𝑃𝑥\mathcal{O}(\sqrt{T(1+P_{x})})caligraphic_O ( square-root start_ARG italic_T ( 1 + italic_P start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ) end_ARG ), which aligns with the performance of [38] where long term constraints are not considered. Hence, the proposed OCO algorithm is no worse than the existing versions. We defer the complete proof to Appendix B.

5 Case studies

5.1 Set-up

The main parameters and configurations are listed in Table 2. Specifically, the capacities of the battery and hydrogen storage are half of the load capacity. The storage durations of the battery and hydrogen are 2 hours and 400 hours, respectively. The installed capacity of renewables is 200 kW, comprising an equal share of solar and wind. The cost coefficients can be found in [13].

We demonstrate the effectiveness of the proposed method based on two datasets: (1) We use the 15-minute historical data on solar, wind, and load from 2014 to 2023 obtained from Belgium’s transmission system operator (Elia) [8] for the baseline case study. (2) We also use the hourly historical data of wind and load from 1981 to 2020 in North China [12] to demonstrate the impact of data resolution and data quantity.

The optimization is coded in MatLab with Yalmip interface and solved by Gurobi 11.0 solver. The programming environment is Intel Core i9-13900HX @ 2.30GHz with RAM 32 GB.

Table 2: Parameters and configuration of the test microgrid.
Parameters Value Parameters Value Parameters Value
Initial SoC 0.5 P¯Hsuperscript¯𝑃H\overline{P}^{\text{H}}over¯ start_ARG italic_P end_ARG start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT 50 kW cLsuperscript𝑐Lc^{\text{L}}italic_c start_POSTSUPERSCRIPT L end_POSTSUPERSCRIPT $5/kWh
ηB,c/dsuperscript𝜂B,c/d\eta^{\text{B,c/d}}italic_η start_POSTSUPERSCRIPT B,c/d end_POSTSUPERSCRIPT 0.9 E¯Hsuperscript¯𝐸H\overline{E}^{\text{H}}over¯ start_ARG italic_E end_ARG start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT 20 MWh P¯Dsuperscript¯𝑃D\overline{P}^{\text{D}}over¯ start_ARG italic_P end_ARG start_POSTSUPERSCRIPT D end_POSTSUPERSCRIPT 50kW
ϵitalic-ϵ\epsilonitalic_ϵ 1%/month cBsuperscript𝑐Bc^{\text{B}}italic_c start_POSTSUPERSCRIPT B end_POSTSUPERSCRIPT $0.02/kWh Wind Capacity 100kW
P¯Bsuperscript¯𝑃B\overline{P}^{\text{B}}over¯ start_ARG italic_P end_ARG start_POSTSUPERSCRIPT B end_POSTSUPERSCRIPT 50 kW cHsuperscript𝑐Hc^{\text{H}}italic_c start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT $0.03/kWh Solar Capacity 100kW
E¯Bsuperscript¯𝐸B\overline{E}^{\text{B}}over¯ start_ARG italic_E end_ARG start_POSTSUPERSCRIPT B end_POSTSUPERSCRIPT 100kW cDsuperscript𝑐Dc^{\text{D}}italic_c start_POSTSUPERSCRIPT D end_POSTSUPERSCRIPT $0.3/kWh Load Capacity 100kW

5.2 Offline-stage optimization

(1) Data visualization

We first show the monthly average available renewable and load power of Elia from 2014 to 2023 in Figure 6. It is observed that all of them exhibit seasonal patterns. Wind power is abundant in spring and winter but scarce in summer, while solar power is relatively high in summer and extremely low in winter. Load power peaks in winter. Correspondingly, the net load also peaks in winter and hits a low in summer. Therefore, it indicates the critical role of hydrogen storage to address the seasonal variations in renewables and load, as well as to maintain the long-term energy balance of the microgrid.

Refer to caption
Figure 6: Data visualization on Elia dataset from 2014 to 2023: (a) monthly average load power, (b) monthly average solar power, and (c) monthly average wind power.

(2) Impact of hydrogen storage efficiency model

Next, we compare the offline energy management performance in 2023 with different hydrogen models, including:

(E1): Piecewise linear model as proposed in (6), and the parameters are fitted based on the experimental data as shown in Figure 2 and 3.

(E2): Constant efficiency model with both the highest charging and discharging efficiencies of 63%.

(E3): Constant efficiency model with the lowest charging and discharging efficiencies, i.e., 53% and 45%, respectively.

The hydrogen storage SoC is shown in Figure 7. Other results are also summarized in Table 3. It is observed that using the highest constant efficiency model results in the most optimistic performance, with the lowest operational cost ($97,188), whereas the lowest constant efficiency model yields the highest operational cost ($132,933) due to the significant increase in costs of diesel generation and loss of load. However, using a constant efficiency model can lead to feasibility issues in practical operation, resulting in losses in either charging or discharging power. Additionally, the optimistic strategy generated by the highest efficiency model will introduce an additional loss of load cost of 1.94 MWh. Considering the practical consequences, E2 and E3 will increase the total system costs by 10% and 36%, respectively, compared to E1. This result demonstrates that the proposed model can capture the characteristics of power-dependent efficiency and achieve more reliable and economical performance in practice.

Refer to caption
Figure 7: Hydrogen storage SoC over one representative year using different efficiency models.
Table 3: Yearly operational performance of the microgrid using different efficiency models.
Model CostT/PsuperscriptCostT/P\text{Cost}^{\text{T/P}}Cost start_POSTSUPERSCRIPT T/P end_POSTSUPERSCRIPT ($104)currency-dollarsuperscript104(\$10^{4})( $ 10 start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT ) PtD,T/PΔtsuperscriptsubscript𝑃𝑡D,T/PΔ𝑡\sum P_{t}^{\text{D,T/P}}\Delta t∑ italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT D,T/P end_POSTSUPERSCRIPT roman_Δ italic_t (MWh)MWh(\text{MWh})( MWh ) PtL,T/PΔtsuperscriptsubscript𝑃𝑡L,T/PΔ𝑡\sum P_{t}^{\text{L,T/P}}\Delta t∑ italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT L,T/P end_POSTSUPERSCRIPT roman_Δ italic_t (MWh)MWh(\text{MWh})( MWh ) ΔPtH,cΔtΔsuperscriptsubscript𝑃𝑡H,cΔ𝑡\sum\Delta P_{t}^{\text{H,c}}\Delta t∑ roman_Δ italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H,c end_POSTSUPERSCRIPT roman_Δ italic_t (MWh)MWh(\text{MWh})( MWh ) ΔPtH,dΔtΔsuperscriptsubscript𝑃𝑡H,dΔ𝑡\sum\Delta P_{t}^{\text{H,d}}\Delta t∑ roman_Δ italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H,d end_POSTSUPERSCRIPT roman_Δ italic_t (MWh)MWh(\text{MWh})( MWh )
E1 9.87/9.87 324.68/324.68 0.00/0.00 0.00 0.00
E2 9.81/10.78 322.19/322.19 0.00/1.94 0.00 -1.94
E3 13.29/13.29 374.33/374.19 3.85/3.85 -0.14 0.00

5.3 Online-stage optimization

(1) Reference tracking

We test the reference tracking performance in 2023 with different methods, including:

(R1): Global optimal reference generated by deterministic multi-period optimization with perfect knowledge of uncertainty realizations.

(R2): The proposed data-driven reference tracking, trained with 2014-2022 historical data.

(R3): The proposed data-driven reference tracking, trained with 2014-2022 historical data and AI-generated data. The AI-generated data is produced by randomly reducing the historical solar power but increasing the wind power by 10%-50% each quarter.

(R4-R6): Similar to the (R3) method, the AI-generated data is produced by: - (R4): Randomly reducing both the historical solar and wind power by 10%-50% each quarter. - (R5): Increasing the historical solar and wind power by 10%-50% each quarter. - (R6): Using all the AI-generated data from (R3) to (R5).

(R7): The reference generated using the average historical performance.

The hydrogen SoC references are compared in Figure 8, and the tracking performance is summarized in Table 4. The root mean square error (RMSE) is calculated as the average difference between the generated reference and the global optimal reference. The optimal choice of σ𝜎\sigmaitalic_σ obtained through the bisection method is 0.098. It is observed that the references generated by the proposed methods R2-R6 can better track the seasonal variations of RES and load, resulting in lower RMSE compared to the reference generated by R7. This is because the proposed methods employ kernel regression to update the weights of historical references instead of using fixed and average values. Additionally, additional generated data inputs will increase the tracking performance as they create new potential extreme scenarios. However, as is shown in the performance of R6, an excessive amount of generated data can lead to overfitting in the regression model, thereby reducing tracking accuracy. The average computation time for a single time interval is around 2 ms, which is acceptable even for minute-level scheduling and control.

Refer to caption
Figure 8: Hydrogen storage SoC references using different reference tracking methods.
Table 4: Reference tracking performance with different reference tracking methods.
Method R1 R2 R3 R4 R5 R6 R7
RMSE —— 8.41% 8.96% 8.25% 8.01% 8.33% 10.54%
Time (ms) —— 0.35 1.93 1.88 2.01 6.27 0.00
Data Size (Year) —— 9 45 45 45 116 9

Furthermore, we test the reference tracking performance on the North China dataset. The data visualization and reference tracking performance are shown in Figure 13 in Appendix C. Compared with the Elia dataset, both the wind and load data exhibit less variation across seasons and years. The maximum variations across years are 0.22 and 0.09 for wind and load, respectively, while for the Elia dataset, they are 0.25 and 0.15 for wind and load, respectively. Specifically, the load data in North China maintains the same shape across the years. The optimal choice of σ𝜎\sigmaitalic_σ obtained through the bisection method is 50. The proposed tracking method performs the same as the averaged method, with an RMSE of 0.046. This is because the historical data and historical references show significant similarity across years, causing the proposed method to select the average value when updating weights. The above results demonstrate the benefit of using a data-driven reference tracking method when historical uncertainties exhibit significant variations across years. Additionally, an appropriate amount of AI-generated data can improve adaptability to extreme weather scenarios.

(2) Online decision-making

We test the online decision-making in 2023 with different dispatch methods, including:

(M0): Deterministic optimization with perfect knowledge of uncertainty realizations for the whole year results in the global optimum. This method is optimistic and not applicable in practice. Hence, it only serves as a baseline.

(M1): The proposed prediction-free coordinated approach, which utilizes OCO for online optimization and R5 for reference tracking.

(M2): The scheduling-correction method proposed by [13], which utilizes MPC for online optimization and R5 for reference tracking.

(M3): Online optimization by OCO without reference tracking.

(M4): Online optimization by MPC without reference tracking.

The operational performance is summarized in Table 5. It is observed that the proposed method M1 outperforms the others in terms of cost-effectiveness, achieving an optimality gap of 27% compared with M0. This is due to its smallest loss of load and RMSE compared to the reference. Additionally, M1 and M2 obtain much better economic performance than M3 and M4. This can be explained by the hydrogen SoC as shown in Figure 9. M3 and M4 generate myopic decisions by continuously discharging the hydrogen storage to reduce short-term operational costs during the winter peak. They fail to charge the hydrogen storage during the renewable-rich spring and summer, resulting in an extremely low SoC after winter. Consequently, these myopic decisions prevent hydrogen storage from effectively shifting energy seasonally, leading to a substantial loss of load and low utilization of RES in practice. In contrast, M1 and M2 follow the pattern of reference while M1 has the better reference following performance (lower RMSE) since OCO utilizes the real-time observed data. This result demonstrates the benefit of introducing a global reference for the online optimization method. We also apply SDP algorithm in [4] to this problem. However, the SDP cannot converge within 24 hr due to the “curse of dimensionality”. Therefore, it is infeasible to use SDP for long-term energy management of microgrid with H-BES.

Refer to caption
Figure 9: Hydrogen storage SoC strategies in Elia using different optimization methods.

Moreover, we compare the power dispatch strategies of H-BES and DG using M1 and M2, as shown in Figure 10. It is observed that M1 can better track the net load curve using only hydrogen storage actions. In contrast, M2 keeps charging hydrogen storage and uses DG when renewables are insufficient. This is because the OCO-based method simultaneously tracks the previous decisions and the reference, updating the strategy based on newly observed data, which is more adaptive to the time-varying environment. While the MPC-based method only tracks the reference and updates the strategy based on forecast data. Therefore, if the reference or forecast is not accurate, the MPC-based method may struggle to achieve good performance. This can also be explained by SoC gaps as shown in Figure 9. Furthermore, due to prediction errors, MPC-based online optimization may encounter infeasibility issues, resulting in additional loss of load and penalty costs. In contrast, the OCO-based method makes decisions based on observed data, thereby avoiding infeasibility issues. Regarding computational efficiency, it can be seen that the OCO method has better performance than MPC. Both methods achieve single-step optimization in tens of ms, which is acceptable for most online optimization scenarios.

Refer to caption
Figure 10: Comparison of power dispatch strategies with different optimization methods: (a) M1 and (b) M2.
Table 5: Yearly operational performance of the microgrid in Elia using different optimization methods.
Method Cost ($104)currency-dollarsuperscript104(\$10^{4})( $ 10 start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT ) PtDΔtsuperscriptsubscript𝑃𝑡DΔ𝑡\sum P_{t}^{\text{D}}\Delta t∑ italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT D end_POSTSUPERSCRIPT roman_Δ italic_t (MWh)MWh(\text{MWh})( MWh ) PtLΔtsuperscriptsubscript𝑃𝑡LΔ𝑡\sum P_{t}^{\text{L}}\Delta t∑ italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT L end_POSTSUPERSCRIPT roman_Δ italic_t (MWh)MWh(\text{MWh})( MWh ) RMSE Time (ms)
M0 9.87 324.68 0.00 0.00 25.20
M1 12.55 336.30 4.59 10.46 87.87
M2 17.38 488.46 5.19 19.70 97.77
M3 29.43 257.77 43.19 53.19 48.59
M4 38.05 457.58 48.45 52.93 51.64

Additional tests on the North China dataset align with the above results, showing that optimization with reference outperforms optimization without reference, and OCO-based methods outperform MPC-based methods. The results are summarized in Table 7 and Figure 14 in Appendix C. Compared with the case results in Elia, both M1 and M2 achieve SoC strategies with smaller gaps from M0, due to better reference tracking performance. However, in terms of cost and reliability performance, the North China case shows worse results. This is because renewable energy in the North China case is solely supplied by wind power, leading to insufficient generation and higher load curtailment.

5.4 Sensitivity analysis

In this subsection, we further investigate the key impact factor of the proposed optimization framework.

(1) Penalty Coefficient of Reference Tracking. The penalty coefficient represents the tradeoff between instant operational cost and reference tracking performance. However, since this reference is estimated from historical data, it may not be optimal for the current year. We compare the cost and tracking performance in Figure 11 when scaling up the penalty coefficient φ𝜑\varphiitalic_φ. It is observed that RMSE is monotonically decreased with the penalty coefficient, while the operational cost initially decreases to a minimum value at φ=90000𝜑90000\varphi=90000italic_φ = 90000 but then gradually increases with the penalty coefficient. This suggests that improving tracking performance does not always lead to lower operational costs.

Refer to caption
Figure 11: Reliability performance of microgrid with different renewable capacity using M1 and M2 methods.

(2) Reference and step size of OCO Algorithm. As illustrated in Section 5.3, the reference has a critical impact on operational performance. Table 6 summarizes the performance using M1 with different references (i.e., fixed reference generated by R7 and updated reference generated by R5) and different step sizes (i.e., fixed step size proposed by [22] and step size generated by the proposed expert-tracking algorithm). It is observed that compared to fixed reference, using the proposed updated reference reduces operational cost by 1.09%-1.70% and the loss of load by 5.68%-8.52%. This highlights the importance of finding the right reference for hydrogen storage. Additionally, the proposed step size setting decreases the operational cost by 0.67%-1.29% and the loss of load by 0.06%-3.16%. This is because the step size generated by expert-tracking can better adapt to the changing cost function and avoid heuristic settings.

Table 6: Yearly operational performance of the microgrid in Elia using M1 with different references and step sizes.
Reference step size CostT/PsuperscriptCostT/P\text{Cost}^{\text{T/P}}Cost start_POSTSUPERSCRIPT T/P end_POSTSUPERSCRIPT ($104)currency-dollarsuperscript104(\$10^{4})( $ 10 start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT ) PtD,T/PΔtsuperscriptsubscript𝑃𝑡D,T/PΔ𝑡\sum P_{t}^{\text{D,T/P}}\Delta t∑ italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT D,T/P end_POSTSUPERSCRIPT roman_Δ italic_t (MWh)MWh(\text{MWh})( MWh ) PtL,T/PΔtsuperscriptsubscript𝑃𝑡L,T/PΔ𝑡\sum P_{t}^{\text{L,T/P}}\Delta t∑ italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT L,T/P end_POSTSUPERSCRIPT roman_Δ italic_t (MWh)MWh(\text{MWh})( MWh ) RMSE
Updated Fixed 12.55 336.30 4.60 10.87
Expert-Tracking 12.47 333.66 4.60 10.99
Fixed Fixed 12.77 336.33 5.03 9.26
Expert-Tracking 12.61 333.58 4.87 9.26

(3) Sizing of Renewables. We further compare the reliability performance of M1 and M2 in Figure 12 when scaling up renewable capacity. It is observed that the loss of load decreases with increased renewable capacity. Moreover, M1 achieves an acceptable reliability level with twice the renewable capacity, while M2 requires at least 4 times the renewable capacity to meet reliability requirements. This demonstrates the benefit of the proposed method in reducing renewable planning costs.

Refer to caption
Figure 12: Reliability performance of microgrid with different renewable capacity using M1 and M2 methods.

6 Conclusion

This paper proposes a prediction-free coordinated optimization framework for long-term energy management of microgrid with H-BES. To accurately captures the power-dependent efficiency of hydrogen storage, we propose an approximate semi-empirical hydrogen storage model using piecewise linear relaxation. Moreover, to address the long-term operational patterns of renewables and load and to eliminate dependence on predictions, we introduce a prediction-free, two-stage coordinated optimization framework. The key idea is to generate and track the SoC reference of hydrogen storage using historical scenarios and kernel regression to avoid myopic online decisions. And online decisions are made based on the proposed VQB-OCO algorithm by leveraging feedback control policy and newly observed data. Case studies on Elia and North China verify that:

(1) Compared to the constant efficiency model, the proposed approximation model avoids both overly optimistic and overly conservative strategies.

(2) The proposed optimization framework outperforms existing online optimization methods and achieves an acceptable gap compared to deterministic optimization with perfect foresight of uncertainties.

(3) The SoC reference of hydrogen storage is critical to overall performance. Therefore, more reliable historical or AI-generated scenarios, along with sophisticated techniques for setting penalty coefficients, are highly required.

(4) The OCO algorithm typically lacks a global view and is sensitive to step size settings. Thus, it is beneficial to incorporate a penalty term for long-term pattern tracking and expert-tracking for step size updates.

Further work will focus on addressing constraints violations in the OCO algorithm and extending the proposed framework to operations in a market environment.

Appendix

Appendix A Proof of storage priority

The marginal discharge cost of battery is $10/MWh-$30/MWh, while it is $1/kg-$2/kg ($30/MWh-$60/MWh) for hydrogen storage, which is much higher than battery. Assume we have an optimal discharge power from H-BES, which is entirely supplied by the battery, i.e., x1=PtB,dsuperscript𝑥1superscriptsubscript𝑃𝑡B,dx^{\text{1}}=P_{t}^{\text{B,d}}italic_x start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT = italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT B,d end_POSTSUPERSCRIPT. Considering another optimum where the discharge power is a mix from both the battery and hydrogen storage, i.e., x2=ρPtB,d+(1ρ)PtH,dsuperscript𝑥2𝜌superscriptsubscript𝑃𝑡B,d1𝜌superscriptsubscript𝑃𝑡H,dx^{\text{2}}=\rho P_{t}^{\text{B,d}}+(1-\rho)P_{t}^{\text{H,d}}italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = italic_ρ italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT B,d end_POSTSUPERSCRIPT + ( 1 - italic_ρ ) italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H,d end_POSTSUPERSCRIPT. From the H-BES cost of the two optima (28) and the marginal cost & efficiency of two types of storages, we can draw the conclusion that CtH-BES,2>CtH-BES 1superscriptsubscript𝐶𝑡H-BES,2superscriptsubscript𝐶𝑡H-BES 1C_{t}^{\text{H-BES,2}}>C_{t}^{\text{H-BES 1}}italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H-BES,2 end_POSTSUPERSCRIPT > italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H-BES 1 end_POSTSUPERSCRIPT. This indicates that hydrogen storage will not be activated until the battery is fully discharged or charged. Hence, we finish the proof.

PtB,d=ρPtB,d+(1ρ)PtH,dsuperscriptsubscript𝑃𝑡B,d𝜌superscriptsubscript𝑃𝑡B,d1𝜌superscriptsubscript𝑃𝑡H,d\displaystyle P_{t}^{\text{B,d}}=\rho P_{t}^{\text{B,d}}+(1-\rho)P_{t}^{\text{% H,d}}italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT B,d end_POSTSUPERSCRIPT = italic_ρ italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT B,d end_POSTSUPERSCRIPT + ( 1 - italic_ρ ) italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H,d end_POSTSUPERSCRIPT (28a)
CtH-BES,1=cBPtB,dΔtsuperscriptsubscript𝐶𝑡H-BES,1superscript𝑐Bsuperscriptsubscript𝑃𝑡B,dΔ𝑡\displaystyle C_{t}^{\text{H-BES,1}}=c^{\text{B}}P_{t}^{\text{B,d}}\Delta titalic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H-BES,1 end_POSTSUPERSCRIPT = italic_c start_POSTSUPERSCRIPT B end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT B,d end_POSTSUPERSCRIPT roman_Δ italic_t (28b)
CtH-BES,2=(cBρPtB,d+cH(1ρ)PtH,d)Δtsuperscriptsubscript𝐶𝑡H-BES,2superscript𝑐B𝜌superscriptsubscript𝑃𝑡B,dsuperscript𝑐H1𝜌superscriptsubscript𝑃𝑡H,dΔ𝑡\displaystyle C_{t}^{\text{H-BES,2}}=(c^{\text{B}}\rho P_{t}^{\text{B,d}}+c^{% \text{H}}(1-\rho)P_{t}^{\text{H,d}})\Delta titalic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H-BES,2 end_POSTSUPERSCRIPT = ( italic_c start_POSTSUPERSCRIPT B end_POSTSUPERSCRIPT italic_ρ italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT B,d end_POSTSUPERSCRIPT + italic_c start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ( 1 - italic_ρ ) italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H,d end_POSTSUPERSCRIPT ) roman_Δ italic_t (28c)

Appendix B Proof of bounded dynamic regret

Let {xi,t}subscript𝑥𝑖𝑡\{x_{i,t}\}{ italic_x start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT } and {xt}subscript𝑥𝑡\{x_{t}\}{ italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT } be the sequences generated by Algorithm 1. Let {yt}subscript𝑦𝑡\{y_{t}\}{ italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT } be a global optimum in the feasible set 𝑿𝑿\bm{X}bold_italic_X. From ftsubscript𝑓𝑡f_{t}italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is convex and (25), we have:

ft(xi,t)ft(yt)ft(xi,t),xi,tytsubscript𝑓𝑡subscript𝑥𝑖𝑡subscript𝑓𝑡subscript𝑦𝑡subscript𝑓𝑡subscript𝑥𝑖𝑡subscript𝑥𝑖𝑡subscript𝑦𝑡\displaystyle f_{t}(x_{i,t})-f_{t}(y_{t})\leq\left\langle\partial f_{t}(x_{i,t% }),\ x_{i,t}-y_{t}\right\rangleitalic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT ) - italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ≤ ⟨ ∂ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT ) , italic_x start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⟩ (29)
Gxi,txi,t+1+ft(xi,t),xi,t+1ytabsent𝐺normsubscript𝑥𝑖𝑡subscript𝑥𝑖𝑡1subscript𝑓𝑡subscript𝑥𝑖𝑡subscript𝑥𝑖𝑡1subscript𝑦𝑡\displaystyle\leq G\parallel x_{i,t}-x_{i,t+1}\parallel+\left\langle\partial f% _{t}(x_{i,t}),\ x_{i,t+1}-y_{t}\right\rangle≤ italic_G ∥ italic_x start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT ∥ + ⟨ ∂ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT ) , italic_x start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⟩
G2αi,t2+12αi,txi,txi,t+12+ft(xi,t),xi,t+1ytabsentsuperscript𝐺2subscript𝛼𝑖𝑡212subscript𝛼𝑖𝑡superscriptnormsubscript𝑥𝑖𝑡subscript𝑥𝑖𝑡12subscript𝑓𝑡subscript𝑥𝑖𝑡subscript𝑥𝑖𝑡1subscript𝑦𝑡\displaystyle\leq\frac{G^{2}\alpha_{i,t}}{2}+\frac{1}{2\alpha_{i,t}}\parallel x% _{i,t}-x_{i,t+1}\parallel^{2}+\left\langle\partial f_{t}(x_{i,t}),\ x_{i,t+1}-% y_{t}\right\rangle≤ divide start_ARG italic_G start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG + divide start_ARG 1 end_ARG start_ARG 2 italic_α start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT end_ARG ∥ italic_x start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ⟨ ∂ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT ) , italic_x start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⟩

For the rightmost term of (29), we have:

ft(xi,t),xi,t+1yt=βi,t+1([gt(xi,t+1)]+)TQi,t,ytxi,t+1+ft(xi,t)+βi,t+1([gt(xi,t+1)]+)TQi,t,xi,t+1ytmissing-subexpressionsubscript𝑓𝑡subscript𝑥𝑖𝑡subscript𝑥𝑖𝑡1subscript𝑦𝑡missing-subexpressionabsentsubscript𝛽𝑖𝑡1superscriptsubscriptdelimited-[]subscript𝑔𝑡subscript𝑥𝑖𝑡1𝑇subscript𝑄𝑖𝑡subscript𝑦𝑡subscript𝑥𝑖𝑡1missing-subexpressionsubscript𝑓𝑡subscript𝑥𝑖𝑡subscript𝛽𝑖𝑡1superscriptsubscriptdelimited-[]subscript𝑔𝑡subscript𝑥𝑖𝑡1𝑇subscript𝑄𝑖𝑡subscript𝑥𝑖𝑡1subscript𝑦𝑡\displaystyle\begin{aligned} &\left\langle\partial f_{t}(x_{i,t}),\ x_{i,t+1}-% y_{t}\right\rangle\\ &=\left\langle\beta_{i,t+1}(\partial[g_{t}(x_{i,t+1})]_{+})^{T}Q_{i,t},\ y_{t}% -x_{i,t+1}\right\rangle\\ &+\left\langle\partial f_{t}(x_{i,t})+\beta_{i,t+1}(\partial[g_{t}(x_{i,t+1})]% _{+})^{T}Q_{i,t},\ x_{i,t+1}-y_{t}\right\rangle\end{aligned}start_ROW start_CELL end_CELL start_CELL ⟨ ∂ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT ) , italic_x start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⟩ end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = ⟨ italic_β start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT ( ∂ [ italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT ) ] start_POSTSUBSCRIPT + end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_Q start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT ⟩ end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + ⟨ ∂ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT ) + italic_β start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT ( ∂ [ italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT ) ] start_POSTSUBSCRIPT + end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_Q start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⟩ end_CELL end_ROW (30)

Since gtsubscript𝑔𝑡g_{t}italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is a convex function, it is trivial to show that [gt]+subscriptdelimited-[]subscript𝑔𝑡[g_{t}]_{+}[ italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT + end_POSTSUBSCRIPT is also convex; hence the first term of (30) can be relaxed:

βt+1([gt(xi,t+1)]+)TQi,t,ytxi,t+1subscript𝛽𝑡1superscriptsubscriptdelimited-[]subscript𝑔𝑡subscript𝑥𝑖𝑡1𝑇subscript𝑄𝑖𝑡subscript𝑦𝑡subscript𝑥𝑖𝑡1\displaystyle\left\langle\beta_{t+1}(\partial[g_{t}(x_{i,t+1})]_{+})^{T}Q_{i,t% },\ y_{t}-x_{i,t+1}\right\rangle⟨ italic_β start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ( ∂ [ italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT ) ] start_POSTSUBSCRIPT + end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_Q start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT ⟩ (31)
βt+1Qi,t,[gt(yt)]+βi,t+1Qi(t),[gt(xi,t+1)]+absentsubscript𝛽𝑡1subscript𝑄𝑖𝑡subscriptdelimited-[]subscript𝑔𝑡subscript𝑦𝑡subscript𝛽𝑖𝑡1subscript𝑄𝑖𝑡subscriptdelimited-[]subscript𝑔𝑡subscript𝑥𝑖𝑡1\displaystyle\leq\beta_{t+1}\left\langle Q_{i,t},\ [g_{t}(y_{t})]_{+}\right% \rangle-\beta_{i,t+1}\left\langle Q_{i}(t),\ [g_{t}(x_{i,t+1})]_{+}\right\rangle≤ italic_β start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ⟨ italic_Q start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT , [ italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ] start_POSTSUBSCRIPT + end_POSTSUBSCRIPT ⟩ - italic_β start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT ⟨ italic_Q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) , [ italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT ) ] start_POSTSUBSCRIPT + end_POSTSUBSCRIPT ⟩

From Lemma 1 in [35], we have:

ft(xi,t)+βi,t+1([gt(xi,t+1)]+)TQi,t,xi,t+1ytsubscript𝑓𝑡subscript𝑥𝑖𝑡subscript𝛽𝑖𝑡1superscriptsubscriptdelimited-[]subscript𝑔𝑡subscript𝑥𝑖𝑡1𝑇subscript𝑄𝑖𝑡subscript𝑥𝑖𝑡1subscript𝑦𝑡\displaystyle\left\langle\partial f_{t}(x_{i,t})+\beta_{i,t+1}(\partial[g_{t}(% x_{i,t+1})]_{+})^{T}Q_{i,t},\ x_{i,t+1}-y_{t}\right\rangle⟨ ∂ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT ) + italic_β start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT ( ∂ [ italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT ) ] start_POSTSUBSCRIPT + end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_Q start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⟩ (32)
1αi,t(ytxi,t2ytxi,t+12xi,t+1xi,t2)absent1subscript𝛼𝑖𝑡superscriptnormsubscript𝑦𝑡subscript𝑥𝑖𝑡2superscriptnormsubscript𝑦𝑡subscript𝑥𝑖𝑡12superscriptnormsubscript𝑥𝑖𝑡1subscript𝑥𝑖𝑡2\displaystyle\leq\frac{1}{\alpha_{i,t}}(\parallel y_{t}-x_{i,t}\parallel^{2}-% \parallel y_{t}-x_{i,t+1}\parallel^{2}-\parallel x_{i,t+1}-x_{i,t}\parallel^{2})≤ divide start_ARG 1 end_ARG start_ARG italic_α start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT end_ARG ( ∥ italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - ∥ italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - ∥ italic_x start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT )

Combining (20), (29)-(32), we have:

t(xi,t)t(yt)subscript𝑡subscript𝑥𝑖𝑡subscript𝑡subscript𝑦𝑡\displaystyle\ell_{t}(x_{i,t})-\ell_{t}(y_{t})roman_ℓ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT ) - roman_ℓ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) G2αi,t2+1α(ytxi,t2ytxi,t+12)absentsuperscript𝐺2subscript𝛼𝑖𝑡21𝛼superscriptnormsubscript𝑦𝑡subscript𝑥𝑖𝑡2superscriptnormsubscript𝑦𝑡subscript𝑥𝑖𝑡12\displaystyle\leq\frac{G^{2}\alpha_{i,t}}{2}+\frac{1}{\alpha}(\parallel y_{t}-% x_{i,t}\parallel^{2}-\parallel y_{t}-x_{i,t+1}\parallel^{2})≤ divide start_ARG italic_G start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG + divide start_ARG 1 end_ARG start_ARG italic_α end_ARG ( ∥ italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - ∥ italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) (33)
+βt+1Qi(t),[gt(yt)]+subscript𝛽𝑡1subscript𝑄𝑖𝑡subscriptdelimited-[]subscript𝑔𝑡subscript𝑦𝑡\displaystyle+\beta_{t+1}\left\langle Q_{i}(t),\ [g_{t}(y_{t})]_{+}\right\rangle+ italic_β start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ⟨ italic_Q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) , [ italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ] start_POSTSUBSCRIPT + end_POSTSUBSCRIPT ⟩

Since the last term of (33) is non-negative, we have:

t=1T(t(xi,t)t(yt))t=1TG2αi,t2superscriptsubscript𝑡1𝑇subscript𝑡subscript𝑥𝑖𝑡subscript𝑡subscript𝑦𝑡superscriptsubscript𝑡1𝑇superscript𝐺2subscript𝛼𝑖𝑡2\displaystyle\sum_{t=1}^{T}(\ell_{t}(x_{i,t})-\ell_{t}(y_{t}))\leq\sum_{t=1}^{% T}\frac{G^{2}\alpha_{i,t}}{2}∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( roman_ℓ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT ) - roman_ℓ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ) ≤ ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT divide start_ARG italic_G start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG (34)
+t=1T1αi,t(ytxi,t2ytxi,t+12)superscriptsubscript𝑡1𝑇1subscript𝛼𝑖𝑡superscriptnormsubscript𝑦𝑡subscript𝑥𝑖𝑡2superscriptnormsubscript𝑦𝑡subscript𝑥𝑖𝑡12\displaystyle+\sum_{t=1}^{T}\frac{1}{\alpha_{i,t}}(\parallel y_{t}-x_{i,t}% \parallel^{2}-\parallel y_{t}-x_{i,t+1}\parallel^{2})+ ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_α start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT end_ARG ( ∥ italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - ∥ italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT )

For the first term of (34), we have:

t=1TG2αi,t22i1G22t=1T1tc2i1G22(1c)T1csuperscriptsubscript𝑡1𝑇superscript𝐺2subscript𝛼𝑖𝑡2superscript2𝑖1superscript𝐺22superscriptsubscript𝑡1𝑇1superscript𝑡𝑐superscript2𝑖1superscript𝐺221𝑐superscript𝑇1𝑐\displaystyle\sum_{t=1}^{T}\frac{G^{2}\alpha_{i,t}}{2}\leq\frac{2^{i-1}G^{2}}{% 2}\sum_{t=1}^{T}\frac{1}{t^{c}}\leq\frac{2^{i-1}G^{2}}{2(1-c)}T^{1-c}∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT divide start_ARG italic_G start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG ≤ divide start_ARG 2 start_POSTSUPERSCRIPT italic_i - 1 end_POSTSUPERSCRIPT italic_G start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_t start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT end_ARG ≤ divide start_ARG 2 start_POSTSUPERSCRIPT italic_i - 1 end_POSTSUPERSCRIPT italic_G start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 ( 1 - italic_c ) end_ARG italic_T start_POSTSUPERSCRIPT 1 - italic_c end_POSTSUPERSCRIPT (35)

By leveraging (23) and update policy (26), we have:

t=1Ttcα02i1(ytxi,t2ytxi,t+12)superscriptsubscript𝑡1𝑇superscript𝑡𝑐subscript𝛼0superscript2𝑖1superscriptnormsubscript𝑦𝑡subscript𝑥𝑖𝑡2superscriptnormsubscript𝑦𝑡subscript𝑥𝑖𝑡12\displaystyle\sum_{t=1}^{T}\frac{t^{c}}{\alpha_{0}2^{i-1}}\left(\|y_{t}-x_{i,t% }\|^{2}-\|y_{t}-x_{i,t+1}\|^{2}\right)∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT divide start_ARG italic_t start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT end_ARG start_ARG italic_α start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_i - 1 end_POSTSUPERSCRIPT end_ARG ( ∥ italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - ∥ italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) (36)
=1α02i1t=1T(tcytxi,t2(t+1)cyt+1xi,t+12\displaystyle=\frac{1}{\alpha_{0}2^{i-1}}\sum_{t=1}^{T}\big{(}t^{c}\|y_{t}-x_{% i,t}\|^{2}-(t+1)^{c}\|y_{t+1}-x_{i,t+1}\|^{2}= divide start_ARG 1 end_ARG start_ARG italic_α start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_i - 1 end_POSTSUPERSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( italic_t start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ∥ italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - ( italic_t + 1 ) start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ∥ italic_y start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT
+(t+1)cyt+1xi,t+12tcytxi,t+12superscript𝑡1𝑐superscriptnormsubscript𝑦𝑡1subscript𝑥𝑖𝑡12superscript𝑡𝑐superscriptnormsubscript𝑦𝑡subscript𝑥𝑖𝑡12\displaystyle+(t+1)^{c}\|y_{t+1}-x_{i,t+1}\|^{2}-t^{c}\|y_{t}-x_{i,t+1}\|^{2}+ ( italic_t + 1 ) start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ∥ italic_y start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_t start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ∥ italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT
+tcytxi,t+12tcytxi,t2)\displaystyle+t^{c}\|y_{t}-x_{i,t+1}\|^{2}-t^{c}\|y_{t}-x_{i,t}\|^{2}\big{)}+ italic_t start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ∥ italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i , italic_t + 1 end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_t start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ∥ italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT )
1α02i1y1xi,12+1α02i1t=1T((t+1)ctc)(d(𝕏))2absent1subscript𝛼0superscript2𝑖1superscriptnormsubscript𝑦1subscript𝑥𝑖121subscript𝛼0superscript2𝑖1superscriptsubscript𝑡1𝑇superscript𝑡1𝑐superscript𝑡𝑐superscript𝑑𝕏2\displaystyle\leq\frac{1}{\alpha_{0}2^{i-1}}\|y_{1}-x_{i,1}\|^{2}+\frac{1}{% \alpha_{0}2^{i-1}}\sum_{t=1}^{T}\left((t+1)^{c}-t^{c}\right)(d(\mathbb{X}))^{2}≤ divide start_ARG 1 end_ARG start_ARG italic_α start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_i - 1 end_POSTSUPERSCRIPT end_ARG ∥ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + divide start_ARG 1 end_ARG start_ARG italic_α start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_i - 1 end_POSTSUPERSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( ( italic_t + 1 ) start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT - italic_t start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ) ( italic_d ( blackboard_X ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT
+2α02i1t=1Ttcd(𝑿)yt+1yt2subscript𝛼0superscript2𝑖1superscriptsubscript𝑡1𝑇superscript𝑡𝑐𝑑𝑿normsubscript𝑦𝑡1subscript𝑦𝑡\displaystyle+\frac{2}{\alpha_{0}2^{i-1}}\sum_{t=1}^{T}t^{c}d(\bm{X})\|y_{t+1}% -y_{t}\|+ divide start_ARG 2 end_ARG start_ARG italic_α start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_i - 1 end_POSTSUPERSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_t start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT italic_d ( bold_italic_X ) ∥ italic_y start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥
1α02i1(1+(T+1)c1)(d(𝑿))2+2Tcd(𝑿)Pxα02i1absent1subscript𝛼0superscript2𝑖11superscript𝑇1𝑐1superscript𝑑𝑿22superscript𝑇𝑐𝑑𝑿subscript𝑃𝑥subscript𝛼0superscript2𝑖1\displaystyle\leq\frac{1}{\alpha_{0}2^{i-1}}\left(1+(T+1)^{c}-1\right)(d(\bm{X% }))^{2}+\frac{2T^{c}d(\bm{X})P_{x}}{\alpha_{0}2^{i-1}}≤ divide start_ARG 1 end_ARG start_ARG italic_α start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_i - 1 end_POSTSUPERSCRIPT end_ARG ( 1 + ( italic_T + 1 ) start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT - 1 ) ( italic_d ( bold_italic_X ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + divide start_ARG 2 italic_T start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT italic_d ( bold_italic_X ) italic_P start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_ARG start_ARG italic_α start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_i - 1 end_POSTSUPERSCRIPT end_ARG
2α02i1(d(𝑿))2Tc(1+Pxd(𝑿))absent2subscript𝛼0superscript2𝑖1superscript𝑑𝑿2superscript𝑇𝑐1subscript𝑃𝑥𝑑𝑿\displaystyle\leq\frac{2}{\alpha_{0}2^{i-1}}(d(\bm{X}))^{2}T^{c}\left(1+\frac{% P_{x}}{d(\bm{X})}\right)≤ divide start_ARG 2 end_ARG start_ARG italic_α start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT italic_i - 1 end_POSTSUPERSCRIPT end_ARG ( italic_d ( bold_italic_X ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_T start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ( 1 + divide start_ARG italic_P start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_ARG start_ARG italic_d ( bold_italic_X ) end_ARG )

Let i0=12log2(1+Pxd(𝑿))+1[N]subscript𝑖012subscript21subscript𝑃𝑥𝑑𝑿1delimited-[]𝑁i_{0}=\left\lfloor\frac{1}{2}\log_{2}(1+\frac{P_{x}}{d(\bm{X})})\right\rfloor+% 1\in[N]italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = ⌊ divide start_ARG 1 end_ARG start_ARG 2 end_ARG roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + divide start_ARG italic_P start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_ARG start_ARG italic_d ( bold_italic_X ) end_ARG ) ⌋ + 1 ∈ [ italic_N ], such that we have:

2i011+Pxd(𝑿)2i0.superscript2subscript𝑖011subscript𝑃𝑥𝑑𝑿superscript2subscript𝑖0\displaystyle 2^{i_{0}-1}\leq\sqrt{1+\frac{P_{x}}{d(\bm{X})}}\leq 2^{i_{0}}.2 start_POSTSUPERSCRIPT italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - 1 end_POSTSUPERSCRIPT ≤ square-root start_ARG 1 + divide start_ARG italic_P start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_ARG start_ARG italic_d ( bold_italic_X ) end_ARG end_ARG ≤ 2 start_POSTSUPERSCRIPT italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT . (37)

Combining (35)-(37) yields:

t=1T(t(xi0,t)t(yt))superscriptsubscript𝑡1𝑇subscript𝑡subscript𝑥subscript𝑖0𝑡subscript𝑡subscript𝑦𝑡\displaystyle\sum_{t=1}^{T}(\ell_{t}(x_{i_{0},t})-\ell_{t}(y_{t}))∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( roman_ℓ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_t end_POSTSUBSCRIPT ) - roman_ℓ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ) 4α0(d(𝑿))2Tc(1+Pxd(𝑿))1κabsent4subscript𝛼0superscript𝑑𝑿2superscript𝑇𝑐superscript1subscript𝑃𝑥𝑑𝑿1𝜅\displaystyle\leq\frac{4}{\alpha_{0}}(d(\bm{X}))^{2}T^{c}\left(1+\frac{P_{x}}{% d(\bm{X})}\right)^{1-\kappa}≤ divide start_ARG 4 end_ARG start_ARG italic_α start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG ( italic_d ( bold_italic_X ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_T start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ( 1 + divide start_ARG italic_P start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_ARG start_ARG italic_d ( bold_italic_X ) end_ARG ) start_POSTSUPERSCRIPT 1 - italic_κ end_POSTSUPERSCRIPT (38)
+G2α02(1c)T1c(1+Pxd(𝑿))κsuperscript𝐺2subscript𝛼021𝑐superscript𝑇1𝑐superscript1subscript𝑃𝑥𝑑𝑿𝜅\displaystyle+\frac{G^{2}\alpha_{0}}{2(1-c)}T^{1-c}\left(1+\frac{P_{x}}{d(\bm{% X})}\right)^{\kappa}+ divide start_ARG italic_G start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG 2 ( 1 - italic_c ) end_ARG italic_T start_POSTSUPERSCRIPT 1 - italic_c end_POSTSUPERSCRIPT ( 1 + divide start_ARG italic_P start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_ARG start_ARG italic_d ( bold_italic_X ) end_ARG ) start_POSTSUPERSCRIPT italic_κ end_POSTSUPERSCRIPT

Applying Lemma 1 in reference [38] to (20) and (21) yields:

t=1Tt(xt)mini[N]{t=1Tt(xi,t)+1γln1ρi,1}γ(Gd(𝑿))2T2superscriptsubscript𝑡1𝑇subscript𝑡subscript𝑥𝑡subscript𝑖delimited-[]𝑁superscriptsubscript𝑡1𝑇subscript𝑡subscript𝑥𝑖𝑡1𝛾1subscript𝜌𝑖1𝛾superscript𝐺𝑑𝑿2𝑇2\displaystyle\sum_{t=1}^{T}\ell_{t}(x_{t})-\min_{i\in[N]}\{\sum_{t=1}^{T}\ell_% {t}(x_{i,t})+\frac{1}{\gamma}\ln\frac{1}{\rho_{i,1}}\}\leq\frac{\gamma(Gd(\bm{% X}))^{2}T}{2}∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) - roman_min start_POSTSUBSCRIPT italic_i ∈ [ italic_N ] end_POSTSUBSCRIPT { ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT ) + divide start_ARG 1 end_ARG start_ARG italic_γ end_ARG roman_ln divide start_ARG 1 end_ARG start_ARG italic_ρ start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT end_ARG } ≤ divide start_ARG italic_γ ( italic_G italic_d ( bold_italic_X ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_T end_ARG start_ARG 2 end_ARG (39)
t=1T(t(xt)t(xi0,t))γ0(Gd(𝑿))2T1c2+1γ0Tcln1ρi0,1superscriptsubscript𝑡1𝑇subscript𝑡subscript𝑥𝑡subscript𝑡subscript𝑥subscript𝑖0𝑡subscript𝛾0superscript𝐺𝑑𝑿2superscript𝑇1𝑐21subscript𝛾0superscript𝑇𝑐1subscript𝜌subscript𝑖01\displaystyle\sum_{t=1}^{T}(\ell_{t}(x_{t})-\ell_{t}(x_{i_{0},t}))\leq\frac{% \gamma_{0}(Gd(\bm{X}))^{2}T^{1-c}}{2}+\frac{1}{\gamma_{0}}T^{c}\ln\frac{1}{% \rho_{i_{0},1}}∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( roman_ℓ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) - roman_ℓ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_t end_POSTSUBSCRIPT ) ) ≤ divide start_ARG italic_γ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G italic_d ( bold_italic_X ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_T start_POSTSUPERSCRIPT 1 - italic_c end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG + divide start_ARG 1 end_ARG start_ARG italic_γ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG italic_T start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT roman_ln divide start_ARG 1 end_ARG start_ARG italic_ρ start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , 1 end_POSTSUBSCRIPT end_ARG (40)

From ρi,1=(M+1)/[i(i+1)M]subscript𝜌𝑖1𝑀1delimited-[]𝑖𝑖1𝑀\rho_{i,1}=(M+1)/[i(i+1)M]italic_ρ start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT = ( italic_M + 1 ) / [ italic_i ( italic_i + 1 ) italic_M ], we have:

ln1ρi0,1ln(i0(i0+1))2ln(i0+1)2ln(κlog2(1+Pxd(X)))1subscript𝜌subscript𝑖01subscript𝑖0subscript𝑖012subscript𝑖012𝜅subscript21subscript𝑃𝑥𝑑𝑋\displaystyle\ln\frac{1}{\rho_{i_{0},1}}\leq\ln(i_{0}(i_{0}+1))\leq 2\ln(i_{0}% +1)\leq 2\ln(\left\lfloor\kappa\log_{2}(1+\frac{P_{x}}{d(X)})\right\rfloor)roman_ln divide start_ARG 1 end_ARG start_ARG italic_ρ start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , 1 end_POSTSUBSCRIPT end_ARG ≤ roman_ln ( italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 1 ) ) ≤ 2 roman_ln ( italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 1 ) ≤ 2 roman_ln ( ⌊ italic_κ roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + divide start_ARG italic_P start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_ARG start_ARG italic_d ( italic_X ) end_ARG ) ⌋ ) (41)

From (20) and that ftsubscript𝑓𝑡f_{t}italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is convex, we have

ft(xt)ft(yt)t(xt)t(yt)subscript𝑓𝑡subscript𝑥𝑡subscript𝑓𝑡subscript𝑦𝑡subscript𝑡subscript𝑥𝑡subscript𝑡subscript𝑦𝑡\displaystyle f_{t}(x_{t})-f_{t}(y_{t})\leq\ell_{t}(x_{t})-\ell_{t}(y_{t})italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) - italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ≤ roman_ℓ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) - roman_ℓ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) (42)

Combining (38)-(42) yields:

Reg4α0(d(𝑿))2Tc(1+Pxd(𝑿))1κ+γ0(Gd(𝑿))2T1c2Reg4subscript𝛼0superscript𝑑𝑿2superscript𝑇𝑐superscript1subscript𝑃𝑥𝑑𝑿1𝜅subscript𝛾0superscript𝐺𝑑𝑿2superscript𝑇1𝑐2\displaystyle\text{Reg}\leq\frac{4}{\alpha_{0}}(d(\bm{X}))^{2}T^{c}\left(1+% \frac{P_{x}}{d(\bm{X})}\right)^{1-\kappa}+\frac{\gamma_{0}(Gd(\bm{X}))^{2}T^{1% -c}}{2}Reg ≤ divide start_ARG 4 end_ARG start_ARG italic_α start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG ( italic_d ( bold_italic_X ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_T start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ( 1 + divide start_ARG italic_P start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_ARG start_ARG italic_d ( bold_italic_X ) end_ARG ) start_POSTSUPERSCRIPT 1 - italic_κ end_POSTSUPERSCRIPT + divide start_ARG italic_γ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_G italic_d ( bold_italic_X ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_T start_POSTSUPERSCRIPT 1 - italic_c end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG (43)
+G2α02(1c)T1c(1+Pxd(𝑿))κ+2γ0Tcln([κlog2(1+Pxd(𝑿))])superscript𝐺2subscript𝛼021𝑐superscript𝑇1𝑐superscript1subscript𝑃𝑥𝑑𝑿𝜅2subscript𝛾0superscript𝑇𝑐delimited-[]𝜅subscript21subscript𝑃𝑥𝑑𝑿\displaystyle+\frac{G^{2}\alpha_{0}}{2(1-c)}T^{1-c}\left(1+\frac{P_{x}}{d(\bm{% X})}\right)^{\kappa}+\frac{2}{\gamma_{0}}T^{c}\ln([\kappa\log_{2}\left(1+\frac% {P_{x}}{d(\bm{X})}\right)])+ divide start_ARG italic_G start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG 2 ( 1 - italic_c ) end_ARG italic_T start_POSTSUPERSCRIPT 1 - italic_c end_POSTSUPERSCRIPT ( 1 + divide start_ARG italic_P start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_ARG start_ARG italic_d ( bold_italic_X ) end_ARG ) start_POSTSUPERSCRIPT italic_κ end_POSTSUPERSCRIPT + divide start_ARG 2 end_ARG start_ARG italic_γ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG italic_T start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT roman_ln ( [ italic_κ roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + divide start_ARG italic_P start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_ARG start_ARG italic_d ( bold_italic_X ) end_ARG ) ] )

Hence, we finish the proof.

Appendix C Results on North China Dataset

Refer to caption
Figure 13: Data visualization and reference tracking on North China dataset: (a) monthly average wind power, (b) monthly average load power, and (c) hydrogen storage SoC reference.
Table 7: Yearly operational performance of the microgrid in North China using different optimization methods.
Method Cost ($105)currency-dollarsuperscript105(\$10^{5})( $ 10 start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT ) PtDΔtsuperscriptsubscript𝑃𝑡DΔ𝑡\sum P_{t}^{\text{D}}\Delta t∑ italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT D end_POSTSUPERSCRIPT roman_Δ italic_t (MWh)MWh(\text{MWh})( MWh ) PtLΔtsuperscriptsubscript𝑃𝑡LΔ𝑡\sum P_{t}^{\text{L}}\Delta t∑ italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT L end_POSTSUPERSCRIPT roman_Δ italic_t (MWh)MWh(\text{MWh})( MWh ) RMSE Time (ms)
M0 5.30 411.78 80.60 0.00 24.28
M1 11.74 424.92 208.85 0.08 78.28
M2 15.49 257.52 1289.70 0.10 85.39
M3 13.35 349.87 245.83 53.19 35.46
M4 23.24 217.91 2104.74 52.93 58.31
Refer to caption
Figure 14: Hydrogen storage SoC strategies in North China using different optimization methods.
\printcredits

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Data availability

The original data can be downloaded from [8] and [12].

Acknowledgements

This work is partly supported by the National Science Foundation under award ECCS-2239046 and partly supported by the Columbia University SEAS Interdisciplinary Research Seed (SIRS) fund. Ning Qi graciously acknowledges special funding from the China Postdoctoral Science Foundation (No.2023TQ0169).

References

  • Alzahrani et al. [2023] Alzahrani, A., Sajjad, K., Hafeez, G., Murawwat, S., Khan, S., Khan, F.A., 2023. Real-time energy optimization and scheduling of buildings integrated with renewable microgrid. Applied Energy 335, 120640.
  • Amphlett et al. [1995] Amphlett, J.C., Baumert, R., Mann, R.F., Peppley, B.A., Roberge, P.R., Harris, T.J., 1995. Performance modeling of the ballard mark iv solid polymer electrolyte fuel cell: Ii. empirical model development. Journal of the Electrochemical Society 142, 9.
  • Baumhof et al. [2023] Baumhof, M.T., Raheli, E., Johnsen, A.G., Kazempour, J., 2023. Optimization of hybrid power plants: When is a detailed electrolyzer model necessary?, in: 2023 IEEE Belgrade PowerTech, IEEE. pp. 1–10.
  • Darivianakis et al. [2017] Darivianakis, G., Eichler, A., Smith, R.S., Lygeros, J., 2017. A data-driven stochastic optimization approach to the seasonal storage energy management. IEEE control systems letters 1, 394–399.
  • Dey et al. [2023] Dey, B., Misra, S., Marquez, F.P.G., 2023. Microgrid system energy management with demand response program for clean and economical operation. Applied Energy 334, 120717.
  • Ding et al. [2021] Ding, X., Chen, L., Zhou, P., Xu, Z., Wen, S., Lui, J.C., Jin, H., 2021. Dynamic online convex optimization with long-term constraints via virtual queue. Information Sciences 577, 140–161.
  • Eghbali et al. [2022] Eghbali, N., Hakimi, S.M., Hasankhani, A., Derakhshan, G., Abdi, B., 2022. Stochastic energy management for a renewable energy based microgrid considering battery, hydrogen storage, and demand response. Sustainable Energy, Grids and Networks 30, 100652.
  • Elia [2024] Elia, 2024. Historical data from elia. URL: https://www.elia.be/en/grid-data.
  • Fan et al. [2021] Fan, F., Zhang, R., Xu, Y., Ren, S., 2021. Robustly coordinated operation of an emission-free microgrid with hybrid hydrogen-battery energy storage. CSEE Journal of Power and Energy Systems 8, 369–379.
  • Feng and Wei [2024] Feng, S., Wei, W., 2024. Hybrid energy storage sizing in energy hubs: A continuous spectrum splitting approach. Energy 300, 131504.
  • Gui et al. [2021] Gui, J., Sun, Z., Wen, Y., Tao, D., Ye, J., 2021. A review on generative adversarial networks: Algorithms, theory, and applications. IEEE transactions on knowledge and data engineering 35, 3313–3332.
  • Guo [2024] Guo, Z., 2024. Historical data from north china. URL: https://github.com/ZhongjieGuo/Seasonal-data.
  • Guo et al. [2023] Guo, Z., Wei, W., Bai, J., Mei, S., 2023. Long-term operation of isolated microgrids with renewables and hybrid seasonal-battery storage. Applied Energy 349, 121628.
  • Hajiaghasi et al. [2019] Hajiaghasi, S., Salemnia, A., Hamzeh, M., 2019. Hybrid energy storage system for microgrids applications: A review. Journal of Energy Storage 21, 543–570.
  • Hu et al. [2022] Hu, C., Cai, Z., Zhang, Y., Yan, R., Cai, Y., Cen, B., 2022. A soft actor-critic deep reinforcement learning method for multi-timescale coordinated operation of microgrids. Protection and Control of Modern Power Systems 7, 29.
  • Jansen et al. [2021] Jansen, G., Dehouche, Z., Corrigan, H., 2021. Cost-effective sizing of a hybrid regenerative hydrogen fuel cell energy storage system for remote & off-grid telecom towers. International Journal of Hydrogen Energy 46, 18153–18166.
  • Kim and Giannakis [2016] Kim, S.J., Giannakis, G.B., 2016. An online convex optimization approach to real-time energy pricing for demand response. IEEE Transactions on Smart Grid 8, 2784–2793.
  • Li et al. [2015] Li, Z., Guo, Q., Sun, H., Wang, J., 2015. Sufficient conditions for exact relaxation of complementarity constraints for storage-concerned economic dispatch. IEEE Transactions on Power Systems 31, 1653–1654.
  • Li et al. [2021] Li, Z., Wu, L., Xu, Y., Moazeni, S., Tang, Z., 2021. Multi-stage real-time operation of a multi-energy microgrid with electrical and thermal energy storage assets: A data-driven mpc-adp approach. IEEE Transactions on Smart Grid 13, 213–226.
  • Liu et al. [2022] Liu, Q., Wu, W., Huang, L., Fang, Z., 2022. Simultaneously achieving sublinear regret and constraint violations for online convex optimization with time-varying constraints. ACM SIGMETRICS Performance Evaluation Review 49, 4–5.
  • Mariam et al. [2016] Mariam, L., Basu, M., Conlon, M.F., 2016. Microgrid: Architecture, policy and future trends. Renewable and Sustainable Energy Reviews 64, 477–489.
  • Muthirayan et al. [2022] Muthirayan, D., Yuan, J., Khargonekar, P.P., 2022. Online convex optimization with long-term constraints for predictable sequences. IEEE Control Systems Letters 7, 979–984.
  • Pang et al. [2023] Pang, K., Wang, C., Hatziargyriou, N.D., Wen, F., 2023. Microgrid formation and real-time scheduling of active distribution networks considering source-load stochasticity. IEEE Transactions on Power Systems .
  • Qi et al. [2023] Qi, N., Pinson, P., Almassalkhi, M.R., Cheng, L., Zhuang, Y., 2023. Chance-constrained generic energy storage operations under decision-dependent uncertainty. IEEE Trans. on Sustainable Energy 14, 2234–2248.
  • Qi et al. [2024] Qi, N., Zheng, N., Xu, B., 2024. Chance-constrained energy storage pricing for social welfare maximization. arXiv preprint arXiv:2407.07068 .
  • Qiu et al. [2023] Qiu, Y., Li, Q., Ai, Y., Chen, W., Benbouzid, M., Liu, S., Gao, F., 2023. Two-stage distributionally robust optimization-based coordinated scheduling of integrated energy system with electricity-hydrogen hybrid energy storage. Protection and Control of Modern Power Systems 8, 1–14.
  • Sánchez et al. [2018] Sánchez, M., Amores, E., Rodríguez, L., Clemente-Jul, C., 2018. Semi-empirical model and experimental validation for the performance evaluation of a 15 kw alkaline water electrolyzer. International Journal of Hydrogen Energy 43, 20332–20345.
  • Shi et al. [2015] Shi, W., Li, N., Chu, C.C., Gadh, R., 2015. Real-time energy management in microgrids. IEEE Transactions on Smart Grid 8, 228–238.
  • Shuai et al. [2018] Shuai, H., Fang, J., Ai, X., Tang, Y., Wen, J., He, H., 2018. Stochastic optimization of economic dispatch for microgrid based on approximate dynamic programming. IEEE Transactions on Smart Grid 10, 2440–2452.
  • Solanki et al. [2018] Solanki, B.V., Cañizares, C.A., Bhattacharya, K., 2018. Practical energy management systems for isolated microgrids. IEEE Transactions on Smart Grid 10, 4762–4775.
  • Trifkovic et al. [2013] Trifkovic, M., Sheikhzadeh, M., Nigim, K., Daoutidis, P., 2013. Modeling and control of a renewable hybrid energy system with hydrogen storage. IEEE Transactions on Control Systems Technology 22, 169–179.
  • Ulleberg [2003] Ulleberg, Ø., 2003. Modeling of advanced alkaline electrolyzers: a system simulation approach. International journal of hydrogen energy 28, 21–33.
  • Wang et al. [2023] Wang, Z., Wei, W., Pang, J.Z.F., Liu, F., Yang, B., Guan, X., Mei, S., 2023. Online optimization in power systems with high penetration of renewable generation: Advances and prospects. IEEE/CAA Journal of Automatica Sinica 10, 839–858.
  • Yan et al. [2023] Yan, D., Huang, S., Chen, Y., 2023. Real-time feedback based online aggregate ev power flexibility characterization. IEEE Transactions on Sustainable Energy .
  • Yi et al. [2020] Yi, X., Li, X., Xie, L., Johansson, K.H., 2020. Distributed online convex optimization with time-varying coupled inequality constraints. IEEE Transactions on Signal Processing 68, 731–746.
  • Yi et al. [2022] Yi, X., Li, X., Yang, T., Xie, L., Chai, T., Johansson, K.H., 2022. Regret and cumulative constraint violation analysis for distributed online constrained convex optimization. IEEE Transactions on Automatic Control 68, 2875–2890.
  • Yu et al. [2022] Yu, L., Xu, Z., Guan, X., Zhao, Q., Dou, C., Yue, D., 2022. Joint optimization and learning approach for smart operation of hydrogen-based building energy systems. IEEE Transactions on Smart Grid 14, 199–216.
  • Zhang et al. [2018] Zhang, L., Lu, S., Zhou, Z.H., 2018. Adaptive online learning in dynamic environments. Advances in neural information processing systems 31.
  • Zhao et al. [2020] Zhao, T., Parisio, A., Milanović, J.V., 2020. Distributed control of battery energy storage systems for improved frequency regulation. IEEE Transactions on Power Systems 35, 3729–3738.
  • Zheng and Cai [2014] Zheng, L., Cai, L., 2014. A distributed demand response control strategy using lyapunov optimization. IEEE Transactions on Smart Grid 5, 2075–2083.
  • Zheng et al. [2023] Zheng, N., Qin, X., Wu, D., Murtaugh, G., Xu, B., 2023. Energy storage state-of-charge market model. IEEE Transactions on Energy Markets, Policy and Regulation 1, 11–22.
  • Zhou et al. [2023] Zhou, S., Han, Y., Zalhaf, A.S., Chen, S., Zhou, T., Yang, P., Elboshy, B., 2023. A novel multi-objective scheduling model for grid-connected hydro-wind-pv-battery complementary system under extreme weather: A case study of sichuan, china. Renewable Energy 212, 818–833.
  • Zhu et al. [2020] Zhu, D., Yang, B., Liu, Q., Ma, K., Zhu, S., Ma, C., Guan, X., 2020. Energy trading in microgrids for synergies among electricity, hydrogen and heat networks. Applied Energy 272, 115225.