1. Introduction and Related Works
In the context of the perspicacious decrease of fossil fuel resources and ongoing increase of energy consumption innate to the intensification of human urban activities during the last decades, the management of energy consumption in commercial and residential buildings has become a vital question. Regarding the works of [
1] and [
2], in the USA, the contribution of energy consumption in space heating was responsible for 43 percent in residential buildings in 2015, and in commercial buildings, this contribution was about 25 percent in 2012. This shows the huge slice of energy consumption related to space heating in the above-mentioned two sections. The recent enhancement in smart building energy management systems (SBEMSs) or smart building management systems (SBMS) by controlling and reducing the above-mentioned share of energy consumption is becoming the most efficient trend for facing energy consumption growth in residential and commercial buildings.
Smart building energy management systems (SBEMSs) in smart dwellings provide the inhabitants with advanced monitoring and control of the building’s functions and a clever way to manage heavy power-consuming appliances (as heating devices) in order to achieve energy efficiency while optimizing and preserving the inhabitants’ (or users’) comfortable environment [
3].
Although the sensors’ quality and the technological features of the remote devices forming the physical part of the automated or smart buildings play an undeniable role in the performance of SBEMSs in optimizing the building’s energy consumption, the primary inefficiency of such systems in declining energy consumption is related to the quality of the models that bear either the identification of the relationship between the building’s behavior and the controller that hatches up the actions of implemented sensors and remote devices or to the excellence of the control strategy in charge of the building’s behavior control. Thus, the identification and modeling of the building’s operational dynamics remain key points in BMSs and especially in SBEMSs. On the other hand, the diversity of the involved factors (parameters) as well as their highly nonlinear variation make the identification and modeling of the dynamic behavior of a building a challenging task. Within this context and by considering that besides the living space’s intrinsic structural features, the occupation of the living space (by users or residents) may affect the model of heating dynamics of the concerned living space, we have investigated the design, implementation, and validation of a data-driven machine learning-based identifier supplied by the time-series prediction paradigm’s formalism. In fact, the human body continuously produces thermal energy, mostly in the form of heat radiation emission. Regarding black body law, a human in a sitting position and at about 1.80 m in height can emit 100 watts [
4,
5,
6].
A number of works address model-free approaches coping with buildings’ heating. Related to conventional controllers, the authors of [
7] introduced a control heating system for supporting the heating comfort of the user based on a very simple thermostatic controller (operating on an “on/off” strategy) with the help of a microcontroller. When the temperature is higher than the desired temperature, the fan will turn on, and when the temperature is lower than the desired temperature, the heater will turn on. The proposed simplistic control of the space heating operates on the difference between the desired temperature and actual temperature, and could be seen as a model-free heating approach. While taking advantage of its independency from the effective complexity of the concerned edifice’s hitting-dynamics, the proposed strategy is applicable to very specific homogenious living spaces, and cannot be generalized to more sophisticated buildings including heterogeneous living-spaces. In the work of [
8], the investigator presents a gray-box methodology for thermal modelling of buildings. Gray-box modelling is a hybrid of data-driven and physics-based models, where coefficients of the equations from physics-based models are ajusted using data. The authors claim that the proposed methodology allows to capture the dynamics of the buildings while avoiding the effective complexity of the physics-based modelling, and results in simpler models. In fact, after first developing the individual components of the building such as temperature evolution, flow controller, and so on, the authors integrate these individual models into what they call the “complete gray-box model” of the building. The model has been validated using data collected from one of the buildings at Luleå, a city on the coast of northern Sweden. While using a simpler and generic model (compared with the physics-based complex heating models), the proposed approach remains far from convincing concerning its generalization to the other buildings.
The investigators of [
9] propose a model-free and sensor-free
heating, ventilation and air-conditioning (HVAC) control algorithm that uses simple user input (hot/cold) and adapts to changing office occupancy or ambient temperature in real time. As an alternative, the proposed strategy includes users in the HVAC control loop through distributed smart-phone based votes about their thermal comfort for aggregated control of HVAC. The developed iterative data fusion algorithm finds the optimal temperature in offices with multiple users and addresse techniques that can aggressively save energy by drifting indoor temperatures towards the outdoor temperature. The evaluation has been based on empirical data collected in 12 offices over a three-week period and showed that the proposed control may save up to 60% of energy at a relatively small increase in average occupant discomfort of 0.3 °C. While the idea is appealing, the concerned technique here also is very specific.
The control systems designed in [
7,
8,
9] operate without any pre-knowledge of the living spaces that they are supposed to heat. In other words, the proposed solutions are based exclusively on data provided by temperature sensors within the frame of specific edifices for which the model of heating-dynamics is available. This makes the proposed models and issued controllers specific to the considered case studies, and thus not applicable to other structures (i.e., other buildings).
On the basis of the above-mentioned points, in the present article, we focus on the design and implementation of a data-driven machine learning-based identification of the building’s living-space dynamic heating conduct, taking into account the occupancy (by the residents) of the heated space. This step is necessary for pulling off a comprehensive (i.e., interpretable) model handling the dynamic heating conduct of a living space with and without human presence. The proposed data-driven machine learning-based identifier will be applied for modeling the dynamic heating conduct of a real five-floor building’s living spaces located at Senart Campus of University Paris-Est Créteil, taking into account their occupancy (by users of this public building).
From a general standpoint, identification approaches are divided into two main categories: white-box modeling (WBS) and black-box modeling (BBS) [
10]. In WBS-based methods, the modeling of a system is performed on the basis of the formal relationship of the physical properties of the concerned system. If the main advantage of WBS-based methods remains their comprehensive and interpretable nature, however, often the effective complexity of real-world conditions causes WBS to lead to insolvable equations, and hence frequently to a strongly simplified issued model, making it quite far from the realistic behavior of the target system. In BBS-based methods, the modeling is done by mapping of an approximate behavior of the target system through the input–output relationship of that system. In contrast to WBS, if BBS-based methods achieve more accurate approximation of the effective complexity of the modeled system’s behavior, often they lead to a shortfall of comprehensive and interpretable foundation related to the issued model.
Numerous research works have been accomplished in the past decades within the areas of identification and modeling of nonlinear systems related to our purpose. Wiener and Hammerstein-type models [
11], Volterra series [
12], and machine-learning based approaches such as fuzzy logic-based models [
13] and artificial neural network-based approaches [
14] have been presented. The authors of [
15] identify a solar heating system utilizing BBS based on what they call the “recursive prediction error method” (RPEM). It is on the basis of a state-space model. The target system (namely a solar heater) includes two inputs (solar radiation energy and speed of the fan) and one output (air’s temperature). They claim that the small amount of data necessary for the proposed approach is an advantage. However, the related simplicity of the target system and complicated expected behavior identification do not persuade the extendibility of the proposed approach to a realistic system including a large number of parameters (inputs and outputs).
In the work of [
16], the identification of a heating system is done by investigation by means of an auto-regressive (ARX) model, auto-regressive and moving average (ARMAX) model, and Box–Jenkins (BJ) model. The target system includes a lamp and a metallic plate. It contains just one input (the lamp’s voltage) and one output (the metallic plate’s temperature). For the aforesaid case study, the authors used the system identification toolbox of MATLAB. However, the relative simplicity of the target system does not allow assessing the effectuality of the considered approach. It just presents that MATLAB’s system identification toolbox is able to imitate this uncomplicated case study example. Similarly, the authors of [
17] used MATLAB’s identification toolbox for identification of the behavior of a boiler and heat exchanger transfer function. Nevertheless, the stated result does not end up with the accuracy of the target system identification. It results in a tough target device modeling. The authors of [
18] provide the consequences of a dwelling’s thermal model identification. It includes two bedrooms heated by electrical baseboard heaters. Owing to the modeling of the target system, the authors used EnergyPlus (software for simulating the building energy system providing functional modeling of energy consumption for heating, cooling, ventilation, and lighting in buildings). The control signal was simulated by MATLAB. The Building Controls Virtual Test Bed open-source software (of Berkley Lab. [
19]) is a free, available co-simulation software linking different simulation programs as EnergyPlus, Modelica (an object-oriented language for complex systems’ simulation [
20]), and MATLAB/Simulink. In the account of the approximating dynamic of the system in Energy Plus, a low order state-space model is utilized. Concerning the identification of the system, they used N4SID subspace identification [
21]. The authors in this investigation end up with a satisfactory average root-mean-square-error (RMSE) throughout ten reported simulated apartments. Nonetheless, they concluded that the time-consuming implementation makes it difficult to extend the proposed approach to more complicated systems.
The aforesaid investigations put emphasis on the pertinence of identification approaches for the modeling of buildings’ heating dynamics. Indeed, all of the referenced investigations underline the tough limitations of the overviewed solutions in matching the complex behavior of space heating systems in buildings. The main shortages are either related to the eager simplification of the actual operative complexity of involved equations, in order to ease their computational solutions, or inherent to the nonlinearity and outsized number of the involved parameters. If the analysis of the aforementioned research works highlights the diversity of the covered fields and applications, they confirm what we mentioned before related to the advantages and shortages of each category (i.e., WBS-like and BBS-like) of identification-based nonlinear systems’ modeling approaches. Meanwhile, the overviewed research works reveal the appealing capacity of the nonlinear autoregressive exogenous (NARX) model in modeling and forecasting complex systems’ behaviors. In fact, the proposed modeling strategy takes advantage, on the one hand, from the forecasting capacity of the time-series of the NARX model, and on the other hand, from the multi-layer perceptron’s (MLP) learning and generalization skills. If the NARX model has already been used for modeling in various paradigms, the originality of its application in the present article concerns its usage, and especially its closed-loop version, in the uninterrupted (i.e., continual) identification of the heating dynamics within a fully data-driven context. However, the additional novelties of the reported investigations, on the one hand, relate to the application of the aforementioned model for solving real-world problems addressing complex behaviors, and on the other hand, concern the effective implementation of the developed system by the use of standard technology (i.e., market available), overcoming complex technological obstacles.
Section 2 of this article presents the method and concepts of the proposed data-driven identification approach.
Section 3 details the implementation of the issued method on SBEMS of the above-mentioned five-floor experimental building. The experimental setup, the experimental protocol, and the obtained results are presented and discussed. Finally,
Section 4 concludes the article.
2. Machine Learning-Based Identification of the Heating Dynamics of the Living Space
Before bestowing the proposed living space heating dynamics identification approach, we consider the following work hypothesis relating to the identification strategy:
- -
The concerned living space is supposed to be part of a typical building including various quarters (such as flats and rooms for a residential building or working spaces, office rooms, classrooms, and practice rooms for a public building, and so on).
- -
The building is supposed to be heated by a central heater supplying radiators located in the aforementioned living spaces.
- -
The regulation is supposed to be done by a conventional controller adjusting the radiators’ valves versus the magnitude of the outdoor temperature and the target (i.e., desired) indoor temperature.
- -
The target model considers the system to be identified as an overall system including the heat transmitters (radiators) and the heating space.
- -
The concerned living space is supposed to contain an amount of (with ) occupants (residents or users). corresponds to an empty living space, while characterizes a fully occupied living space. Thus, corresponds to maximum capacity of the living space and is determined according to the construction norms and occupation regulations.
Thus, within the aforementioned work hypothesis, the considered parameters are as follows: “Valve-position at time t” (denoted by ), providing the heated water’s flow (expressed as a normalized ratio of debit versus the maximum debit of the valve); “Outdoor-Temperature measured at time t” (denoted by , expressed in °C); “Indoor-Temperature measured at time t” (denoted by , expressed in °C); and “Occupancy-Rate at time t” (denoted by , expressed in %).
As mentioned in the introductory section, the identification method of the proposed system is accomplished by an MLP-based NARX ([
22,
23,
24]) with a feed-forward back-propagation learning algorithm ([
25,
26]). Equation (1) and (2) respectively specify the overall open-loop NARX model, where
is the activation function of the ANN,
is the estimated (i.e., predicted) output,
is the actual output value of the model (i.e., at time t),
are n-past values of the y(t),
is the present input value, and
are the actual and tapped delayed exogenous inputs in m-past input values.
Figure 1 illustrates the overall schema of the NARX model
.
Figure 2 reveals the proposed identification structure of the target heating model, taking into account the above-stated work hypothesis including the influence of the occupancy. The learning process is performed by utilizing the open-loop NARX scheme. The learning dataset contains the operation of the real system’s sequences within different valve positions (i.e.,
), providing various heating powers, the occupancy-rate at time t (i.e.,
), the actual and m-past measures of outdoor temperature (i.e.,
,
, … and
), and the actual and n-past values of indoor temperature (i.e.,
,
, … and
).
The influence of occupancy is modeled through the following hypothesis:
occupancy of the living space by residents increases the effective overall heating power. The right-side picture of
Figure 2 gives the general diagram of the modeled living-space within the aforementioned hypothesis. In other words, we assume that occupation of the living space by residents (i.e., bodies providing additional heating sources) is equivalent to an incensement of the heating device’s nominal power. To determine the above-mentioned equivalent nominal power, we exploit the usual policy of setting the adequate heater’s nominal power versus the living space’s characteristics. In fact, building designers determine the adequate heater’s nominal power on the basis of the volume of the concerned living space (i.e., the living space that is supposed to be heated by the heating device) by keeping constant a parameter called “
Heating Ratio” (denoted by
, expressed in W/m
3), defined by Equation (3), where
denotes the heating device’s nominal power (which depends on the technological and structural features of the heating device) and
denotes the volume of the living space (room, and so on). The appropriate value of
is determined versus construction norms (materials, processes, and so on used for constructing the concerned building) and urban, social, and environmental regulations (imposed by legal authorities).
Taking into account the hypothesis related to the occupancy’s influence, Equation (3) may be extended in terms of Equation (4) taking into account the occupancy’s influence. In this equation,
denotes the “
Heating Ratio” taking into account the occupancy,
states for additional heating power provided by the living space’s occupancy (with
), and
denotes the volume of the living space (room, and so on).
One can notice that
corresponds to the
Heating Ratio of the same living space when it is empty. We determine
through a fuzzy-logic-based concept by considering, on the one hand, three categories (fuzzy intervals) of living spaces (i.e., three fuzzy intervals of living space’s volume), and on the other hand, by considering five categories of occupancy-rate (denoted by
, expressed in %). Namely, the three categories for living space’s volume are as follows: “Large”, “Medium”, and “Small”. The five categories of occupancy-rate are as follows: “Empty” (corresponding to
), “Small occupation” (i.e.,
), “Medium occupation” (i.e.,
), “High occupation” (i.e.,
), and “Full” (i.e.,
). Within this formulation, Equation (4) may be written in terms of Equation (5), where
denotes the “
fuzzy value” of
.
The main advantage of such a formulation is that the above-mentioned intervals may be quantified by the use of data-driven statistical clustering methods, reflecting the reality of the concerned building’s usage (occupancy). Thus, the equivalent nominal power of a given living space occupied by N bodies may be estimated as .
Flooding back to the hypothesis we made related to the effect of the living space’s occupancy, another way of interpreting the aforementioned hypothesis is to note that the occupancy of the living space by residents will decrease the required time for reaching the target indoor temperature. In other words, the higher the living space’s occupancy, the shorter the necessary delay to heat it. In order to quantify this, we introduce what we call “
Heating Slop” (denoted by
and expressed in °C/s), defined as the derivative of
versus the time and approximated by Equation (6) within the context of a discrete sampling (i.e., discrete measurement) of the indoor temperature. In Equation (6),
and
stand for consecutive values of indoor temperature (supposed to be provided by the temperature sensor at times
and
, respectively) and
.
It is pertinent to notice that an escalation of the living space’s occupancy (introducing additional bodies and thus additional sources of heating) or decrease of occupancy will result in the so-called heating slop’s modification: the higher the living space’s occupancy, the stronger the heating slop. Within the general standpoint, and as formulated by Equation (6),
is time-dependent, and thus may vary along with time. However, because of the fact that the heating of buildings abides by slow dynamics, often
remains constant (with regard to the time), albeit its value would vary along with the valve-position (i.e., with
) that controls the heating device’s actual power. On the basis of the aforementioned points, actually, the predicted indoor temperature (i.e.,
) may be computed from Equation (7), where
denotes the identified (estimated) value of
.
3. Implementation of the Proposed Living-Spaces’ Dynamic Heating Model
As has been mentioned, a real five-floor building located at Senart Campus of University Paris-Est Créteil (UPEC) served as an experimental platform for the evaluation and validation of the proposed model. The concerned building (namely Building A of the campus) is a fully automated building hosting the Electrical Engineering and Industrial Informatics Department of Senart-Fontainebleau Institute of Technology of UPEC. The building (i.e., system to be identified) is heated by a conventional central heater supplying radiators (i.e., heating devices) located in various living spaces (namely, office rooms, classrooms, practical rooms, and so on) of the building. The central heater is common to three buildings of the campus, and thus the control of the local heating devices of the concerned buildings (including Building A) is performed through the local valves of each radiator. The two other buildings are conventional buildings (i.e., not automated) and the sole Building A is automated. In fact, Building A is equipped with numerous sensors and connected devices allowing the recording of data related to environmental information (such as temperatures in each living space and the outdoor temperature) and the operational states of whole installed connected devices (such as radiators’ valves). Four different kinds of sensors outfit each living space (including corridors) the entire five floors of this building: “temperature sensors” (TSs), “magnetic sensors” (MSs), “presence detectors” (PDs), and “luminance sensors” (LSs). The main connected devices (actuator) deployed in the aforementioned experimental building are as follows: “motor valves” (MVs), which control radiators supplied by the abovementioned central water-flowed heating system, and connected “lighting elements” (LEs).
Sensors and connected devices concerned by the purpose of the present paper are TSs and MVs. They use “EnOcean” technology; an energy harvesting wireless technology provided by EnOcean [
27]. EnOcean-technology-based modules fuse micro-energy converters with ultralow power electronics and reliable wireless communications, allowing to provide self-powered wireless sensors or actuators for building energy management systems as well as for industrial applications.
Figure 3 presents the implementation diagram of the concerned building (Building A) heating system.
The connected heating system includes three operational layers:
- -
Supervision layer (SL): It consists of a PC including TopKapi server supervision software (a supervisory control and data acquisition software), which acts as a supervision agent. It also includes a number of adequate interface agents (software units) concerning the control layer and storage memory [
28]. It is relevant to note that, while nowadays micro-controllers are able to handle diverse computational skills, they may still be limited regarding computational needs relating to the context of the presented work. In fact, in our work, we deal with machine learning-based identification, where a number of computational tasks need improved computational ability (especially for the training task). Actually, the effective adaptability to the real-usage context of the system would require updating the models’ parameters versus the evolution of effective conditions (i.e., bring up to date the system’s “knowledge”). That is why the choice was directed toward integrating a server. Moreover, the target system addresses smart-buildings’ context, and thus would deal with a rather large number of living spaces. This reinforces the choice of superior computational ability.
- -
Control layer (CL): This layer contains the programmable logic controller (PLC) and EnOcean modules (pilots and interfaces) necessary to conduct the related sensors and devices composing the physical layer [
29]. The concerned PLC is a “WAGO-I/O-SYSTEM” belonging to the family of ETHERNET programmable Fieldbus controllers distributed by WAGO company [
30]. It supports both MODBUS/TCP and a wide variety of standards ETHERNET/IP protocols in order to integrate easily into various IT environments.
- -
Physical layer (PL): It consists of the aforesaid sensors and actuators devices.
Composing the heating control chain of the SBEMS of the aforementioned fully automated experimental building, the CL and PL are replicated for each floor, making possible the set up collecting data characterizing the heating state of each living space of the building and controlling valve position of each heating device (radiator) in the building through the five PLCs (one for each floor). The proposed identification approach was implemented in the SBEMS of the aforementioned fully automated experimental building.
For evaluation of the proposed identification strategy and the issued model, two experimental assessments were considered. The first one appraises the obtained model’s “one-step prediction” (OSP) accuracy and the second one sizes up the ability of the issued model on “multi-step prediction” (MSP). The purpose of OSP aims to predict the living space’s immediate upcoming indoor temperature from its previous history. Therefore, open-loop as well as closed-loop architectures could be used. Meanwhile, the objective in MSP relates to the prediction of several successive future steps of the concerned living space’s indoor temperatures, and thus the open-loop architecture remains no more pertinent.
5. Conclusions
Considering that, alongside the living space’s intrinsic structural features, the occupancy of the living space (by users or residents) may affect the model of heating dynamics of the concerned living space, we have investigated the design, implementation, and validation of a data-driven machine learning-based identifier supplied by the time-series prediction paradigm’s formalism. The proposed data-driven machine learning-based identifier was applied for modeling the dynamic heating conduct of a real fully automated five-floor building’s living spaces located at Senart Campus of University Paris-Est Créteil, taking into account their occupancy (by users of this public building). The proposed modeling strategy takes advantage, on the one hand, of the time-series’ forecasting capacity of the NARX model, and on the other hand, of the multi-layer perceptron’s (MLP) learning and generalization skills.
If, as expected, the one-step-prediction (OSP) model, operating on the basis of an open-loop scheme, achieved high prediction accuracy in forecasting of the upcoming value of the indoor temperature (i.e., less than 0.2 °C blunder comparing to the measured value), anchored in an open-loop NARX scheme, its main shortage appears when a longer-term forecasting of indoor temperature is required; especially, when the target model is used for designing an adaptive heating control strategy. Achieving a lower accuracy compared with the OSP model (i.e., an average error up to 0.4 °C and a maximum error of 1.23 °C for long-term prediction), the multi-step-prediction (MSP) model, operating in closed-loop, represents an attractive compromise for longer-term forecasting of the dynamic heating behavior, and thus offers an appealing perspective for designing adaptive heating controllers for SBEMSs.
The achieved results stress several appealing issues related to the denotation of these results as well as the status of the NARX-based forecaster regarding data-driven identification of heating dynamics in real smart-buildings. The first points come across the ability of the proposed approach in the modeling complex thermal conduct of buildings, including the effect of inhabitants’ presence on the discrepancy of their heating dynamics. In fact, this is visible through the obtained MSE and MAE values, highlighting a prediction of indoor temperature with a less than 0.2 °C blunder compared with the measured value. This foretells the perspective of effectual usage of the proposed approach for designing data-driven adaptive controllers of buildings’ heating behavior versus the context of their usage by potential residents. The second remark relates to the possibility of a standard-technology-based effective implementation of this investigated machine learning-based identifier in authentic smart-buildings, taking advantage of the robustness of those standard (and market available) technologies and avoiding the complexity and cost of designing specific implementation policies. Finally, the last mention goes to the accuracy of the achieved predictions related to well short-term (i.e., one-step) as well as long-term (i.e., closed-loop) forecasters. Another attractive feature, arising from the theoretical foundation of the proposed approach, relates to the comprehensive interpretation of the living space’s occupancy effect, with a quantitative appreciation of its influence on the smart-building’s heating conduct’s deviation.