3.2. Features Extraction
Some indicators in the raw data given by the contest organizers are sensitive to icing, and some indicators are almost not related to icing. So, the first step in this paper on the data is to pick out the icing-sensitive indicators from the raw data indicators. However, relying solely on these indicators does not well identify early icing data from non-icing data, this paper further processed the data and obtained some better indicators of icing and non-icing. In general, it is through the screening and supplementation of indicators to achieve better characterization of early icing with fewer features, which not only reduces the running time of the model but also gives better results.
This section will introduce the process of data preprocessing, including the screening of basic features and the construction of other features, with giving some figures to make features more intuitively judgment—whether it is easier to distinguish between icing and non-icing.
Extraction features, especially quantitative features [
31] are very essential for the fault diagnosis of equipment. On the one hand, because the inertia of the wind turbine blade will reduce the correlation between the instantaneous power and the instantaneous wind speed, taking the average value from the data over a certain time span can reduce the inertial effect to some extent. On the other hand, in the original data, about 8 samples are collected every minute, but because the data provider has deleted some data, the sample interval time in the data is not fixed. So, the data are resampled in one-minute intervals, the specific process is as follows. According to the timestamp, the SCADA data grouped every minute for the time span, and then the mean of each group sample is taken as the new sample characteristics.
where
V is the average wind speed—the new sample characteristics;
n is the number of wind_speed in one minute. The solutions of average power P and other new variables are the same as Equation (3).
Then the data is filtered.
Filter unspecified data. In the given raw data set, the data covers normal sample data, icing sample data, and other unspecified data, according to the status tag. Unspecified data will affect the classification of normal sample data/icing sample data, due to the uncertainty of its information; it will be classified as invalid data.
Filter samples below 80% of full power. Taking wind turbine 21# as an example, it can be found that when the wind turbine is at more than 80% full power icing status data does not exist by comparison of the original instantaneous power-wind speed scatter plot (shown in
Figure 3), and the processed average power-wind speed scatter plot (shown in
Figure 4). In other words, the wind turbine power cannot reach 80% of full power after the blade’s freeze. Filter the samples below 80% of full power can make it easier to identify early icing data.
In following figures, the green points are in the wind turbine normal state and the red points are in the icing state. The blue lines in
Figure 3 and
Figure 4 represent the dividing lines representing 80% of full power.
Filter unspecified data and samples below 80% of full power and normalize the remaining data (making the scale from 0 to 1), then plot the average power and average wind speed as a scatter plot (shown in
Figure 5).
Wind turbines are devices that convert wind energy into mechanical energy and then into electrical energy, where wind speed and power are regarded as the two basic features of icing prediction. When the blades freeze, the shape and aerodynamic characteristics of the blades will change, reducing the power output. Therefore, when the wind turbine blades freeze, the relationship of the output power and the wind speed will be changed.
In the non-icing condition, the wind machine operates according to the wind turbine power characteristic curve in the normal mode (the green part of
Figure 5). After the icing formation, the actual operation state of the wind turbine will deviate, and the power cannot reach the rated power. When the normal state sample data is used, the abnormal point eliminated, the power characteristic curve of the wind turbine is fitted to obtain a baseline model of the power characteristic curve [
32], and then this model is used to predict the output power at the corresponding wind speed. The baseline model obtained by curve fitting is shown in
Figure 6.
From
Figure 6 the icing sample is more deviating from the baseline model than the normal sample, thus constructing another feature of icing prediction, which can distinguish then better: the degree of deviation from the output power.
where
is the actual measured output power and
is the output power estimated by the actual wind speed and power curve.
After calculating the power degree by Equation (4), to facilitate visual observation of whether the variable is helpful for model classification, we draw a figure about relationship between the power degree and the average wind speed, as shown in
Figure 7. As can be seen from
Figure 7, there are more red dots (icing samples) that are distinguished from green dots (non-icing samples).
In the early stage of icing, the operation state of the wind turbine is similar to the normal state, and it is difficult to separate the icing state from the normal state. However, the detection of early icing conditions is a very important process, and the healthy operation of the wind turbine unit is of utmost importance. It can minimize the loss to the unit due to icing on the blades. Because icing is a cumulative process, instantaneous characteristics such as the wind speed, the power, and the degree of deviation make it difficult to characterize fully icing conditions, especially in the early icing part. Therefore, it is necessary to analyze the evolution of the icing process and extract features that can characterize icing changes to better distinguish early icing conditions and achieve early icing prediction.
This paper mainly extracts features of early icing based upon the characteristics of degree of deviation. The icing process of the wind turbine contains certain periodicity, thus calling for serialization of the original data. Calculate the average rate of change (
) of the degree of deviation at the corresponding time in each time segment.
where
represents the average rate of change of the current degree of deviation;
represents the current degree of deviation;
represents the degree of deviation from the previous moment;
represents the time span (
is the current time after digitization and
is the previous time after digitization).
Then, according to the sliding window method, take ten minutes as the window length and one minute as the moving step length to obtain the maximum value to the degree of deviation and the cumulative value of the within 10 min before the current time.
First, this paper selects two basic features from the 28-dimensional features in the given data, and then adds four additional features based upon the mechanism of the wind turbine operation and icing. Finally,
Table 3 represents the six groups of icing prediction features obtained.