1. Introduction
Electrical capacitance tomography (ECT) is a measurement technique for visualizing dielectric multi-phase flow processes, such as pneumatic conveying systems and fluidized beds, by generating cross-sectional images [
1,
2,
3]. A traditional ECT system mainly contains three parts: the ECT sensor, the capacitance measurement and data acquisition circuit, and the imaging computer. All the possible capacitance data among the non-redundant electrode combinations are measured based on a capacitance measurement circuit [
4], and the permittivity distribution can be reconstructed by certain algorithms. For an ECT sensor with N electrodes, the number of available capacitance data is (N − 1) × N/2.
In the past three decades, research concerning ECT sensor design [
5,
6,
7,
8], hardware design [
1,
9,
10,
11,
12], and image reconstruction algorithms [
13,
14,
15,
16,
17,
18,
19,
20,
21] and applications [
22,
23,
24,
25,
26,
27,
28,
29,
30] have been widely investigated and remarkable progress has been made. So far, ECT is still a very active field. The studies in the literature show that papers on the ECT field are published constantly, among which studies of ECT image reconstruction algorithms make up an important part of it.
Conventional algorithms such as the linear back project (LBP), the Landweber iteration, and the total variation (TV) based regularization are still adopted, meanwhile, in recent years, some distinctive works also have been reported. An example of such a distinctive work is the image reconstruction algorithm based on the sparsity constraint combined with the compressed sensing theory. Ye J. M. et al. designed an extended sensitivity matrix that consists of some normalized capacitance vectors corresponding to the base permittivity elements [
31]. Zhao J. et al. used a sparse reconstruction by a separable approximation algorithm to solve the ECT inverse problem [
32]. Yang Y. J. and Peng L. H. proposed an enhanced linear model and sparsity regularization for the image reconstruction algorithm [
33]. In other areas, Taylor S. H. and Garimella S. V. adopted a level set method to reconstruct ECT images [
34]. Ren S. J. et al. introduced the boundary element method for ECT image reconstruction, and this method was able to reconstruct the permittivity distribution profile in the imaging area well [
35].
In recent years, machine learning theory has flourished in many fields and researchers in the ECT area have also attempted to introduce it to solve the image reconstruction problem. Marashdeh et al. trained a combined multilayer feed-forward neural network and analogue Hopfield network [
36]. Wang et al. proposed a least square support vector machine and bacterial colony chemotaxis algorithm for ECT image reconstruction [
37]. Li et al. attempted to make the BP and RBF neural networks solve the ECT image reconstruction problem [
38]. Although these attempts made breakthroughs in ECT image reconstruction to some degree, most of these reported machine learning-based ECT image reconstruction methods are trained using a small-scale dataset that usually comprised of several tens to about one hundred instances. The generalization ability may be limited when the training dataset is small, which means that the training results may be good for the training dataset, but if given a new capacitance vector that the network has never seen before, the network may not be able to figure out the right corresponding permittivity distribution. So, a large-scale dataset is of great necessity for researchers in order to explore machine learning algorithms for ECT image reconstruction. However, there is still no public large-scale dataset in the ECT field.
As is known to all, a good public dataset, such as MNIST [
39] and ImageNet [
40] in the machine learning field, is a key part of machine learning research. For example, ImageNet, which is a large-scale dataset for researchers in the computer vision area, has millions of images under thousands of categories, with a typical category containing several hundred images [
40]. Such a public dataset inspires researchers to explore faster and more accurate image classifying or object detecting methods and launches a great campaign to promote the development of machine learning, especially deep learning theory.
The ImageNet example shows that not only models should be emphasized, but data should also be treated with more attention. The availability of more data would help researchers develop better algorithms. A free and open large-scale dataset is also expected (in the ECT field) to contribute more data to better map the relationship between capacitance and the permittivity distribution and to evaluate and compare the results of different image reconstruction methods under the same criteria.
On the other hand, in order to get the required amount of ECT capacitance and permittivity distribution data for a study on image reconstruction, a lot of simulation models need to be built or a practical ECT experiment system needs to be established, which will cost much in both the materials and time. However, when a large-scale benchmark dataset is brought out, researchers will find it convenient since they need not repeat their data acquisition work. Like the great benefits from ImageNet, it is hoped that such a large-scale public benchmark ECT dataset would also make researchers realize the importance of the dataset, start a revolution of solving the image reconstruction problem, encourage researchers to explore better image reconstruction methods, and have more communication, leading to a breakthrough in ECT image reconstruction theory.
In this paper, a benchmark dataset for ECT image reconstruction is proposed. It consists of tens of thousands of capacitance vectors and corresponding permittivity distribution vectors, as well as sensitivity matrices obtained from 2D simulation models, and 3D simulation models along with static and dynamic experiments. The benchmark dataset can be regarded as two parts. One subset, whose data is from the 2D simulation models, is for the training and testing of machine learning methods for ECT image reconstruction. The other subset, whose data is from 3D simulation models and experiments, is for evaluating and comparing different ECT image reconstruction algorithms. This study is concerned with typical two-phase flow patterns—annular, stratified, single bar, and two-bar. Additionally, three relative permittivity values—2.7, 3.8, and 80—are set on the phase in the higher permittivity value and the lower permittivity value is set to 1. The image reconstruction results of the three traditional algorithms, i.e., the LBP, the projected Landweber iteration, and the total variation (TV) based regularization, along with the deep learning-based method proposed in Reference [
41] are used as examples on how to compare different algorithms under the same evaluation criteria of the benchmark dataset.
The paper is organized as follows.
Section 2 and
Section 3 provide the benchmark dataset and the image reconstruction result examples based on the simulations and experiments, respectively. Finally, conclusions are drawn in
Section 4.
2. The Simulation Part of the Benchmark Dataset
In this section, the simulation part of the benchmark dataset based on 2D and 3D models is introduced. Permittivity distribution images are reconstructed by capacitance vectors in the dataset based on 3D models by using the three conventional ECT image reconstruction algorithms—i.e., the LBP, the projected Landweber iteration, and the TV-based regularization—as well as by using a machine learning-based image reconstruction method. The quantitative criteria for the comparison of the image reconstruction results are also provided.
2.1. The Simulation Part of the Benchmark Dataset Based on the 2D Models
One important part of the benchmark dataset is a large-scale dataset built as a public database for the training and testing of the machine learning-based ECT image reconstruction algorithms. The large-scale dataset is generated by a platform which is established on MATLAB with a GUI and worked on with 2D 8-electrode ECT sensor models built on the finite element analysis software, COMSOL Multiphysics [
42]. It contains totally 40,000 pairs of ECT data samples, with each pair of samples consisting of a normalized permittivity distribution vector with 3228 elements and the corresponding normalized capacitance vector of with an 8-electrode ECT sensor with 28 elements. The flow patterns of the samples are annular, stratified, single bar, and two-bar, respectively. Additionally, each flow pattern has 10,000 pairs of samples.
The 8-electrode ECT sensor model in COMSOL Multiphysics is shown in
Figure 1. The material of the sensor pipe is set to be PVC with a relative permittivity of 2. The lower and higher permittivity values of the flow are 1 and 2.7, respectively. The diameter of the pipe is 70 mm and the thickness of the pipe is 3.5 mm. The gap between two adjacent electrodes is 5 degrees so that the span angle of each electrode is 40 degrees. The round imaging cross-section is divided into a 64 × 64 mesh grid which, in total, has 3228 effective pixels.
The four flow patterns chosen for the benchmark dataset are typical two-phase flow patterns that commonly occur in the industrial field, and other complex flow patterns can be regarded as combinations of these flows. Although the names of these four flow patterns may not be the same—for example, the single bar flow is also called the core flow and two-bar flow may be mentioned as the two-object flow—they are mostly studied in ECT image reconstruction research, such as those reported in References [
1,
12,
14,
16]. To describe the phantoms of different flow patterns quantitatively, certain parameters are selected. The parameter describing annular flow is the thickness of the annular, which is normalized with respect to the radius of the sensor and denoted by T. For the stratified flow, the normalized height of the flow surface is selected, i.e., H. For the single bar, the position of the center point C(x,y) of the bar, of which the coordinates are normalized with respect to the sensor radius, is also used besides the normalized bar radius, R. For the two-bar distribution, the normalized radii—i.e., R1 and R2—and the positions of the center points of the two bars, i.e., C
1(x,y) and C
2(x,y), are all used.
Figure 2 depicts the four flow patterns with the corresponding parameters.
2.2. The Simulation Part of the Benchmark Dataset Based on the 3D Models
Another simulation part of the benchmark dataset for evaluating and comparing the ECT image reconstruction algorithms is also built based on 3D models. This part contains capacitance vectors corresponding to 80 cases, including the capacitance vectors of the full and empty pipes for calibration, 2 sensitivity matrices for the 8-electrode sensor, and the 12-electrode sensor, respectively, and 12 normalized permittivity distribution vectors.
The four flow patterns and the pixel division of the samples in the dataset based on the 3D simulation models are the same as those in
Section 2.1. Three relative permittivity values—2.7 (e.g., oil), 3.8 (e.g., sand), and 80 (e.g., water)—are investigated, covering situations of low-contrast and high-contrast permittivity changes.
Table 1 provides normalized parameters describing the phantoms and the corresponding phase ratio of the material with a high permittivity in each phantom.
To obtain the simulated capacitance data regarding the different phantoms, a 3D 8-electrode ECT sensor model and a 3D 12-electrode ECT sensor model are built in the COMSOL Multiphysics software.
Figure 3 depicts the 3D 8-electrode ECT sensor model for the simulation. The inner diameter of the pipe is 70 mm and the outer diameter is 80 mm. The length of the sensor is 370 mm, of which the electrode length is 140 mm. The gap between the two adjacent electrodes is 5 degrees so that the span angle of each electrode is 40 degrees and 25 degrees for the 8-electrode sensor and the 12-electrode sensor, respectively.
Considering both the computed accuracy and time cost, the capacitance data in this benchmark dataset are computed based on a custom mesh in COMSOL Multiphysics with the maximum element size set to 20.4 mm, the minimum element size set to 1.48 mm, and the maximum element growth rate set to 1.4. For the empty pipe case, the total mesh element number is 1,119,690.
The capacitances among the different electrode combinations are dependent on the relative permittivity, the phase ratio, and the flow pattern.
Figure 4 is an example of how these factors matter, where the capacitance vectors of each flow pattern with one phase ratio under three different permittivity values are compared, and the capacitance vectors of the empty pipe and the full pipe under these three permittivity values are also given for calibration. All these capacitance values are simulated based on the 8-electrode sensor. The corresponding capacitance data are given in
Table 2.
For an infinite parallel-plate capacitor, the capacitance value increases along with an increase of the permittivity value of the medium between the two electrode plates so that the relationship between variation of the capacitance and the variation of the permittivity value is linear. However, it can be found that for ECT sensor, the relationship between the capacitance value and the permittivity value is nonlinear, especially for the case in which the permittivity variation is of a high contrast. In
Figure 4a, the capacitance values of the adjacent electrode pairs with permittivity values of 3.8 and 80 are very close and, in
Figure 4d,e, the capacitance values of the adjacent electrode pairs with a permittivity value of 80 are even smaller than those with permittivity values of 2.7 and 3.8.
This phenomenon appears because, for adjacent electrode pairs, only a very small region in the circular ECT imaging area has a very sharp positive sensitivity while most of the region has a negative sensitivity. Meanwhile, for the opposite electrode pairs, most of the region in the circular ECT imaging area has a relatively high positive sensitivity while a relatively small region has a negative sensitivity. The comparisons of the sensitivity map appearance of the adjacent electrode pairs and the opposite electrode pairs regarding the negative sensitivity characterizations are demonstrated clearly in
Figure 5. The effect of the negative sensitivity map can also be reflected from the capacitance values of the adjacent electrode pairs while the permittivity value is 80 and the distributions are annular or stratified. It was found from
Figure 4b that the capacitance values of the adjacent electrode pairs, while the permittivity distribution is 50% annular, are about 3.55 pF, which are even larger than the values while the pipe is full. Furthermore, for the 19.58% stratified distribution (
Figure 4c), certain capacitance values of the adjacent electrode pairs reach 5.08 pF.
With the change of the medium’s permittivity from a low value to a high value in the ECT imaging area, the capacitance values among the different electrode pairs behave totally differently in terms of their properties and nonlinearities. Cui et al. [
43] and Yang et al. [
44] reported and preliminarily analyzed the effect of the nonlinearity of capacitances between different electrode pairs on ECT image reconstruction. This issue may need to be investigated more deeply in future studies on ECT image reconstruction, particularly while the permittivity distribution inside the sensor has a relatively high contrast variation.
2.3. The Deep Autoencoder and the Iteration Method Based on It
As is known in the ECT field, the nonlinear relationship between capacitance and permittivity deteriorates the quality of the reconstructed image based on the linear model when the permittivity variation becomes large. This is because the linear model approximates the nonlinear relationship between capacitance data and the corresponding permittivity distribution by neglecting the higher order terms of permittivity variation. When the permittivity variation becomes larger, the neglected terms matters more, thus, the imaging reconstruction quality worsens. However, if the nonlinear model is used for improving the quality of the image reconstruction, the real-time ability of the nonlinear model-based algorithms for online imaging should be considered. In this sense, better image reconstruction algorithms should be put forward to meet the requirements of both imaging quality and speed.
A deep autoencoder along with the iteration method proposed in Reference [
41] provides a new way to solve the ECT image reconstruction problem. This method is a deep supervised autoencoder which has an encoder and a decoder (with five layers each) that can deal with both the former problem and the inverse ECT problem. The nonlinear relationship from the permittivity distribution to the capacitance data is modeled by the encoder (
F(·)) and, conversely, the reconstruction from the capacitance data to the permittivity distribution is solved by the decoder (
G(·)) of the deep autoencoder. Suppose
x is the vector of the permittivity distribution,
y is the capacitance data vector,
is the reconstructed permittivity distribution, and
is the estimated capacitance data calculated from the permittivity distribution, then, according to the structure of the autoencoder in
Figure 6, there is
To take into account both the forward problem and the inverse ECT problem under the deep autoencoder framework, another two vectors—
and
—are defined as follows:
The autoencoder is trained by minimizing the loss function, which is denoted by
L. Because of the four estimated variables in Equations (1) and (2),
L consists of four parts, see Equation (3), where
,
,
, and
are the weights of these four parts of losses,
l is a particular reconstruction error which chosen to be mean squared error (MSE), as is described in Equation (4), for any two
n-dimensional vectors
v and
.
Although the proposed deep autoencoder would take a lot of time to train, when it is well-trained, the deep autoencoder can be faster than most traditional ECT image reconstruction algorithms where the forward problem is usually solved by some time-consuming finite element method (FEM) and the image reconstruction algorithm would also consume a lot of calculation resources. Some iterative algorithms even need to repeatedly solve the forward problem and the inverse problem, which will lead to a good image reconstruction quality but will sacrifice too much time to satisfy its online use. If the deep autoencoder is used to implement the iterative process, the calculation time can be saved and image reconstruction quality will be promoted. Thus, an iteration method is inspired by the following Landweber iteration [
45]:
where
xk is the calculated permittivity distribution at the
kth step and
y is the normalized capacitance vector.
S is the sensitivity map in the linear model of the ECT, which maps the permittivity distribution to the capacitance data and corresponds to
F(·) in Equation (1), and
ST maps the capacitance data to the permittivity distribution as
G(·). So, if the deep autoencoder is used to implement the Landweber iteration, the equation should be Equation (6).
2.4. Image Reconstruction Examples Based on the Simulation
In this paper, four image reconstruction algorithms, i.e., the LBP [
46], the projected Landweber [
45], the total variation (TV) based regularization algorithm [
47], and the deep autoencoder introduced above, are executed on the 3D model-based simulation part of the benchmark dataset. In order to quantitatively evaluate the ECT image reconstruction results and compare the performance of the different reconstruction algorithms, the evaluation criteria should be determined. The commonly used criteria include the relative image error of the reconstruction, the correlation coefficient between the real permittivity distribution and reconstructed permittivity distribution, and the other parameters related to the permittivity distribution, such as the phase ratio (phase concentration).
1. Relative image error
The relative image error is defined as the relative error of the reconstructed permittivity vector
with respect to the real permittivity vector
, as is shown below:
2. Correlation coefficient
The correlation coefficient indicates the similarity between the reconstructed permittivity distribution and the original permittivity distribution, which is defined as Equation (8), where is the mean of , is the ith element of , is the mean of , and is the ith element of .
3. Phase ratio error
The ECT image reconstruction is commonly used for evaluating the phase ratio in the application of the two-phase flow measurement, thus, the phase ratio error of the reconstructed image is also an important criterion. In this paper, the ‘phase ratio’ is the phase concentration of the medium with the higher permittivity value, which is computed by summing the permittivity distribution vector, i.e., the gray-scale value of the permittivity distribution. By using
to stand for the real phase ratio, which can be calculated according to the phantom, and
to stand for the estimated phase ratio from the reconstructed permittivity distribution, the phase ratio error can be defined as
Some image reconstruction examples calculated by the 8-electrode capacitance vectors in the four flow patterns with a relative permittivity of 2.7 in
Table 2 are shown in
Figure 7, where the comparison of the image reconstruction results by different reconstruction algorithms is demonstrated. The related criteria data are listed in
Table 3. Note that the phase ratio is estimated by summing the reconstructed normalized permittivity vector, and there is an artifact in the reconstructed images, therefore, the phase ratio error does not have a positive correlation with the relative image error.
In
Figure 7, the images reconstructed by the autoencoder are apparently much better than that the images constructed by the other three traditional algorithms in terms of the visual effect, and very close to their corresponding real permittivity distributions. As for the other three algorithms, the reconstructed images of the LBP are far from the real permittivity distributions, especially for the single bar and two-bar flow. This conclusion can also be supported by the results of the three criteria in
Table 3: all the criteria data show that the quality of image reconstruction by the autoencoder is much better than those constructed by the three traditional algorithms. The image reconstruction quality of the LBP is worse than the projected Landweber iteration and the TV. As for the reconstruction results of the projected Landweber iteration and the TV, they are much more similar to the real permittivity distribution than the LBP results visually, however, when compared to the criteria data in
Table 3, it can be found that the projected Landweber iteration and the TV perform similarly in the two annular flows and two stratified flows cases, but the TV algorithm shows a better performance in the single bar and two-bar flows cases evaluated by all the criteria data.
Figure 7 also shows that although the autoencoder is trained by the 2D simulation dataset, its performance is still satisfying in the 3D simulation dataset. This means that the autoencoder has a generalization ability to some extent. In order to further examine the generalization ability of the autoencoder, some flow patterns not in the training dataset are tested; see
Figure 8.
Because there are only four flow patterns in the training dataset (i.e., annular, stratified, single bar, and two-bar), the performance of recognizing other new flow patterns with the proposed autoencoder network depends on its generalization ability. The results of the autoencoder are better than the other three algorithms in phantom 1 and 2 of
Figure 8, showing that the autoencoder does have some generalization ability. However, the results of phantom 3 and 4 are quite unsatisfactory and bars inside the annulus cannot be reconstructed completely, implying that the generalization ability of the autoencoder is not good enough to recognize all of the new flow patterns. In order to promote the image reconstruction quality of the deep autoencoder, the iteration method introduced in
Section 2.3 is implemented; see
Figure 9. After using the iteration method, the image reconstruction results are improved as shown in
Figure 9 and bars inside the annulus can be recognized because there is a single bar flow in the training dataset. A good way of improving the generalization ability is by enhancing the diversity of the training data, so, in future, more other flow pattern data are considered to be supplemented in the dataset to increase the generation ability of the methods based on machine learning.
4. Conclusions
In this paper, a benchmark dataset for ECT based on 2D and 3D simulations, as well as static and dynamic experiments, is built. The 2D simulation part contains 40,000 pairs of samples with normalized capacitance vectors and their corresponding permittivity distribution vectors. The 3D simulation part contains capacitance vectors corresponding to 80 cases, including capacitance vectors of full and empty pipes for calibration, 2 sensitivity matrices for the 8-electrode model and the 12-electrode model, respectively, as well as 12 normalized permittivity distribution vectors. The static experiment part contains 14 capacitance vectors of the 14 cases, along with 3 capacitance vectors without flow pattern information. The dynamic experiment part contains three normalized capacitance data sequences in different GVFs.
Among these four parts of the benchmark dataset, the part based on the 2D simulation is used as the public database for researchers to use in training and testing their own machine learning-based ECT image reconstruction algorithms. Additionally, the other three parts of the benchmark dataset—i.e., the 3D simulation part, the static experiment part, and the dynamic experiment part—can be used as a benchmark for evaluating and comparing the different ECT image reconstruction methods. Three criteria—i.e., the relative image error, the correlation coefficient, and the ratio error—are put forward as the quantitative standard to evaluate and compare the ECT image reconstruction methods. The LBP, the projected Landweber, the total variation (TV) based regularization algorithm, and a deep learning method based on an autoencoder for ECT are used as examples of how to compare the different algorithms under the same evaluation criteria of our benchmark dataset. They are executed in the 3D simulation part, the static experiment part, and the dynamic experiment part of the benchmark dataset, respectively, and the corresponding image reconstruction results are evaluated using the three criteria. Most visual results and quantitative results show that the autoencoder-based deep learning method can perform better reconstructions than the three traditional algorithms and that it has a good generalization ability. However, some results show that the autoencoder is not perfect and that the generalization ability can be further improved.
The benchmark dataset that supported the new deep learning-based image reconstruction algorithm is still at its initial stage and it is not perfect enough at present. It mainly focuses on the research of data from mostly used the 8-electrode and 12-electrode ECT sensors, and there are only four types of flow patterns in the benchmark dataset. Supplements to the benchmark dataset could enhance the diversity of the training dataset for machine learning-based image reconstruction methods, improve the performance of these methods, and expand their application range. In the future, we will add more simulation and experiment data to improve this benchmark dataset. We also welcome other researchers to contribute to the dataset by integrating data from other ECT sensor models—including the 16-electrode ECT sensor, the 3D ECT sensor, and dates related to other flow patterns—and to evaluate their new image reconstruction algorithms under the criteria of the benchmark dataset.
We hope this benchmark dataset can be used by researchers to try new image reconstruction methods—especially faster and better methods based on machine learning, where the hardware system or the simulation model is not necessary—and to make the ECT image reconstruction research area more open and flexible, leading to a big breakthrough.