Above, we used 11 groups of source images for the improved method Our, and conducted experimental comparison with seven algorithms, respectively, and carried out subjective evaluation on the fusion results of the eight algorithms. This section will evaluate the performance of these eight fusion algorithms through three evaluation indexes, such as spatial frequency (SF).
4.2.1. TNO Dataset
Table 1 and
Figure 15 show the data table and corresponding line graph of the standard deviation evaluation index obtained by experimental comparison of seven other fusion methods and the improved method Our using seven TNO dataset image pairs, respectively. By analyzing the information in the data table and line chart, it can be found that 1/7 of the standard deviation best values are generated using the method proposed by Li et al., 2/7 of the standard deviation best values are generated using the method proposed by Wang et al., and 4/7 of the standard deviation best values are generated using the improved method Our. This shows that in the comparative experiment of these eight fusion methods, the optimal method in terms of standard deviation is the improved method Our.
Table 2 and
Figure 16 show the data table and corresponding line graph of the spatial frequency evaluation index obtained by experimental comparison of seven other fusion methods and the improved method Our using seven TNO dataset image pairs, respectively. By analyzing the information in the data table and line chart, it can be found that 1/7 of the spatial frequency best values are generated using the FPDE method, and 6/7 of the spatial frequency best values are generated using the improved method Our. This shows that in the comparative experiment of these eight fusion methods, the optimal method in terms of spatial frequency is the improved method Our.
Table 3 and
Figure 17 show the data table and corresponding line graph of the information entropy evaluation index obtained by experimental comparison of seven other fusion methods and the improved method Our using seven TNO dataset image pairs, respectively. By analyzing the information in the data table and line chart, it can be found that 1/7 of the information entropy best values are generated using the method proposed by Li et al. [
17], 1/7 of the information entropy best values are generated using the GTF method, 2/7 of the information entropy best values are generated using the method proposed by Wang et al. [
18], and 3/7 of the information entropy best values are generated using the improved method Our. This shows that in the comparative experiment of these eight fusion methods, the optimal method in terms of information entropy is the improved method Our.
4.2.2. MSRS Dataset
Table 4 and
Figure 18 show the data table and corresponding line graph of the standard deviation evaluation index obtained by experimental comparison of seven other fusion methods and the improved method Our using four MSRS dataset image pairs. By analyzing the information in the data table and line chart, it can be found that 100% of the standard deviation best values are generated using the improved method Our. This shows that in the comparative experiment of these eight fusion methods, the optimal method in terms of standard deviation is the improved method Our.
Table 5 and
Figure 19 show the data table and corresponding line graph of the spatial frequency evaluation index obtained by experimental comparison of seven other fusion methods and the improved method Our using four MSRS dataset image pairs. By analyzing the information in the data table and line chart, it can be found that 25% of the spatial frequency best values are generated using the method proposed by Wang et al., and 75% of the spatial frequency best values are generated using the improved method Our. This shows that in the comparative experiment of these eight fusion methods, the optimal method in terms of spatial frequency is the improved method Our.
Table 6 and
Figure 20 show the data table and corresponding line graph of the information entropy evaluation index obtained by experimental comparison of seven other fusion methods and the improved method Our using four MSRS dataset image pairs. By analyzing the information in the data table and line chart, it can be found that 25% of the information entropy best values are generated using the method proposed by Zhang et al., 25% of the information entropy best values are generated using the method proposed by Wang et al., and 50% of the information entropy best values are generated using the improved method Our. This shows that in the comparative experiment of these eight fusion methods, the optimal method in terms of information entropy is the improved method Our.
On the whole, most of the optimal values of both TNO dataset and MSRS dataset are concentrated in the method proposed in this paper. Then, the Wilcoxon signed rank test is used to test our proposed method. On the basis of these 11 image pairs, we pair seven comparative methods with the method proposed in this paper on three indicators. First, the null assumption is that the data difference between the two groups is zero; the alternative hypothesis is that there are differences in the data between the two groups. We pair the seven methods with the method proposed in this paper, and since there are three indicators, there are 21 sets of data tests. According to the Wilcoxon signed rank test, our difference value is assumed to be the data index of the method in this paper minus the data index of the comparative method. When the value of the method in this paper is higher than that of the comparative method for an image pair, it is positive rank, and vice versa. Secondly, the difference values are ranked according to their absolute values, and the ranking is assigned in order from small to large in absolute value. Then, we find the sum of the positive ranking and the negative ranking, respectively. Our test statistic W is the smallest absolute value of the sum of positive rankings and the sum of negative rankings. Since we have a total of 11 image pairs, n = 11. To determine whether the null hypothesis should be rejected, we refer to the Wilcoxon signed rank test critical value table to find the critical values. The critical value corresponding to a significance level of 0.1 and n = 11 is 13. If our test statistic W is less than or equal to the critical value 13 in the table, we can reject the null hypothesis. Otherwise, we cannot reject the null hypothesis.
(1) According to the calculation, in comparison with the method proposed by Li, we obtain the test statistic W1 = 9 using the standard deviation index, the test statistic W2 = 0 using the spatial frequency index, and the test statistic W3 = 11 using the information entropy index. Since the test statistic W for the three indexes is less than the critical value 13, we reject the null hypothesis, and there is sufficient evidence to prove that there are significant differences between the two methods. Moreover, the sum of our positive rank rankings is much larger than the sum of our negative rank rankings. So, compared with the method proposed by Li, the method proposed in this paper demonstrates a significant improvement in the three indexes. (2) Similarly, in comparison to the method proposed by Zhang, three test statistics are obtained as W1 = 0, W2 = 0, and W3 = 27. We reject the null hypothesis for the index of standard deviation and spatial frequency, while we cannot reject the null hypothesis for the index of information entropy. In other words, compared with the method proposed by Zhang, the method proposed in this paper demonstrates a significant improvement in standard deviation and spatial frequency index. However, for the index of information entropy, there is almost no difference with the method proposed by Zhang. (3) In comparison with the method proposed by Wang, three test statistics are obtained as W1 = 10, W2 = 5, and W3 = 28. We reject the null hypothesis for the index of standard deviation and spatial frequency, while we cannot reject the null hypothesis for the index of information entropy. In other words, compared with the method proposed by Wang, the method proposed in this paper demonstrates a significant improvement in standard deviation and spatial frequency index. However, for the index of information entropy, there is almost no difference with the method proposed by Wang. (4) Compared with the LatLRR method, three test statistics are obtained as W1 = 0, W2 = 2, and W3 = 8. We reject the null hypothesis for all three indexes. In other words, compared with the LatLRR method, the method proposed in this paper demonstrates a significant improvement in the three indexes. (5) Compared with the GTF method, three test statistics are obtained as W1 = 1, W2 = 4, and W3 = 11, and we reject the null hypothesis for all three indexes. In other words, compared with the GTF method, the method proposed in this paper demonstrates a significant improvement in the three indexes. (6) Compared with the MSVD method, three test statistics are obtained as W1 = 0, W2 = 7, and W3 = 4. We reject the null hypothesis for all three indexes. In other words, compared with the MSVD method, the method proposed in this paper has a significant improvement in the three indexes. (7) Compared with the FPDE method, three test statistics are obtained as W1 = 0, W2 = 9, and W3 = 7. We reject the null hypothesis for all three indexes. In other words, compared with the FPDE method, the method proposed in this paper demonstrates a significant improvement in the three indexes.
To sum up, in the statistical test compared with the seven comparative methods, the method proposed in this paper shows significant improvement in other indicators except that there is almost no difference between the method proposed by Zhang and the method proposed by Wang in the index of information entropy. Therefore, the improved algorithm proposed in this paper is superior to the other seven comparative methods.