Ad KMLS 2 052647
Ad KMLS 2 052647
1 Introduction
Mammograms are images with low contrast and a complicated structured back-
ground and generally contaminated by some noise. At the same time, mammo-
grams are very important non-invasive tests in the early detection of breast can-
cer, a disease that according to the World Health Organization (WHO), reaches
more than 2 billions of women every year, and is considered the second leading
cause of death from cancer among women, after lung cancer [1]. In 2018, 3 748
2 R. Travieso et al.
women were fatal victims of the disease in Cuba, which represented the 40% of
cases diagnosed that year4 . Then it is important to have algorithms that im-
prove the quality of mammograms so that specialists can make more accurate
diagnoses. Image smoothing is one of the most common task in image process-
ing, with the aim to improves image quality by some criteria, it means to obtain
noise free sharpness images ready for a forthcoming analysis. In this work we
are interested in image smoothing algorithms that keeps edges, edges from detail
existing at fine scales and edges from the boundaries of large objects existing at
large scales; which is particularly important as previous step for edge detection
and image segmentation.
Among the image smoothing PDE-based methods, Perona and Malik [2] pro-
posed for the first time a non linear diffusion based algorithm that overcame the
disadvantages of the linear diffusion such as blurring or dislocates the edges of
the images. Nonlinear anisotropic diffusion (AD) is a filtering tool that preserve
the position of such edges through the image smoothing process. In the seminal
paper [2], they proposed a non linear PDE with variable diffusion coefficient to
model the diffusion process. The diffusion coefficient c is chosen as an appropriate
function of the gradient of the brightness function c(x, y, t) = g(k∇I(x, y, t)k),
that encourages intra-region over inter-region smoothing. To achieve a better
performance in edge preserving, different expressions for the diffusion coefficient
have been proposed, see for instance Table 1 in [3]. All the given expressions for
this coefficient depend on a parameter k, called contrast parameter or thresh-
old for the magnitude of the gradient of the brightness with great influence on
edge preserving. There are few works that focus on estimating the contrast pa-
rameter k. The simplest is to use a constant k, with the difficulty that it does
not take into account smoothing performed on the previous scales and applies a
smoothing with equal intensity in each iteration. In order to decrease k for each
iteration and thus maintain the condition k∇Ik > k to preserve the edges, there
are some alternative methods as can be consult in [3–5]. Particularly interesting
for us is the proposal in [3], where a new method called Partition and Adjust-
ment (KMLS) to estimate k for gray scale images is presented. This method
combine the K-means algorithm and a least square fit to approximate the diffu-
sion coefficient. However, it does not behave well for all types of images, among
which we can mention mammograms, see [4].
This work aims to apply AD of Perona-Malik by regions in mammograpy
images to remove noise and heighlight edges, to compare different ways of esti-
mating the threshold of the gradient magnitud in the diffusion coefficient, and
its influence on edge detection. For this, a new estimator for the threshold of the
magnitud of the gradient is proposed. It is a modification of the one presented
in [3]. A SLIC technique is used to partitioned the image.
4
https://adncuba.com/noticias-de-cuba/actualidad/minsap-da-estadisticas-del-
cancerde-mama-fallecio-el-40-de-los-casos
Title Suppressed Due to Excessive Length 3
∂I(x, y, t)
= div(c(x, y, t) · ∇I(x, y, t)), in Ω × (0, T ) (1)
∂t
c(x, y, t) · ∇I(x, y, t) · N = 0, in ∂Ω × (0, T ) (2)
I(x, y, 0) = I0 (x, y), in Ω, (3)
Many interesting variations of the Perona-Malik model have been proposed, most
of them related to the regularization of the model considering the introduction
of new expressions for the diffusion coefficient, see for instance [6, 9]. This coeffi-
cient characterizes the diffusion process and the goal is to carry on the smoothing
inside regions preferentially over the smoothing along edges. The diffusion coef-
ficient also works as a time stopping function and all known expressions for it
depend on a parameter k in such a way that in those region of the image where
k∇Ik > k , the smoothing effect is weaker and therefore the edges are preserved
4 R. Travieso et al.
and where k∇Ik < k, the smoothing effect is stronger. It means k is a contrast
parameter or a threshold for the magnitude of the brightness gradient with great
influence in edge preserving. With the same number of iterations the result of
diffusion process for different values of k is also different. If we increase the value
of k, the diffusion process blurs the image, removing the details. During diffusion
the image is smoothed which means, the value of the gradient decreases, then the
parameter k that controls the diffusion rate must be readjusted in each iteration,
so that edges remains preserved. It is evident that the diffusion coefficient is very
sensitive to the choice of the parameter k. At the same time, k has to be chosen
considering not only global but local changes of the brightness gradient. Due
to, a diffusion process by regions is proposed in this work, dividing the image
using the SLIC segmentation algorithm, [10]. Then k should also depends on
time as was proposed in [3], where a method based on K-means clustering and
least square fitting (KMLS) to estimate k was introduced. Although the method
in [3] shows good results for Natural Images, its performance for mammograms
is very poor, even sometimes it doesn’t work, [4]. The goal of the present paper
is to address this topic by evaluating the quality of the edges for mammogram
images, and comparing it with other ways of estimate k.
In the literature review there are some ways to estimate the gradient threshold
k, but they they don’t work in the same way for all kind of images. In this
article, some of the them were selected with the aim to verify their effectiveness
for mammograms and to compare them with our proposal.
In order to decrease the value of k for each iteration and thus maintain
the condition k∇Ik > κ to preserve the edges, Francesco Voci [5] proposed
two alternative methods for setting k. A first one known as p-norm, where the
σ·kIt kp
threshold value is calculated as: κ = n·m , with It representing the image
I(x, y) in the t iteration. Because of this value is too high, adaptive scaling of
an image with n rows and m columns will be used. σ is a constant proportional
to the average intensity of the image. A second method presented in [5] is based
on a morphological approach. If the “open–close” difference is taken into account
as the quantization parameter, then κ must be at least equal to the averaged
noise amplitude value. Given an image It with n rows and m columns, and
given a structuring element st (usually 3 × 3 or 5 × 5), we can estimate κ as
follows: κ = (x,y)∈It (It (x,y)◦st) − (x,y)∈It (It (x,y)•st)
P P
n·m n·m , where the symbols “◦”
and “•” represent the open and close operation, respectively. Another methods
to estimate k is the noise estimator (Noise), initially described by [11], and later
used by [2]. In this method a histogram of the absolute values of the gradient in
the whole image is calculated and k is set equal to 90% of its integral value at
each iteration.
In [3] an algorithm called KMLS is proposed. It considers updating the
threshold of the gradient of the intensity in each iteration. Applying a clustering
K-Means, the set P of pixels is partitioned into three subsets: P1 , the subset of
Title Suppressed Due to Excessive Length 5
pixels that does not belong to the edges of the image, P3 subset of pixels belong-
ing to the edges of the image and P2 = P \ (P1 ∪ P3 ), subset of pixels called fuzzy
set because its elements can be weak edges or cannot be edges. From those sets,
the points used to approximate the contrast parameter k are selected, through a
Least Square curve fitting of the curve defined by the diffusion coefficient g2 (s)
given above. Let i21 and i31 be the minimal values of the gradients of the pixels
that belong to P2 and P3 respectively. To control the smoothing strength two
thresholds are defined: wep and sep of weak and strong edges preservation re-
spectively. These thresholds are in practice the values of the diffusion coefficient
g2 (s) that we want to make correspond to the values of the gradient of inten-
sities i21 and i31 . The pixels in P3 (strong edges) are smoothed with strength
less than sep and the pixels of weak edges are smoothed with strength less than
wep; therefore sep should be taken as close as possible to zero. On the other
hand, the threshold wep should be selected taking information from the image,
nevertheless this seems to be a pretty hard work, and requires experimenting in
a specific kind of images, to get some values from statistics. Having the points
(i21 , wep) and (i31 , sep), the contrast parameter k will be obtained using Least
Square fitting, [3]. We focused our attention in this estimator to applied it in
mammograms smoothing, but it was found that it works well for natural images
in scale of gray, but when we tried to apply it in mammogram images, it has
some drawbacks, [4].
where G = ln g2 , xj = k∇It (xj , yj )k, n = 2, having the points (i21 , wep) and
(i31 , sep). The gradient magnitude values i21 and i31 are selected as the value of
the final centroid of K-Means for the set G1 and G2 respectively. Then, solving
the equation
dS
= 0, (5)
dk
we obtain the following expression for the contrast parameter k
s
−(i421 + i431 )
k= . (6)
ln(wep)i221 + ln(sep)i231
Title Suppressed Due to Excessive Length 7
Since the diffusion process will be carried out by regions, it may happen that one
of these regions is also of very low contrast, that is, the values of the intensity
gradient are almost identical. In that case the initial centroids for the clusters in
K-Means will be equals (µ1 = µ2 ). Then, the fitting phase will be done using only
0 0 0 0
one point, (i0 , g2 (i0 )), with i0 = µ1 = µ2 . To choose g2 (i0 ) we take M axG and
M inG as the biggest and lowest gradient of the whole image, respectively. Then,
0 0
if: M axG − i0 < i0 − M inG holds, the sep threshold is used to fit g2 (s) making a
weak smoothing process. On the contrary wep will be used for a strong smoothing
process. In the case i0 0 = 0 then k = 0, which brings a complications because
it undefines the function g2 (s). However, as i0 0 = 0 the coefficient c(x, y, t) = 1
and we do not need to fit the curve. This is equivalent to an isotropic diffusion in
a homogeneous image (without contrast or edges), which is fine, because there
are not characteristics to preserve.
similar. Nevertheless, the value sep = 0.05 provides a good value of the measure
for 9 of the 10 images. In Fig.2 we show the objective performance measures as
a function of the values of wep, for a fixed value sep = 0.05. Again, the graphs
of Pratt and RMSE measures are very similar. Reasoning in the same way, we
choose wep = 0.3.
Using the same 20 images as before from the Inbreast database [13], the crop
function was applied to each image to reduce unwanted external areas to the
breast. For these images, the pixel meshes and the masks of the ground truth
of the lesions were extracted. In a previous step the images were divided into
regions using the SLIC clustering algorithm, with 1, 6, 12, 20, 30 and 100 clusters,
with the best results being for 6 clusters, used for the experiments that follow.
Three types of Anisotropic Diffusion (AD) smoothing were performed, that differ
in the way of estimating the threshold parameter k of the gradient in the diffu-
sion coefficient, namely: using pnorm (AD-pnorm) [5], using the noise estimator
(AD-Noise) [11], and using the proposed algorithm KMLS2 (AD-KMLS2), a
modification of KMLS [3]. The showed results correspond to 10 iterations. After
that, the classic edge detector Scharr is applied to each smoothed image. The
performance of the algorithms is first visually measured comparing the edge
images obtained for each filtered image, as shown in Fig. 3 and Fig. 4. The
Title Suppressed Due to Excessive Length 9
quality of the edge images using AD is always better than when pre-smoothing
is not applied. In both figures, visually the difference between AD-KMLS and
AD-Noise is the same, that in the later the edges are thicker. However using the
objective quality measures, for the image ID53586896 AD-KMLS has a better
result, (Fig. 5) and for the image ID53587663, the better result is for AD-Noise.
An interesting observation is that the visual results are better when the value of
the objective measures is close to 0.06 for PFoM* and close to 0.7 for RMSE.
Fig. 3. Mammogram ID53586896 (a) Mask of the anomaly ground truth edge image,
edge images: (b) without smoothing (c) and(d) after AD-KMLS2 and AD-Noise re-
spectively.
Fig. 4. Mammogram ID53587663 (a) Mask of the anomaly ground truth edge image,
edge images: (b) without smoothing (c) and(d) after AD-KMLS2 and AD-Noise re-
spectively.
5 Conclusions
edge images were evaluated using the PFoM* and the RMSE error measures. The
proposed method AD-KMLS2 achieved better results for 45% of the images in
terms of both quality measures. For the rest of the images the AD-Noise worked
better, although the difference according to the measurements is of the order
least equal 5 × 10−5 and visually the differences are barely perceptible. The best
results are reached by Scharr edge detector.
References
1. Shelley McGuire. World cancer report 2014. geneva, switzerland: World health or-
ganization, international agency for research on cancer, who press, (2016)
2. Perona Pietro and Malik Jitendra: Scale space anisotropic and edge detection using
anisotropic diffusion. IEEE Transactions on Pattern Analysis and Machine Intelli-
gence 12(7), 629—639 (1990)
3. Borroto-Fernández M., González-Hidalgo M., León-Mecías: New estimation method
of the contrast parameter for the Perona–Malik diffusion equation. Computer Meth-
ods in Biomechanics and Biomedical Engineering: Imaging & Visualization 4(3-4),
238–252 (2014)
4. Hidalgo-Gato, Elizabeth: Estimación del pámetro de contraste para el suavizado
por Difusión Anisotrópica aplicado por regiones. Tesis de Licenciatura, Universidad
de La Habana, Cuba (2015)
5. Voci F, Eiho S, Sugimoto N, Sekibuchi H.: Estimating the gradient in the Per-
ona–Malik equation. IEEE Signal Process Mag. 21(3), 39—65 (2004)
6. Weickert, Joachim: Anisotropic Diffusion in Image Processing. ECMI, Series. Teub-
ner, Stuttgart (2008)
7. Catté F., Lions P., Morel J. and Coll T.: Image selective smoothing and edge de-
tection by nonlinear diffusion. SIAM J. Numer. Anal. 29, 182–193 (1992)
8. Weickert, Joachim and Benhamouda, B.: A semidiscrete nonlinear scale-space theory
and its relation to the Perona-Malik paradox. Springer, Wien (1997)
9. Gilboa G, Sochen N, Zeevi YY.: Forward-and-backward diffusion processes for adap-
tive image enhancement and denoising. IEEE Trans Image Process. 11(7), 689—703
(2002)
10. Achanta, Radhakrishna, Appu Shaji, Kevin Smith, Aurelien Lucchi, Pascal Fua
and Sabine Susstrunk: SLIC superpixels compared to state-of-the-art superpixel
methods. Pattern Analysis and Machine Intelligence, IEEE Transactions. 34(11),
2274—2282 (2012)
11. Canny, J.: A Computational Approach To Edge Detection. IEEE Trans. Pattern
Analysis and Machine Intelligence 8, 679–714 (1986)
12. Scharr, H.: Optimal Operators in Digital Image Processing. PhD Thesis, University
of Heidelberg (2000)
13. Moreira, I., Amaral, I., Domingues I. et. al : INbreast: Toward a Full-field Digital
Mammographic Database. (2011)
14. Wikipedia, Mayo 2015. http://en.wikipedia.org/wiki/ Root-mean-
square_deviation
15. Abdou I., Pratt W.: Quantitative design and evaluation of enhance-
ment/thresholding edge detectors. Proc IEEE. 67(5), 753—763 (1979)