Locating Abnormal Heartbeats in ECG Segments Based On Deep Weakly Supervised Learning
Locating Abnormal Heartbeats in ECG Segments Based On Deep Weakly Supervised Learning
Locating Abnormal Heartbeats in ECG Segments Based On Deep Weakly Supervised Learning
A R T I C L E I N F O A B S T R A C T
Keywords: Electrocardiogram (ECG) examination has played a routine and crucial role in many aspects of clinic diagnosis.
Electrocardiogram An auxiliary diagnosis system that can extract effective information from ECG is valuable. In this study, we
Multiple instance learning propose to design a novel multi-instance neural network (MINN) model capable of detecting abnormal ECG
CNN
segments, meanwhile, locating abnormal heartbeats in them. The model is constructed by convolutional neural
network (CNN) and trained under the framework of multiple instance learning. It takes the interaction between
the individual heartbeat and whole ECG segment into consideration during the training process, making them
constrain each other. MIT-BIH arrhythmia database and CMUH database supported by the First Hospital of China
Medical University are used as data resources in this study. 44,332 ECG segments are extracted from both da
tabases to exploit the model. We test our model on ECG segments with various number of heartbeats which are 5,
10, 15 and 20 respectively. The best performance of MINN on detecting ECG segments can achieve a AUC and
sensitivity up to 0.9922 and 0.9809, while on locating abnormal heartbeats can achieve a sensitivity up to
0.9473. The test results indicate our system can offer available classification and location messages, having the
potential to be applied in the analysis of long-term ECG records.
1. Introduction perform well on one dataset obtained from particular acquisition facil
ities and nations, but barely acceptable when switching dataset, the
Electrocardiogram (ECG) is a foundational tool with 12 channels of practicability of those models still needs further exploration. Currently,
signals used for revealing the overall situation of cardiac, recording subject to such situations as misjudging disease types and appearance of
electrophysiological activities of myocardial cells which compose the false negative cases, it is unrealistic to rely entirely on automatic com
elementary functional units of cardiac. As for diagnosis of ECG, it either puter diagnosis over experts clinically. Thus, the ECG analysis algo
depends on experienced cardiologists directly drawing conclusions from rithms are also in the process of sustaining optimization, which mainly
the ectopic morphologies and rhythms, or needs to combine specific reflects in trim of classifier structures, transformation and diversity of
parameters computed by algorithms. Typically, the explicit information datasets, and options of learning style. For example, establishing a larger
provided by those algorithms are heart rate, interval duration, and axis ECG training dataset including data collected from over 50,000 patients
deviation, disease categories may be given occasionally but with limited and employing a modified residual network architecture, these opera
accuracy. tions elevated performance of the classifier effectively and competitively
Over the years, many algorithms directing at ECG analysis have [6]. As another example, during the training process, a generic model
emerged. Recent trends to construct those analysis models are basing on could be pre-trained with a common database and then retrained with a
deep learning (DL) method, where projects such as ECG features gen diverse dataset containing ECG of specific patients, aiming at making the
eration and fusion, data augmentation, location of key joints of waves, model dedicated [7]. Models in aforementioned researches were both
and arrhythmia detection can be fulfilled [1–5]. Since models may trained under supervised learning framework, which is a universal
* Corresponding authors at: Division of Biomedical Engineering, China Medical University, Shenyang, 110121, PR China.
E-mail addresses: [email protected] (X. Sha), [email protected] (S. Chang).
1
These two authors contributed equally to this work.
https://doi.org/10.1016/j.bspc.2021.102674
Received 29 January 2021; Received in revised form 7 April 2021; Accepted 24 April 2021
Available online 4 May 2021
1746-8094/© 2021 Elsevier Ltd. All rights reserved.
Y. Tong et al. Biomedical Signal Processing and Control 68 (2021) 102674
2
Y. Tong et al. Biomedical Signal Processing and Control 68 (2021) 102674
Table 1
Number of heartbeats and ECG segments involved in each dataset.
Type of heartbeat A B C D
This table lists the number of healthy and abnormal heartbeats. The abbreviations of heartbeat types were achieved from the documents in MIT-BIH database, which
provided the annotations for each heartbeat. For heartbeat types with less number, we consolidated them into “other Ab” type showing their total. “Segments” stands
for the total of ECG segments in each dataset. “Ab” is the abbreviation for abnormal.
heartbeats such as right bundle branch block (R), left bundle branch 8: 2.
block (L), atrial premature contraction (A), premature ventricular
contraction (V), junctional premature (J), and junctional escape (j) are 2.4.2. Dataset derived from CMUH database
included in the ECG signals. After cutting these long signals into seg This dataset consists of II lead ECG data of 315 people extracted from
ments with different number of heartbeats, we divided these segments CMUH database. Fig. 3 displays a sample of ECG segment from the
into dataset A, B, C and D, which contained 5, 10, 15 and 20 heartbeats, dataset. The number of different types of heartbeats in the dataset is
respectively. Fig. 2 shows samples of the preprocessed data from those exhibited in Table 2.
four datasets. Table 1 counts the number of heartbeat types contained in
each dataset. The training and test segments were assigned at a ratio of
3
Y. Tong et al. Biomedical Signal Processing and Control 68 (2021) 102674
Fig. 4. CNN model structure. The front part is five CONV blocks: conv1_64, conv2_128, conv3_256, conv4_512 and conv5_512. The kernel size of convolutional layers
and max pooling layers are 1 × 9 and 1 × 3. Each output of the convolutional layer is activated by rectification non-linearity (ReLU) [32], and then undergoes the
batch normalization (BN) transformation [33]. The amount of neuros in three fully connected layers are 2048, 1024 and 1. Dropout [34] is applied between fully
connected layers to prevent overfitting.
4
Y. Tong et al. Biomedical Signal Processing and Control 68 (2021) 102674
Fig. 5. ROC curves of MINN and VGG-16 tested on four datasets of MIT-BIH. The symbol ‘*’ indicates the three models with highest AUC values. Variables false
positive rate (FPR) and true positive rate equal to TP/(TP + FN) and FP/(FP + TN), where TP, TN, FP and FN are defined as true positives, true negatives, false
positives and false negatives, respectively.
2.6. CNN architecture of selecting features and making the related features activated when the
classification is ascertained. The last fully connected layer uses sigmoid
From invention of LeNet, VGG, ResNet, etc. initially to commonly function to generate instance-level predictions Yˆik . After the concurrent
used U-NET, GAN, etc. today, one of the stationary structures in those submodules is a MIL layer, which could aggregate the Yˆik through
networks is fitting convolutional layers. In recent years, as convolutional function f(∙) defined above. The previously detached instances are
layers gain an edge among image processing, they have also been used merged back into bags after the MIL layer, which could be expressed as
widely in the domains of disposing one-dimensional time sequences ( ) 1
Yˆi = f Yˆik = f(σ (h(xik , θ) ) ), where σ(v) = 1− exp(− v), and h(xik , θ) are
such as ECG [23–27] and electroencephalogram (EEG) [28–30].
the values output by the last dense layer of the corresponding submodule
Here, we created a model based on CNN, making it adjust to the
which is responsible for processing instance k of bag i. Logically, the
analysis of one-dimensional ECG data, meanwhile combined it with MIL
better Yˆi prediction outcomes achieved, the probabilities of capturing
to design a model with the function of locating anomalous heartbeats in
long ECG signals. The architecture of the model is displayed in Fig. 4. abnormal heartbeats precisely are higher. Equivalently, the selection of
The heartbeats Xik in an ECG segment Xi are concatenated in parallel to heartbeats would influence ECG segments recognition, thereby, the
serve as the input of the model, and the input shape is m × s, in which m output of Yˆi and Yˆik constrain each other during the training process.
denotes the quantity of the heartbeats and s denotes the number of
sampling points included in each heartbeat. Inspired by the convolu 3. Results
tional modules used in VGG net [31], we built similar structure in the
CONV blocks of MINN. Interactively superimposed convolutional and 3.1. Experiment A
pooling layers comprise the CONV blocks of MINN responsible for
extracting features from signals. Through regulating the parameters, we The main purpose of this experiment was to gain a well-trained
found that increasing the size of convolutional kernel appropriately is MINN model and test it using MIT-BIH dataset. The experiment was
conducive to the capture of detailed information of ECG signals and the implemented on NVIDIA GeForce GTX 1080 Ti with 11G Memory, using
improvement of classification results, thereby the larger kernel of 1 × 9 keras 2.1.2 and python 3.6.5.
is chose to perform convolutional operation with generated feature
maps. Separate submodules are attached to the terminal of CONV 3.1.1. Evaluation on bag-level prediction
blocks, and the amount of them would vary according to the number of One of the major premises of locating abnormal heartbeats well in a
instances contained by the bag. These submodules form different length of ECG is assuring that the accuracy of classifying ECG segments,
branches to process instances independently in parallel, and each of namely the performance of MINN in bag-level predictions is exceptional.
them is made up of three fully connected layers, executing the function The binary classification effect of MINN was evaluated in this section.
5
Y. Tong et al. Biomedical Signal Processing and Control 68 (2021) 102674
Fig. 6. AUC (95 % CI) values of MINN with different MIL functions.
Table 3
Performance of MINN tested on four datasets and comparison among three global functions.
Method dataset A dataset B dataset C dataset D
6
Y. Tong et al. Biomedical Signal Processing and Control 68 (2021) 102674
Table 4
Performance of MINN on instance-level and comparison among three global functions.
Method dataset A dataset B dataset C dataset D
functions. ISR, maxpool and noisy-or could make MINN achieve lower
RMSE on instance-level. We depict the distribution of Yˆik produced by
those three MIL functions through violin plots (Fig. 8). We can clearly
see that the Yˆik values have higher density in 0 and 1 position, meaning
the distribution of it is concentrated. But there are also some obvious
discrete points in the figure, which represent the false positive and false
negative instances emerging in the prediction process.
Table 4 shows Sen and Spec metrics of MINN in predicting Yˆik .
Compared to its performance in bag-level, we find that for the same test
dataset, Sen reduced large, while Spec did not fluctuate much. It means
that although the model could distinguish ECG segments well, the
abnormal heartbeats in them still could not be located very precisely.
The occurrence of the above situation is related to the principle of MIL,
where Yˆi is judged as 0 only if all the Yˆik in the corresponding bag Xi are
judged as 0, and Yˆi is judged as 1, provided any Yˆik in the corresponding
bag Xi is judged as 1. Thus, excellent Spec obtained at bag-level portends
high Spec at instance-level. Whereas, high Sen in bag-level just means
that at least one abnormal heartbeat in an ECG segment will be picked
out exactly, and not every abnormal one will be detected. Fig. 10. ROC curves of MINN and VGG-16 tested on CMUH dataset.
For each test sample, we visualize the last dense layer of MINN which
outputs Yˆik , and then map the probabilities back to the original data. As 3.2. Experiment B
shown in Fig. 9, it illustrates instance-level probability map calculated
by the model. The maps indicate the proposed model has the ability to The main purpose of this experiment was to evaluate the general
locate abnormal heartbeats, which is meaningful in auxiliary diagnosis. ization ability of MINN, and the operation was testing the trained model
7
Y. Tong et al. Biomedical Signal Processing and Control 68 (2021) 102674
Table 5 4. Discussion
Sensitivity and specificity for classifying ECG segments.
Method Sen Spec Clinically, for the diagnosis of ECG, the doctors would refer to pa
rameters calculated by computers. That information would assist in
MINN_ISR 0.9763 0.8288
MINN_maxpool 0.9467 0.6507 making final judgments, thus, designing auxiliary diagnosis systems,
MINN_noisy-or 0.9290 0.8082 considering which useful messages should be provided by systems to
better support doctors to interpret ECG is essential, especially in low and
middle-income regions. Those regions are limited by medical resources,
Table 6 and usually do not have seasoned doctors, resulting in more than 75 % of
Sensitivity and specificity for locating abnormal heartbeats. relevant cardiovascular diseases deaths [39]. Based on aforementioned
situation, we aim at creating an ECG analysis system to better assist
Method Sen Spec
physicians, especially those lacking experiences. The previous ECG
MINN_ISR 0.8844 0.8662 auxiliary diagnostic items generally inclined to directly give ECG types,
MINN_maxpool 0.7591 0.8551
MINN_noisy-or 0.8013 0.9039
which may interfere with the judgements of physicians. Considering that
shortage, our proposed model would give probability of each heartbeat
being abnormal in the form of heat map, that information could serve as
on a brand new dataset, CMUH dataset, which had never been a reference in diagnosis. For the weakly supervised learning mode MIL
encountered by our model before. This behavior could make the test adopted by our study, which has been widely used in many aspects of
results a more realistic reflection of clinical practice effect of our computer aided diagnosis [40–43], this method could avert people from
algorithm. consuming plenty of time to make tags for training dataset. Compared to
those models trained under supervised learning method, the superior
3.2.1. Evaluation on bag-level prediction thing about our model is that it could realize heartbeat detection
According to models’ prediction of Yˆi , the ROC curve was drawn in without labeling any heartbeat in the training datasets.
Fig. 10. We can achieve that although MINN has lower performance on In the part of model evaluation, we used data derived from databases
CMUH dataset, a dataset collected from a new database and not covered built on distinct countries and races to test model. This can largely
by the previous training process, MINN with ISR function still acquired a simulate a scenario, where the model would be really applied to clinical
maximum AUC of 0.9846 (95 % CI, 0.9708− 0.9985). And MINN using ECG diagnosis. The test results revealed that the generalization property
maxpool and noisy-or functions could also achieve AUC values of 0.9477 of MINN is outstanding, which makes the model a potential and pro
(95 % CI, 0.9243− 0.9712) and 0.9520 (95 % CI, 0.9281− 0.9758). spective method to apply in clinical. However, there are also some
Table 5 lists test results of Sen and Spec. Compared to the test on MIT- problems and limitations in our experiments. In clinical diagnosis, the
BIH dataset, metric Sen remains relatively stable while the Spec false negative events is worth noticing. Thereby, according to the test
declined considerably, indicating the probability of MINN mistaking results of MIT-BIH dataset, we calculated how many problematic ECG
normal for abnormal segments is high. However, for a completely new segments were mistaken for healthy ones by MINN, namely the number
dataset, which is quite different from MIT-BIH datasets in aspects of of false negative data. Based on the theory of MIL, the reason an ECG
country, races and acquisition device, the model with ISR function could segment was considered as normal was that the model thought every
still achieve Sen of 0.9763. That proves the model handles the problem heartbeat in the segment was normal. Here, for false negative segments,
of false negatives well, which is substantial in clinical diagnosis. we counted the types of abnormal heartbeats and the amount of corre
sponding types. The results are shown in Fig. 12. It can be discovered
3.2.2. Evaluation on instance-level prediction that types’ A’,’ F’,’ a’ and’ j’ are easily regarded as normal type. It may
Table 6 is the exhibition of evaluation metrics on instance-level. be due to (1) the disparity between the non-sinus P wave in type’ A’
Fig. 11 shows the effect of MINN in locating abnormal heartbeats. (atrial premature contraction) and sinus P wave in type’ N’ is not
Fig. 11. Illustration of heat maps of ECG segments. The symbols "N", "L", etc. in the figure are ground truth.
8
Y. Tong et al. Biomedical Signal Processing and Control 68 (2021) 102674
5. Conclusion
9
Y. Tong et al. Biomedical Signal Processing and Control 68 (2021) 102674
CRediT authorship contribution statement with convolutional neural network, Inf. Sci. (Ny). 405 (2017) 81–90, https://doi.
org/10.1016/j.ins.2017.04.012.
[18] S. Kiranyaz, T. Ince, M. Gabbouj, Real-time patient-specific ECG classification by 1-
Yanni Tong: Methodology, Software, Writing - original draft. Yinan D convolutional neural networks, IEEE Trans. Biomed. Eng. (2016), https://doi.
Sun: Methodology, Writing - review & editing. Peng Zhou: Software. org/10.1109/TBME.2015.2468589.
Yang Shen: Software. Hua Jiang: Validation. Xianzheng Sha: Super [19] S.L. Oh, E.Y.K. Ng, R.S. Tan, U.R. Acharya, Automated diagnosis of arrhythmia
using combination of CNN and LSTM techniques with variable length heart beats,
vision, Writing - review & editing. Shijie Chang: Writing - review & Comput. Biol. Med. (2018), https://doi.org/10.1016/j.compbiomed.2018.06.002.
editing. [20] T.G. Dietterich, R.H. Lathrop, T. Lozano-Pérez, Solving the multiple instance
problem with axis-parallel rectangles, Artif. Intell. 89 (1997) 31–71, https://doi.
org/10.1016/s0004-3702(96)00034-3.
Acknowledgement [21] P. Viola, J.C. Platt, C. Zhang, Multiple Instance boosting for object detection, Adv.
Neural Inf. Process. Syst. (2005) 1417–1424.
This work was supported in part by the National Science Fund of [22] Y. Xu, J.Y. Zhu, E.I.C. Chang, M. Lai, Z. Tu, Weakly supervised histopathology
cancer image segmentation and classification, Med. Image Anal. 18 (2014)
Liaoning under Grant 2018-64, in part by the Big Data Research for 591–604, https://doi.org/10.1016/j.media.2014.01.010.
Health Science of China Medical University under Grant Key Project 6. [23] Y. Zhao, J. Xiong, Y. Hou, M. Zhu, Y. Lu, Y. Xu, J. Teliewubai, W. Liu, X. Xu, X. Li,
Z. Liu, W. Peng, X. Zhao, Y. Zhang, Y. Xu, Early detection of ST-segment elevated
myocardial infarction by artificial intelligence with 12-lead electrocardiogram, Int.
Declaration of Competing Interest J. Cardiol. 317 (2020) 223–230, https://doi.org/10.1016/j.ijcard.2020.04.089.
[24] S.M. Mathews, C. Kambhamettu, K.E. Barner, A novel application of deep learning
The authors declare that they have no known competing financial for single-lead ECG classification, Comput. Biol. Med. 99 (2018) 53–62, https://
doi.org/10.1016/j.compbiomed.2018.05.013.
interests or personal relationships that could have appeared to influence
[25] S. Hong, Y. Zhou, J. Shang, C. Xiao, J. Sun, Opportunities and challenges of deep
the work reported in this paper. learning methods for electrocardiogram data: a systematic review, Comput. Biol.
Med. 122 (2020), 103801, https://doi.org/10.1016/j.compbiomed.2020.103801.
References [26] Z. Yao, Z. Zhu, Y. Chen, Atrial fibrillation detection by multi-scale convolutional
neural networks, 20th Int. Conf. Inf. Fusion, Fusion 2017 - Proc (2017), https://
doi.org/10.23919/ICIF.2017.8009782.
[1] M. Guanglong, W. Xiangqing, Y. Junsheng, ECG signal classification algorithm [27] U.R. Acharya, H. Fujita, S.L. Oh, Y. Hagiwara, J.H. Tan, M. Adam, Application of
based on fusion features, J. Phys. Conf. Ser. 1207 (2019), https://doi.org/10.1088/ deep convolutional neural network for automated detection of myocardial
1742-6596/1207/1/012003. infarction using ECG signals, Inf. Sci. (Ny). 415–416 (2017) 190–198, https://doi.
[2] M.Z. Poh, Y.C. Poh, P.H. Chan, C.K. Wong, L. Pun, W.W.C. Leung, Y.F. Wong, M.M. org/10.1016/j.ins.2017.06.027.
Y. Wong, D.W.S. Chu, C.W. Siu, Diagnostic assessment of a deep learning system [28] R. Schirrmeister, L. Gemein, K. Eggensperger, F. Hutter, T. Ball, Deep learning with
for detecting atrial fibrillation in pulse waveforms, Heart (2018), https://doi.org/ convolutional neural networks for decoding and visualization of EEG pathology,
10.1136/heartjnl-2018-313147. 2017 IEEE Signal Process. Med. Biol. Symp. SPMB 2017 - Proc. (2017), https://doi.
[3] M. Hammad, S. Zhang, K. Wang, A novel two-dimensional ECG feature extraction org/10.1109/SPMB.2017.8257015.
and classification algorithm based on convolution neural network for human [29] R.T. Schirrmeister, J.T. Springenberg, L.D.J. Fiederer, M. Glasstetter,
authentication, Future Gener. Comput. Syst. 101 (2019) 180–196, https://doi.org/ K. Eggensperger, M. Tangermann, F. Hutter, W. Burgard, T. Ball, Deep learning
10.1016/j.future.2019.06.008. with convolutional neural networks for EEG decoding and visualization, Hum.
[4] Y. Xia, N. Wulan, K. Wang, H. Zhang, Detecting atrial fibrillation by deep Brain Mapp. (2017), https://doi.org/10.1002/hbm.23730.
convolutional neural networks, Comput. Biol. Med. 93 (2018) 84–92, https://doi. [30] Y.R. Tabar, U. Halici, A novel deep learning approach for classification of EEG
org/10.1016/j.compbiomed.2017.12.007. motor imagery signals, J. Neural Eng. (2017), https://doi.org/10.1088/1741-
[5] X. Zhai, C. Tin, Automated ECG classification using dual heartbeat coupling based 2560/14/1/016003.
on convolutional neural network, IEEE Access 6 (2018) 27465–27472, https://doi. [31] K. Simonyan, A. Zisserman, Very deep convolutional networks for large-scale
org/10.1109/ACCESS.2018.2833841. image recognition, 3rd Int. Conf. Learn. Represent. ICLR 2015 - Conf. Track Proc.
[6] A.Y. Hannun, P. Rajpurkar, M. Haghpanahi, G.H. Tison, C. Bourn, M.P. Turakhia, (2015).
A.Y. Ng, Publisher correction: cardiologist-level arrhythmia detection and [32] X. Glorot, A. Bordes, Y. Bengio, Deep sparse rectifier neural networks, J. Mach.
classification in ambulatory electrocardiograms using a deep neural network, Nat. Learn. Res. 15 (2011) 315–323.
Med. 25 (1) (2019) 65–69, https://doi.org/10.1038/s41591-018-0268-3. Nat. [33] S. Ioffe, C. Szegedy, Batch normalization: accelerating deep network training by
Med. 25 (2019) 530. https://doi.org/10.1038/s41591-019-0359-9. reducing internal covariate shift, 32nd Int. Conf. Mach. Learn. ICML 2015 (2015)
[7] Y. Li, Y. Pang, J. Wang, X. Li, Patient-specific ECG classification by deeper CNN 448–456.
from generic to dedicated, Neurocomputing 314 (2018) 336–346, https://doi.org/ [34] N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, R. Salakhutdinov, Dropout: A
10.1016/j.neucom.2018.06.068. simple way to prevent neural networks from overfitting, J. Mach. Learn. Res. 15
[8] Y. Xu, J.Y. Zhu, E. Chang, Z. Tu, Multiple clustered instance learning for (2014) 1929–1958.
histopathology cancer image classification, segmentation and clustering, Proc. [35] D.P. Kingma, J.L. Ba, Adam: a method for stochastic optimization, 3rd Int. Conf.
IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit. (2012) 964–971, https:// Learn. Represent. ICLR 2015 - Conf. Track Proc. (2015) 1–15.
doi.org/10.1109/CVPR.2012.6247772. [36] E. DeLong, D. DeLong, D. Clarke-Pearson, Comparing the areas under two or more
[9] Z. Jia, X. Huang, E.I. Chang, Y. Xu, Constrained deep weak supervision for correlated receiver operating characteristic curves: a nonparametric, Approach,
histopathology image segmentation, IEEE Trans. Med. Imaging 36 (2017) Biometrics 44 (1988) 837–845, https://doi.org/10.2307/2531595.
2376–2388, https://doi.org/10.1109/TMI.2017.2724070. [37] J. Ramon, L. De Raedt, Multi instance neural networks, ICML Work. (2000) 53–60.
[10] Y. Xu, T. Mo, Q. Feng, P. Zhong, M. Lai, E.I. Chang, Deep learning of feature [38] O.Z. Kraus, J.L. Ba, B.J. Frey, Classifying and segmenting microscopy images with
representation with multiple instance learning for medical image analysis, ICASSP, deep multiple instance learning, Bioinformatics 32 (2016) i52–i59, https://doi.
IEEE Int. Conf. Acoust. Speech Signal Process. - Proc. (2014) 1626–1630, https:// org/10.1093/bioinformatics/btw252.
doi.org/10.1109/ICASSP.2014.6853873. [39] A.H. Ribeiro, M.H. Ribeiro, G.M.M. Paixão, D.M. Oliveira, P.R. Gomes, J.
[11] A. Vezhnevets, J.M. Buhmann, Towards weakly supervised semantic segmentation A. Canazart, M.P.S. Ferreira, C.R. Andersson, P.W. Macfarlane, M. Wagner, T.
by means of multiple instance and multitask learning, Proc. IEEE Comput. Soc. B. Schön, A.L.P. Ribeiro, Automatic diagnosis of the 12-lead ECG using a deep
Conf. Comput. Vis. Pattern Recognit. (2010) 3249–3256, https://doi.org/10.1109/ neural network, Nat. Commun. 11 (2020) 1–9, https://doi.org/10.1038/s41467-
CVPR.2010.5540060. 020-15432-4.
[12] P.O. Pinheiro, R. Collobert, From image-level to pixel-level labeling with [40] M.M. Dundar, S. Badve, V.C. Raykar, R.K. Jain, O. Sertel, M.N. Gurcan, A multiple
convolutional networks, Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern instance learning approach toward optimal classification of pathology slides, Proc.
Recognit. (2015) 1713–1721, https://doi.org/10.1109/CVPR.2015.7298780. - Int. Conf. Pattern Recognit. (2010) 2732–2735, https://doi.org/10.1109/
[13] D. Pathak, P. Krahenbuhl, T. Darrell, Constrained convolutional neural networks ICPR.2010.669.
for weakly supervised segmentation, Proc. IEEE Int. Conf. Comput. Vis. (2015) [41] L. Sun, Y. Lu, K. Yang, S. Li, ECG analysis using multiple instance learning for
1796–1804, https://doi.org/10.1109/ICCV.2015.209. myocardial infarction detection, IEEE Trans. Biomed. Eng. 59 (2012) 3348–3356,
[14] J. Feng, Z.H. Zhou, Deep MIML network, 31st AAAI Conf. Artif. Intell. AAAI 2017 https://doi.org/10.1109/TBME.2012.2213597.
(2017) 1884–1890. [42] D. Wang, A. Khosla, R. Gargeya, H. Irshad, A.H. Beck, Deep Learning for
[15] G.B. Moody, R.G. Mark, The impact of the MIT-BIH arrhythmia database, IEEE Eng. Identifying Metastatic Breast Cancer, 2016, pp. 1–6. http://arxiv.org/abs/1
Med. Biol. Mag. 20 (2001) 45–50, https://doi.org/10.1109/51.932724. 606.05718.
[16] M. Arif, I.A. Malagore, F.A. Afsar, Detection and localization of myocardial [43] Y. Xu, Y. Li, Z. Shen, Z. Wu, T. Gao, Y. Fan, M. Lai, E.I.C. Chang, Parallel multiple
infarction using K-nearest neighbor classifier, J. Med. Syst. 36 (2012) 279–289, instance learning for extremely large histopathology image analysis, BMC
https://doi.org/10.1007/s10916-010-9474-3. Bioinformatics 18 (2017) 1–15, https://doi.org/10.1186/s12859-017-1768-8.
[17] U.R. Acharya, H. Fujita, O.S. Lih, Y. Hagiwara, J.H. Tan, M. Adam, Automated
detection of arrhythmias using different intervals of tachycardia ECG segments
10