Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Article type: Research Article
Authors: Yu, Hualonga; * | Ni, Junb | Xu, Senc | Qin, Bina | Jv, Hengronga
Affiliations: [a] School of Computer Science and Engineering, Jiangsu University of Science and Technology, Zhenjiang, Jiangsu, China | [b] Department of Radiology, Carver College of Medicine, The University of Iowa, Iowa, IA, USA | [c] School of Information Engineering, Yancheng Institute of Technology, Yancheng, Jiangsu, China
Correspondence: [*] Corresponding author: Hualong Yu, School of Computer Science and Engineering, Jiangsu University of Science and Technology, No.2, Mengxi Road, Zhenjiang 212003, Jiangsu, China. Tel.: +86 511 88690470; Fax: +86 511 88690471; E-mail: [email protected].
Abstract: In many real world applications, class imbalance problems occur frequently, causing great underestimation for the classification performance of minority classes. In recent years, much effective solutions have been proposed to address this problem. However, the recent research found that not all skewed classification tasks are harmful and carrying out class imbalance learning methods on those unharmful ones can hardly improve and even degenerate classification performance, meanwhile increase training time to a large extent. Therefore, it is essential to design an efficient criterion to pre-estimate the harmfulness of class imbalance when we encounter skewed classification tasks. In this study, we explore the reason of harmfulness produced by class imbalance and propose a simple and ingenious strategy using scatter matrix based class separability measure to estimate the harmfulness of class imbalance with merely using training samples. The experimental results indicate that the proposed strategy can quantificationally estimate the damage for imbalanced classification tasks and provide priori information to guide us to select appropriate classification methods. Moreover, the computing complexity of the proposed strategy is quite low, thus it is practical in real-world applications.
Keywords: Class imbalance, harmfulness, scatter matrix based class separability measure, imbalanced ratio, classification
DOI: 10.3233/IDA-140637
Journal: Intelligent Data Analysis, vol. 18, no. 2, pp. 203-216, 2014
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
如果您在出版方面需要帮助或有任何建, 件至: [email protected]