[1]YE Zhi-fei,WEN Yi-min,LU Bao-liang.A survey of imbalanced pattern classification problems[J].CAAI Transactions on Intelligent Systems,2009,4(2):148-156.
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
4
Number of periods:
2009 2
Page number:
148-156
Column:
综述
Public date:
2009-04-25
- Title:
-
A survey of imbalanced pattern classification problems
- Author(s):
-
YE Zhi-fei1; WEN Yi-min 2; LU Bao-liang 1; 3
-
1. Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai 200240, China;
2.Department of Information Engineering,Hunan Industry Polytechnic, Changsha 410208, China;
3.MOEMicrosoft Key Lab. for Intelligent Computing and Intelligent Systems, Shanghai Jiao Tong University, Shanghai 200240, China
-
- Keywords:
-
machine learning; imbalanced pattern classification; resampling; cost sensitive learning; task decomposition; classifier ensemble; evaluation matrices
- CLC:
-
TP181
- DOI:
-
-
- Abstract:
-
Imbalanced data sets have always been regarded as presenting significant difficulties when applying machine learning methods to realworld pattern classification problems. Although various approaches have been proposed during the past decade, limitations are imposed by many realworld imbalanced data sets, and as a result, a lot of further research is currently being done. In this paper, we provide an uptodate survey of research on imbalanced pattern classification problems. We first took a deep look into the problems that imbalanced data sets bring, and then we introduced different kinds of solutions in detail, with their representative approaches. Finally, using three real imbalanced data sets, we compared the performance of some typical methods including resampling, cost sensitive learning, training set partitions, and the performance of classifier ensembles. In addition, topics such as evaluation indexes and future areas of research were also discussed.