[1]LUAN Xun,GAO Wei.Two-pass AUC optimization[J].CAAI Transactions on Intelligent Systems,2018,13(3):395-398.[doi:10.11992/tis.201706079]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
13
Number of periods:
2018 3
Page number:
395-398
Column:
学术论文—机器学习
Public date:
2018-05-05
- Title:
-
Two-pass AUC optimization
- Author(s):
-
LUAN Xun; GAO Wei
-
National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210023, China
-
- Keywords:
-
machine learning; AUC; ROC; one-pass learning; online learning; ranking; stochastic gradient descent; statistics
- CLC:
-
TP181
- DOI:
-
10.11992/tis.201706079
- Abstract:
-
The area under an ROC curve (AUC) has been an important performance index for class-imbalanced learning, cost-sensitive learning, learning to rank, etc. Traditional AUC optimization requires the entire dataset to be stored because AUC is defined as pairs of positive and negative instances. To solve this problem, the one-pass AUC (OPAUC) algorithm was introduced previously to scan the data only once and store the first- and second-order statistics. However, in many real applications, the second-order statistics require high storage and are computationally costly, especially for high-dimensional datasets. We introduce the two-pass AUC (TPAUC) optimization to calculate the mean of positive and negative instances in the first pass and then use the stochastic gradient descent method in the second pass. The new algorithm requires the storage of the first-order statistics but not the second-order statistics; hence, the efficiency is improved. Finally, experiments are used to verify the effectiveness of the proposed algorithm.