[1]CHEN Xiao-feng,WANG Shi-tong,CAO Su-qun.Gene function analysis of semisupervised multilabel learning[J].CAAI Transactions on Intelligent Systems,2008,3(1):83-90.
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
3
Number of periods:
2008 1
Page number:
83-90
Column:
学术论文—机器学习
Public date:
2008-02-25
- Title:
-
Gene function analysis of semisupervised multilabel learning
- Author(s):
-
CHEN Xiao-feng1; WANG Shi-tong1; CAO Su-qun1; 2
-
1.School of Information Technology, Jiangnan University, Wuxi 214122 , Ch ina;
?2.Department of Mechanical Engineering, Huaiyin Institute of Technology, H uai’an 223001,China
-
- Keywords:
-
semisupervised; multilabel; selftraining; support vector machine
- CLC:
-
TP181
- DOI:
-
-
- Abstract:
-
Conventional machine learning is used only for single label learning, implying that every sample has only one label. However, in bioinformatics, a gen e has more than one function, so it needs more than one label. Therefore, multi label learning is more effective for identifying gene groups than conventional l earning approach. Current research mainly focuses on supervised multilabel lea r ning. The problem of effective semisupervised multilabel learning strategies f or labeled examples and unlabeled examples of gene expression datasets still rem ains unsolved. In this paper, a semisupervised multilabel learning algorithm , named SML_SVM, is presented as an effective multilabel learner for analysis of gene expressions with at least one function. First, the proposed SML_SVM algorit hm transforms the semisupervised multilabel learning into corresp ond ing semisupervised singlelabel learning by the PT4 method, then it labels un la beled examples using the maximum a posteriori (MAP) principle in combination wit h the Knearest neighbor method, and finally, it solves the corresponding singl e label learning problem using SVM. The distinctive characteristic of the propos e d algorithm is its efficient integration of SVMbased singlelabel learning wi th MAP and Knearest neighbor methods. Experimental results with a real Yeast gen e expression dataset and a Genbase protein dataset show that the proposed SML_S VM algorithm outperforms the PT4based MLSVM method and selftraining MLSVM.