[1]GUO Shaocheng,CHEN Songcan.Sparsified factorization machine[J].CAAI Transactions on Intelligent Systems,2017,12(6):816-822.[doi:10.11992/tis.201706030]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
12
Number of periods:
2017 6
Page number:
816-822
Column:
学术论文—人工智能基础
Public date:
2017-12-25
- Title:
-
Sparsified factorization machine
- Author(s):
-
GUO Shaocheng; CHEN Songcan
-
College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China
-
- Keywords:
-
factorization machine; sparsity; sparse group lasso; feature selection; recommender systems
- CLC:
-
TP391
- DOI:
-
10.11992/tis.201706030
- Abstract:
-
Factorization machine (FM) is a recently proposed second-order linear model. One of its main advantages is that the interactions within it are factorized, making it suitable for data with high dimensionality and high sparsity. Though FM has been applied in recommender systems, it fails to consider the sparsity of variables explicitly, especially when the variable contains information on structural sparsity. Therefore, the process of feature selection should meet the following requirements: the linear terms and second-order terms that share the same feature should be included or excluded at the same time; when the feature is noise, both should be excluded, otherwise, both should be included. Based on the sparse structure described above, this paper proposes a sparse group lasso-based factorization machine (SGL-FM). By adding sparse group lasso to the loss function, SGL-FM not only achieves sparsity between groups but also within groups. From another point of view, sparsity within groups can be seen as a method of controlling the dimensionality of the factorization; therefore, SGL-FM chooses the best k automatically when faced with datasets with different properties. The experimental results show that by applying the proposed method, under conditions of excellent precision, a model with more sparsity than FM was obtained.