[1]JIANG Bing,GU Feiyang,HE Zengyou.Mining Top-k non-redundant distinguishing sequential patterns[J].CAAI Transactions on Intelligent Systems,2018,13(5):680-686.[doi:10.11992/tis.201702019]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
13
Number of periods:
2018 5
Page number:
680-686
Column:
学术论文—机器学习
Public date:
2018-09-05
- Title:
-
Mining Top-k non-redundant distinguishing sequential patterns
- Author(s):
-
JIANG Bing; GU Feiyang; HE Zengyou
-
Software School, Dalian University of Technology, Dalian 116621, China
-
- Keywords:
-
distinguishing sequential pattern; breadth-first; redundant sequential patterns; pattern mining; Top-k
- CLC:
-
TP393
- DOI:
-
10.11992/tis.201702019
- Abstract:
-
Distinct sequential patterns can be used to characterize different categories of datasets. In the field of bioinformatics, logistics management, and e-commerce, the comparison of sequential pattern has a wide range of applications. The goal of the Top-k distinguishing sequential pattern mining is to find k patterns with the highest contrast in a given data set. However, in the Top-k distinguishing sequential pattern mining, there is a redundancy problem with respect to the set of reported sequential patterns, which is not considered by the algorithm. Therefore, in this paper, a non-redundant Top-k distinguishing sequential pattern mining algorithm, breadth-first miner (BFM), is proposed based on breadth-first spanning tree. The redundancy problem is effectively solved using the BFM algorithm. Based on the BFM algorithm, a better algorithm, pruning breadth-first miner (PBFM), is proposed. Through the experimental analysis and comparison on the real data set, the validity of the algorithm is verified.