[1] HU Xue-gang, ZHANG Yuan-yuan. An improved algorithm for mining sequential patterns with time constraints[J]. CAAI Transactions on Intelligent Systems, 2007, 2(2): 89-93.
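CAAI Transactions on Intelligent Systems [ISSN 1673-4785/CN 23-1538/TP] Volume:
2
Issue:
No. 2, 2007
Pages:
89-93
Column:
Academic Papers: Fundamentals of Artificial Intelligence
Publication date:
2007-04-25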
- Title:
-
An improved algorithm for mining sequential patterns with time constraints
- Article ID:
-
1673-4785(2007)02-0089-05
- Author(s):
-
HU Xue-gang (胡学钢), ZHANG Yuan-yuan (张圆圆)
-
School of Computer and Information, Hefei University of Technology, Hefei 230009, China
-
- Keywords:
-
data mining; sequential pattern; time constraint
- CLC number:
-
TP182
- Document code:
-
A
- Abstract:
-
An improved algorithm, TSPM, is proposed for mining sequential patterns with time constraints. It addresses the shortcomings of traditional sequential pattern mining methods: high time and space overhead, and a very large number of results that lack pertinence. The algorithm introduces a graph structure to represent the frequent 2-sequences, and needs to scan the database only once to map the information relevant to the mining task into the graph. The graph representation lets the mining process make full use of the order relations between items, which improves the efficiency of generating frequent sequences. In addition, the algorithm uses the positional information of sequences to count support, which reduces the complexity of handling time constraints and avoids repeatedly testing whether a candidate sequence is contained in a data sequence. Experimental results show that the algorithm outperforms traditional sequential pattern discovery algorithms in both time and space.
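The abstract names two ideas without giving their details: a graph of frequent 2-sequences built from a single database scan, and support counting from positional information instead of repeated containment tests. The following Python sketch illustrates those two ideas only; it is not the authors' TSPM. The data layout (each data sequence as a list of `(time, item)` pairs), the function names `build_index`, `support`, `frequent_2_graph` and `mine`, and the parameters `min_sup`, `min_gap` and `max_gap` (bounds on the time gap between adjacent pattern items) are all assumptions made for illustration.

```python
from collections import defaultdict


def build_index(db):
    """Single pass over the database: item -> sequence id -> sorted timestamps."""
    index = defaultdict(lambda: defaultdict(list))
    for sid, events in enumerate(db):
        for time, item in sorted(events):
            index[item][sid].append(time)
    return index


def support(pattern, index, min_gap, max_gap):
    """Count sequences that contain `pattern` with every adjacent gap in
    [min_gap, max_gap], using only the stored positions (no database rescan)."""
    count = 0
    for sid, times in index.get(pattern[0], {}).items():
        ends = set(times)  # timestamps at which the current prefix can end
        for item in pattern[1:]:
            later = index.get(item, {}).get(sid, [])
            ends = {t2 for t2 in later
                    if any(min_gap <= t2 - t1 <= max_gap for t1 in ends)}
            if not ends:
                break
        if ends:
            count += 1
    return count


def frequent_2_graph(index, min_sup, min_gap, max_gap):
    """Directed graph: edge a -> b exists when the 2-sequence (a, b) is frequent."""
    items = sorted(i for i in index if len(index[i]) >= min_sup)
    graph = defaultdict(dict)
    for a in items:
        for b in items:
            s = support((a, b), index, min_gap, max_gap)
            if s >= min_sup:
                graph[a][b] = s
    return graph


def mine(db, min_sup, min_gap=1, max_gap=float("inf")):
    """Grow frequent sequences by following graph edges from each pattern's last item."""
    if min_gap <= 0:
        # Assumption: strictly increasing timestamps, which also bounds pattern length.
        raise ValueError("min_gap must be positive")
    index = build_index(db)
    graph = frequent_2_graph(index, min_sup, min_gap, max_gap)
    frequent = {(a, b): s for a in graph for b, s in graph[a].items()}
    frontier = list(frequent)
    while frontier:
        next_frontier = []
        for pattern in frontier:
            # Only items reachable by a frequent edge can extend the pattern.
            for b in graph.get(pattern[-1], {}):
                candidate = pattern + (b,)
                s = support(candidate, index, min_gap, max_gap)
                if s >= min_sup:
                    frequent[candidate] = s
                    next_frontier.append(candidate)
        frontier = next_frontier
    return frequent
```

In this sketch, restricting extensions to the outgoing edges of a pattern's last item is sound because any frequent sequence must end in a frequent 2-sequence that satisfies the same gap bounds. Whether TSPM prunes in this way is not stated in the abstract.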
Last Update:
2009-05-06