Information and Media Technologies
Online ISSN : 1881-0896
ISSN-L : 1881-0896
Computing
Mining Sequential Patterns More Efficiently by Reducing the Cost of Scanning Sequence Databases
Jiahong WangYoshiaki AsanumaEiichiro KodamaToyoo TakataJie Li
Author information
JOURNAL FREE ACCESS

2007 Volume 2 Issue 1 Pages 163-177

Details
Abstract

Sequential pattern mining is a useful technique used to discover frequent subsequences as patterns in a sequence database. Depending on the application, sequence databases vary by number of sequences, number of individual items, average length of sequences, and average length of potential patterns. In addition, to discover the necessary patterns in a sequence database, the support threshold may be set to different values. Thus, for a sequential pattern-mining algorithm, responsiveness should be achieved for all of these factors. For that purpose, we propose a candidate-driven pattern-growth sequential pattern-mining algorithm called FSPM (Fast Sequential Pattern Mining). A useful property of FSPM is that the sequential patterns concerning a user-specified item can be mined directly. Extensive experimental results show that, in most cases FSPM outperforms existing algorithms. An analytical performance study shows that it is the inherent potentiality of FSPM that makes it more effective.

Content from these authors
© 2007 by Information Processing Society of Japan
Previous article Next article
feedback
Top