Sequential pattern mining using PrefixSpan with pseudoprojection and separator database

Saputra, D. and Rambli, D.R.A. and Foong, Oi Mean (2008) Sequential pattern mining using PrefixSpan with pseudoprojection and separator database. In: International Symposium on Information Technology 2008, ITSim, 26 August 2008 through 29 August 2008, Kuala Lumpur.

[thumbnail of Sequential pattern mining using PrefixSpan with pseudoprojection and separator database] PDF (Sequential pattern mining using PrefixSpan with pseudoprojection and separator database)
paper.pdf
Restricted to Registered users only

Download (12kB)

Abstract

Sequential pattern mining is a new branch of data mining science that solves inter-transaction pattern mining problems. A comprehensive performance study has been reported that PrefixSpan, one of its algorithms, outperforms GSP, SPADE, as well as FreeSpan in most cases, and PrefixSpan integrated with pseudoprojection technique is the fastest among those tested algorithms. Nevertheless, Pseudoprojection technique, which requires maintaining and visiting the in-memory sequence database frequently until all patterns are found, consumes a considerable amount of memory and induces the algorithm to undertake redundant and unnecessary checks to this copy of original database into memory when the candidate patterns are examined. In this paper, we propose Separator Database to improve PrefixSpan with pseudoprojection through early removal of uneconomical in-memory sequence database. The experimental results show that Separator Database improves PrefixSpan with pseudoprojection. Future research includes exploring the use of Separator Database in PrefixSpan with pseudoprojection to improve mining constrained sequential patterns. © 2008 IEEE.

Item Type: Conference or Workshop Item (Paper)
Uncontrolled Keywords: Decision support systems; Information management; Information technology; Mining; Separation; Separators; Candidate patterns; Comprehensive performances; Future researches; Memory sequences; Prefixspan; Sequential Pattern minings; Sequential patterns; Transaction patterns; Database systems
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Departments / MOR / COE: Departments > Computer Information Sciences
Depositing User: Foong Oi Mean
Date Deposited: 25 Feb 2010 07:49
Last Modified: 19 Jan 2017 08:26
URI: http://scholars.utp.edu.my/id/eprint/222

Actions (login required)

View Item
View Item