|
| | | |
PCITMiner- Prefix-based Closed Induced Tree Miner for finding closed induced frequent subtrees
Kutty, S., Nayak, R. and Li, Y.
Frequent subtree mining has attracted a great deal of
interest among the researchers due to its application in a
wide variety of domains. Some of the domains include bio
informatics, XML processing, computational linguistics,
and web usage mining. Despite the advances in frequent
subtree mining, mining for the entire frequent subtrees is
infeasible due to the combinatorial explosion of the
frequent subtrees with the size of the datasets. In order to
provide a reduced and concise representation without
information loss, we propose a novel algorithm,
PCITMiner (Prefix-based Closed Induced Tree Miner).
PCITMiner adopts the prefix-based pattern growth
strategy to provide the closed induced frequent subtrees
efficiently. The empirical analysis reveals that our
algorithm significantly outperforms the current state of
the art algorithm, PrefixTreeISpan. |
Cite as: Kutty, S., Nayak, R. and Li, Y. (2007). PCITMiner- Prefix-based Closed Induced Tree Miner for finding closed induced frequent subtrees. In Proc. Sixth Australasian Data Mining Conference (AusDM 2007), Gold Coast, Australia. CRPIT, 70. Christen, P., Kennedy, P. J., Li, J., Kolyshkina, I. and Williams, G. J., Eds. ACS. 151-160. |
(from crpit.com)
(local if available)
|
|