An Efficient Distributed Coverage Pattern Mining Algorithm

P Sathineni, AS Reddy, PK Reddy… - Big Data Analytics: 9th …, 2021 - Springer
Big Data Analytics: 9th International Conference, BDA 2021, Virtual Event …, 2021Springer
Mining of coverage patterns from transactional databases is one of the data mining tasks. It
has applications in banner advertising, search engine advertising and visibility computation.
In general, most real-world transactional databases are typically large. Mining of coverage
patterns from large transactional databases such as query log transactions on a single
computer is challenging and time-consuming. In this paper, we propose Distributed
Coverage Pattern Mining (DCPM) approach. In this approach, we employ a notion of the …
Abstract
Mining of coverage patterns from transactional databases is one of the data mining tasks. It has applications in banner advertising, search engine advertising and visibility computation. In general, most real-world transactional databases are typically large. Mining of coverage patterns from large transactional databases such as query log transactions on a single computer is challenging and time-consuming. In this paper, we propose Distributed Coverage Pattern Mining (DCPM) approach. In this approach, we employ a notion of the summarized form of Inverse Transactional Database (ITD) and replicate it at every node. We also employ an efficient clustering-based method to distribute the computational load of extracting coverage patterns among the Worker nodes. We performed extensive experiments using two real-world datasets and one synthetic dataset. The results show that the proposed approach significantly improves the performance over the state-of-the-art approaches in terms of execution time and data shuffled.
Springer
Showing the best result for this search. See all results