SSDMiner: A Scalable and Fast Disk-Based Frequent Pattern Miner

Cited 0 time in webofscience Cited 1 time in scopus
  • Hit : 214
  • Download : 0
Frequent itemset mining is widely used as a fundamental data mining technique. Recently, there have been proposed a number of disk-based methods. However, the existing methods still do not have a good scalability due to large-scale intermediate data and non-trivial disk I/Os. We propose SSDMiner, a new fast and scalable disk-based method for frequent itemset mining that is based on Apriori-like method and has no intermediate data and small disk I/O overheads by exploiting SSD. We propose a concept of bitmap chunks for storing transactional database in disks and a fast support counting based on bitmap chunks. Through experiments, we demonstrate that SSDMiner has the enhanced scalability and the good performance similar to that in memory-based methods with robustness. ? 2018, Springer Nature Singapore Pte Ltd.
Publisher
Springer Verlag
Issue Date
2017-08
Language
English
Citation

7th International Conference on Emerging Databases: Technologies, Applications, and Theory, EDB 2017, pp.99 - 110

ISSN
1876-1100
DOI
10.1007/978-981-10-6520-0_11
URI
http://hdl.handle.net/10203/275039
Appears in Collection
CS-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0