Enhancing the Performance of Association Rule Generation over Dynamic Data using Incremental Tree Structures
##plugins.themes.academic_pro.article.main##
Abstract
To discover a novel and dynamic approach for frequent itemsets generation and also for generating association rules is an imperative aspect in data mining. With the fast increase in databases, new transactions added, the incremental mining is acquainted to resolve the issues of maintaining association rules in updated databases. Earlier algorithms focused on this problem which consumed more time and costly to mine. This paper intends to analyze the tree construction like Frequent Pattern-tree(FP),PreOrderCoded(POC) tree and PrePostCoded(PPC) tree for sinking overheads and time constraints. To overcome theissueof updating association rules when new transactions addition this paper proposes a dynamic frequent itemsets mining approach using Incremental PreOrderCoded (IPOC)tree. This will reduce computational and scanning overheads of original dataset against addition of new transaction items and also works in an optimized way. An analysis was done on existing algorithms and compares time complexities for various standard datasets. The proposed approach shown better performance against existed ones over time and efficiency.
##plugins.themes.academic_pro.article.details##
This work is licensed under a Creative Commons Attribution 4.0 International License.
References
- Madhu V. Ahluwalia, AryyaGangopadhyay, ZhiyuanChenand Yelena Yesha,“Target-Based,Privacy Preserving, and Incremental Association Rule Mining”,Ieee Transactions on Services Computing,pp.1-14,2015.
- Yang, J., Zhang, Y., Wei, Y.: An improved vertical algorithmfor frequent itemset mining from uncertain database.In: Intelligent Human-Machine Systems and Cybernetics(IHMSC), vol. 1, pp. 355–358 (2017) DOI: https://doi.org/10.1109/IHMSC.2017.87
- Grahne, G., Zhu, J.: Fast Algorithms for Frequent Itemset Mining Using FP-Trees. Trans-actions on Knowledge and Data Engineering, 17(10), pp. 1347-1362 (2005) DOI: https://doi.org/10.1109/TKDE.2005.166
- Liu, J., Wu, Y., Zhou, Q., Fung, B.C., Chen, F., Yu, B.: Parallel Eclat for Opportunistic Mining of Frequent Itemsets. In: Int. Conf. on Database and Expert Systems Applications, pp. 401-415 (2015) DOI: https://doi.org/10.1007/978-3-319-22849-5_27
- Elfangary, L.,Atteya, W.A.: “Mining Medical Database using Proposed Incremental Association rule Algorithm (PIA)”; Second International Conference on the Digital Society, IEEE (2008).. DOI: https://doi.org/10.1109/ICDS.2008.10
- Y. Li, Z.-H. Zhang, W.-B. Chen, F. Min, “TDUP: an approach to incremental mining of frequent itemsetswith three-way-decision pattern updating”, Int. J. Mach. Learn. & Cyber.DOI 10.1007/s13042-015-0337-6, 2015. DOI: https://doi.org/10.1007/s13042-015-0337-6
- Mohammed M. Fouad, Mostafa G.M. Mostafa, “IndxTAR: An Efficient Algorithm for Indexed Mining of Incremental Temporal Association Rules”,International Journal of Computer Information Systems and Industrial Management Applications. ISSN 2150-7988 Volume 9, pp.103-113, 2017.
- Jiaojiao Sun, YalingXun et al, “Incremental Frequent Itemsets Miningwith FCFP Tree”, IEEE Access, 2019.
- Lee G, Yun U, Ryang H (2015) Mining weighted erasable patterns by using underestimated constraint-based pruning technique. J Intell Fuzzy Syst 28(3):1145–1157. DOI: https://doi.org/10.3233/IFS-141398
- Y. Unil, G. Lee, "Incremental mining of weighted maximal frequentitemsets from dynamic databases" Expert Systems with Applications,vol. 54, pp. 304–327, 2016. DOI: https://doi.org/10.1016/j.eswa.2016.01.049
- Nori F, Deypir M, Sadreddini MH (2013) A sliding-window based algorithm for frequent closed itemset mining over data streams. J SystSoftw 86(3):615–623 DOI: https://doi.org/10.1016/j.jss.2012.10.011
- Le T, Vo B, Nguyen G (2014) A survey of erasable itemset mining algorithms. WIREs: Data Min KnowlDiscov 4(5):356–379 DOI: https://doi.org/10.1002/widm.1137
- RatchadapornAmornchewin, WorapojKreesuradej, “Mining Dynamic Databases using Probability-BasedIncremental Association Rule Discovery Algorithm”, Journal of Universal Computer Science, vol. 15, no. 12 (2009), 2409-2428.
- Yu-Geng Song, Hui-Min Cui and Xiao-Bing Feng, “Parallel Incremental Frequent Itemset Mining for Large Data”, Journal of Computer Science and Technology 32(2): 368–385 Mar. 2017. DOI: https://doi.org/10.1007/s11390-017-1726-y
- RatchadapornAmornchewin, and WorapojKreesuradej,“False Positive Item set Algorithm for Incremental Association RuleDiscovery”, International Journal of Multimedia and Ubiquitous EngineeringVol. 4, No. 2,pp.1-14, April, 2009.
- Qiu, Ping, Long Zhao, and Xiangjun Dong, "NegI-NSP: Negativesequential pattern mining based on loose constraints," IECON2017-43rd Annual Conference of the IEEE Industrial ElectronicsSociety. IEEE, pp. 3419–3425, 2017. DOI: https://doi.org/10.1109/IECON.2017.8216579
- Zhi-Hong Deng , Sheng-Long Lv, “PrePost+: An efficient N-lists-based algorithm for mining frequentitemsets via Children–Parent Equivalence pruning”, Expert Systems with Applications 42 (2015) 5424–5432. DOI: https://doi.org/10.1016/j.eswa.2015.03.004
- Zhi-Hong Deng, Sheng-Long Lv, “Fast mining frequent itemsets using Nodesets”, Expert Syst. Appl. 41(10): 4505-4512 (2014). DOI: https://doi.org/10.1016/j.eswa.2014.01.025
- David W. Cheung, Jiawei Han, Vincent T. Ng and C. Y. Wong, “Maintenance of Discovered Association Rules in Large Databases: An Incremental Updating Technique”, Proceedings of the Twelfth International Conference on Data Engineering, February 26 - March 1, 1996, New Orleans, Louisiana, pp. 106-114.
- Tsai PSM (2010) Mining top-k frequent closed itemsets over data streams using the sliding-window model. ExpSystAppl 37(10):6968–6973 DOI: https://doi.org/10.1016/j.eswa.2010.03.023
- Lichun Li, Rongxing Lu, Kim-Kwang Raymond Choo, AnwitamanDatta, and Jun Shao. (2016), “Privacy-Preserving-Outsourced Association Rule Mining on Vertically Partitioned Databases”, IEEE Transactions on Information Forensics and Security. 11 (8), p1-15. DOI: https://doi.org/10.1109/TIFS.2016.2561241
- Abdelhamid, E., Canim, M., Sadoghi, M., Bhattacharjee, B., Chang, Y.C. and Kalnis, P., 2017. Incremental frequent subgraph mining on large evolving graphs. IEEE Transactions on Knowledge and Data Engineering, 29(12), pp.2710-2723. DOI: https://doi.org/10.1109/TKDE.2017.2743075
- Zhang, S., Du, Z. and Wang, J.T., 2014. New techniques for mining frequent patterns in unordered trees. IEEE transactions on cybernetics, 45(6), pp.1113-1125. DOI: https://doi.org/10.1109/TCYB.2014.2345579
- Zhi-Jun X, Hong C, Li C (2006) An efficient algorithm for frequent itemset mining on data streams. In: ICDM’06, pp 474–491 DOI: https://doi.org/10.1007/11790853_37
- C.F. Ahmed, S.K. Tanbeer, B.S. Jeong, Y.K. Lee, Efficient tree structures for highutility pattern mining in incremental databases, IEEE Trans. Knowl. Data Eng.21 (2009) 1708–1721. DOI: https://doi.org/10.1109/TKDE.2009.46
- Li YC, Yeh JS, Chang CC (2005) Efficient algorithms for mining share-frequent itemsets. In: The world congress of international fuzzy systems association, pp 539–543
- MortezaZihayat, Yan Chen ,Aijun An, “Memory-Adaptive High Utility Sequential Pattern Mining Over Data Stream”, Published in Springer Journal on Machine Learning. Vol. 106. Issue 6. pp.799–836, 2017. DOI: https://doi.org/10.1007/s10994-016-5617-1