Efficient High-utility Itemset Mining Based on a Novel Data Structure

Shen, Wei / Zhang, Chao / Fang, Wei / Zhang, Xin / Zhan, Zhi-Hui / Lin, Jerry Chun-Wei

High-utility itemset mining (HUIM) is an important task in data mining. By considering quantity and profit rather than frequency, HUIM can find the most profitable products. One-phase HUIM algorithms based on the utility-list structure are among the most efficient because they mine high-utility itemsets (HUIs) without generating candidates. However, storing itemset information in utility lists consumes considerable time and memory, especially on dense datasets with long transactions. To address this problem, this paper proposes an HUIM algorithm based on a novel data structure. The structure reorganizes the transaction database so that all HUIs can be found effectively while memory usage during the depth-first search is reduced. Based on this structure, two upper bounds, based on the extension utility and the local transaction-weighted utility, are introduced to greatly reduce the search space in both width and depth. Experiments are carried out on dense and sparse benchmark datasets. Compared with several state-of-the-art HUIM algorithms, the proposed algorithm achieves better running time and memory usage when mining HUIs.
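To make the notions of utility and utility-based pruning concrete, the following is a minimal Python sketch of the standard textbook definitions: the utility of an itemset (quantity times unit profit, summed over the transactions that contain it) and the classic transaction-weighted utility (TWU) upper bound. It is not the data structure or the upper bounds proposed in the paper, which the abstract does not specify. The toy database, the profit table, and all function names are hypothetical, and the search is a naive enumeration rather than a one-phase algorithm.

```python
from itertools import combinations

# Hypothetical toy database: each transaction maps item -> purchased quantity.
transactions = [
    {"a": 1, "b": 2, "c": 1},
    {"a": 2, "c": 3},
    {"b": 1, "c": 1, "d": 2},
]
# Hypothetical unit profit of each item.
profit = {"a": 5, "b": 2, "c": 1, "d": 4}


def utility(itemset):
    """Total utility of an itemset: sum of quantity * profit over every
    transaction that contains all of its items."""
    total = 0
    for t in transactions:
        if all(item in t for item in itemset):
            total += sum(t[item] * profit[item] for item in itemset)
    return total


def transaction_utility(t):
    """Utility of a whole transaction."""
    return sum(qty * profit[item] for item, qty in t.items())


def twu(itemset):
    """Transaction-weighted utility: sum of the utilities of all transactions
    containing the itemset. It upper-bounds the utility of the itemset and of
    every superset, so it is safe to prune on."""
    return sum(
        transaction_utility(t)
        for t in transactions
        if all(item in t for item in itemset)
    )


def mine_huis(min_util):
    """Naive miner: drop items whose TWU is below min_util, then enumerate
    all combinations of the remaining items and keep the high-utility ones."""
    items = sorted(i for i in profit if twu((i,)) >= min_util)
    result = {}
    for size in range(1, len(items) + 1):
        for itemset in combinations(items, size):
            u = utility(itemset)
            if u >= min_util:
                result[itemset] = u
    return result


print(mine_huis(15))  # {('a',): 15, ('a', 'c'): 19}
```

With a minimum utility of 15, item `d` has a TWU of 11 and is discarded before enumeration, which illustrates how an upper bound shrinks the search space. Utility itself is not anti-monotone (here `{c}` has utility 5 while `{a, c}` has 19), which is why HUIM algorithms rely on upper bounds rather than on the utility directly.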

Document information

Title:

Efficient High-utility Itemset Mining Based on a Novel Data Structure

Contributors:

Shen, Wei (author) / Zhang, Chao (author) / Fang, Wei (author) / Zhang, Xin (author) / Zhan, Zhi-Hui (author) / Lin, Jerry Chun-Wei (author)

Published in:

2021 IEEE International Smart Cities Conference (ISC2) ; 1-6

Publication date:

2021-09-07

Size:

5490794 bytes

ISBN:

978-1-6654-4919-9

DOI:

https://doi.org/10.1109/ISC253183.2021.9562788

Type of media:

Conference paper

Type of material:

Electronic Resource

Language:

English

Similar titles

An Effective Gradual Data-Reduction Strategy for Fuzzy Itemset Mining

Hong, T.-P. / Lan, G.-C. / Lin, Y.-H. et al. | British Library Online Contents | 2013

Detecting changes in spatial characteristics of Colorado human-caused wildfires using APRIORI-based frequent itemset mining

Cleland, Zachary W. / Dao, Khac An / Dao, Thi Hong Diep | Elsevier | 2023

Mining Customer Data to Better Inform Utility Decision-Making

Vickers, A. / Tiger, M.W. / Eskaf, S. et al. | British Library Conference Proceedings | 2013

Asset deterioration analysis using multi-utility data and multi-objective data mining

Savic, D.A. / Giustolisi, O. / Laucelli, D. | British Library Online Contents | 2009

An Efficient Tree-based Fuzzy Data Mining Approach

Lin, C.-W. / Hong, T.-P. / Lu, W.-H. | British Library Online Contents | 2010