Efficient High-utility Itemset Mining Based on a Novel Data Structure
High-utility itemset mining (HUIM) is an important task in data mining. HUIM finds the most profitable products by considering quantity and profit rather than frequency alone. One-phase HUIM algorithms based on the utility-list structure are among the most efficient, since they can mine high-utility itemsets (HUIs) without generating candidates. However, storing itemset information in utility lists is time-consuming and memory-consuming, especially on dense datasets with long transactions. To address this problem, this paper proposes an HUIM algorithm based on a novel data structure. The structure is designed by reorganizing the transaction database so that all HUIs can be obtained effectively while memory usage is reduced during the depth-first search. Based on this structure, two upper bounds, derived from the extension utility and the local transaction-weighted utility, are introduced to greatly reduce the search space in both width and depth. Experiments are carried out on dense and sparse benchmark datasets. Compared with some state-of-the-art HUIM algorithms, the proposed algorithm achieves better performance for mining HUIs in terms of running time and memory usage.
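To make the terms in the abstract concrete, the following is a minimal sketch of what "utility" and a transaction-weighted utility (TWU) upper bound mean in HUIM. It is not the paper's data structure or its two proposed bounds; it is a brute-force enumeration that uses only the classic global TWU bound to discard unpromising single items. The toy database, the profit table, and all function names are hypothetical and chosen for illustration.

```python
from itertools import combinations

# Hypothetical toy database: each transaction maps item -> purchased quantity.
transactions = [
    {"a": 1, "b": 2, "c": 1},
    {"a": 2, "c": 3},
    {"b": 1, "c": 2, "d": 1},
]
# Hypothetical unit profit of each item.
profit = {"a": 5, "b": 2, "c": 1, "d": 4}


def transaction_utility(tx):
    """Total utility of one transaction: sum of quantity * unit profit."""
    return sum(qty * profit[item] for item, qty in tx.items())


def itemset_utility(itemset):
    """Utility of an itemset: summed over transactions containing all its items."""
    total = 0
    for tx in transactions:
        if all(item in tx for item in itemset):
            total += sum(tx[item] * profit[item] for item in itemset)
    return total


def twu(itemset):
    """Transaction-weighted utility: an upper bound on the utility of the
    itemset and of every superset of it."""
    return sum(
        transaction_utility(tx)
        for tx in transactions
        if all(item in tx for item in itemset)
    )


def mine_huis(min_util):
    """Brute-force search over items that survive TWU pruning (illustrative only)."""
    # An item whose TWU is below the threshold cannot appear in any HUI.
    promising = sorted(i for i in profit if twu((i,)) >= min_util)
    result = {}
    for size in range(1, len(promising) + 1):
        for itemset in combinations(promising, size):
            utility = itemset_utility(itemset)
            if utility >= min_util:
                result[itemset] = utility
    return result


print(mine_huis(15))  # {('a',): 15, ('a', 'c'): 19}
```

With a minimum utility of 15, item `d` is pruned because its TWU is 8, and the itemset `('a', 'c')` is high-utility even though neither `b` nor `c` is on its own. Tighter bounds, such as those the abstract describes, aim to prune far more of the search space than this global TWU check does.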
Shen, Wei (author) / Zhang, Chao (author) / Fang, Wei (author) / Zhang, Xin (author) / Zhan, Zhi-Hui (author) / Lin, Jerry Chun-Wei (author)
2021-09-07
5490794 byte
Conference paper
Electronic Resource
English
An Effective Gradual Data-Reduction Strategy for Fuzzy Itemset Mining
British Library Online Contents | 2013
Mining Customer Data to Better Inform Utility Decision-Making
British Library Conference Proceedings | 2013
Asset deterioration analysis using multi-utility data and multi-objective data mining
British Library Online Contents | 2009
An Efficient Tree-based Fuzzy Data Mining Approach
British Library Online Contents | 2010