Information, Vol. 9, No. 5 (2018)
ARTICLE
TITLE

Fast Identification of High Utility Itemsets from Candidates

Jun-Feng Qu    
Mengchi Liu    
Chunsheng Xin and Zhongbo Wu    

Abstract

High utility itemsets (HUIs) are sets of items with high utility, such as profit, in a database. Efficient mining of HUIs is an important problem in data mining. Many mining algorithms adopt a two-phase framework: they first generate a set of candidate itemsets by roughly overestimating the utilities of all itemsets in a database, and then compute the exact utility of each candidate to identify HUIs. The major costs in these algorithms therefore come from candidate generation and utility computation. To the best of our knowledge, previous work has focused mainly on reducing the number of candidates and has paid little attention to utility computation. However, we find that, for a mining task, utility computation dominates the total running time of two-phase algorithms, so optimizing it is important. In this paper, we first give a basic algorithm for HUI identification, the core of which is a utility computation procedure. We then propose a novel candidate tree structure for storing candidate itemsets and develop a candidate tree-based algorithm for fast HUI identification that includes an efficient utility computation procedure. Extensive experimental results show that the candidate tree-based algorithm outperforms the basic algorithm, and that the performance of two-phase algorithms can be significantly improved by integrating the candidate tree-based algorithm as their second step.
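
The second phase described in the abstract, computing the exact utility of each candidate, can be illustrated with a minimal Python sketch. This is a naive scan-based version written only for illustration; it is not the paper's basic algorithm or its candidate tree-based algorithm. The function names (`itemset_utility`, `identify_huis`), the toy database, and the threshold are all assumptions, and the sketch assumes the usual definition in which an itemset is an HUI when its utility is no less than a minimum utility threshold.

```python
from typing import Dict, FrozenSet, Iterable, List

# A transaction maps each item to its utility in that transaction
# (for example, quantity times unit profit). Hypothetical representation.
Transaction = Dict[str, int]


def itemset_utility(itemset: FrozenSet[str], database: List[Transaction]) -> int:
    """Sum the utility of `itemset` over every transaction that contains it."""
    total = 0
    for tx in database:
        # Only transactions containing all items of the itemset contribute.
        if all(item in tx for item in itemset):
            total += sum(tx[item] for item in itemset)
    return total


def identify_huis(
    candidates: Iterable[FrozenSet[str]],
    database: List[Transaction],
    min_util: int,
) -> Dict[FrozenSet[str], int]:
    """Keep the candidates whose exact utility reaches `min_util`."""
    result = {}
    for candidate in candidates:
        utility = itemset_utility(candidate, database)
        if utility >= min_util:
            result[candidate] = utility
    return result


# Toy data, invented for this example (not taken from the paper).
database = [
    {"a": 5, "b": 2, "c": 1},
    {"a": 10, "c": 6},
    {"b": 4, "c": 3, "d": 8},
]
candidates = [frozenset("ac"), frozenset("bc"), frozenset("d")]
huis = identify_huis(candidates, database, min_util=15)
```

Each candidate here triggers a full pass over the database, so the cost grows with the number of candidates times the number of transactions. That is the kind of utility computation overhead the abstract identifies as dominant in two-phase algorithms.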
