Applied Sciences, Vol. 14, No. 8 (2024)
ARTICLE
TITLE

FSM-BC-BSP: Frequent Subgraph Mining Algorithm Based on BC-BSP

Fangling Leng    
Fan Li    
Yubin Bao    
Tiancheng Zhang and Ge Yu    

Abstract

As graph models become increasingly prevalent in scientific data processing, effective methods for mining meaningful patterns from large-scale graphs have attracted significant research attention. This paper examines the complexity of frequent subgraph mining (FSM) and proposes an FSM algorithm, named FSM-BC-BSP, for mining frequent subgraphs within a single large graph. The algorithm is developed within a distributed iterative graph processing system that is designed for the Big Cloud (BC) environment of China Mobile Corp. and is based on the bulk synchronous parallel (BSP) model. A message sending and receiving mechanism is used to share data across the stages of the mining algorithm. Each subgraph is given a standard code, and standard-coded subgraphs are sent to the same node, where their global support on the large graph is calculated. Candidate subgraphs are generated with the rightmost path expansion strategy, which reduces the number of redundant subgraphs. Standard coding identifies each subgraph uniquely and thus eliminates the need for isomorphism calculations. Support is calculated with the minimum image-based (MNI) measure, which satisfies the downward closure property. The experimental results show that FSM-BC-BSP performs robustly across diverse input datasets and parameter configurations, and that it performs particularly well at low support thresholds.
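
The MNI support measure named in the abstract can be illustrated with a small single-machine sketch. It is not the paper's distributed implementation: the function name `mni_support` and the representation of an embedding as a dictionary from pattern vertex to data-graph vertex are assumptions made here for illustration. The sketch follows the usual definition of MNI support, which is the smallest number of distinct data-graph vertices that any single pattern vertex is mapped to across all embeddings.

```python
from typing import Dict, Hashable, Iterable, Set


def mni_support(embeddings: Iterable[Dict[Hashable, Hashable]]) -> int:
    """Minimum image-based (MNI) support of a pattern in a single graph.

    Each embedding maps every pattern vertex to the data-graph vertex it
    matches. The dictionary representation is a hypothetical choice for
    this sketch, not the format used by FSM-BC-BSP.
    """
    # For each pattern vertex, collect the distinct graph vertices it maps to.
    images: Dict[Hashable, Set[Hashable]] = {}
    for embedding in embeddings:
        for pattern_vertex, graph_vertex in embedding.items():
            images.setdefault(pattern_vertex, set()).add(graph_vertex)
    if not images:
        return 0  # no embeddings, so the pattern does not occur
    # The support is the size of the smallest image set.
    return min(len(vertices) for vertices in images.values())


# A two-vertex pattern (a - b) with three embeddings that all reuse graph
# vertex 1 for pattern vertex "a".
star_embeddings = [{"a": 1, "b": 2}, {"a": 1, "b": 3}, {"a": 1, "b": 4}]

# The same pattern with two vertex-disjoint embeddings.
disjoint_embeddings = [{"a": 1, "b": 2}, {"a": 3, "b": 4}]

print(mni_support(star_embeddings))
print(mni_support(disjoint_embeddings))
```

In the first example the pattern has three embeddings, but pattern vertex "a" always maps to the same graph vertex, so the MNI support is bounded by that single image. Counting distinct images per vertex rather than raw embeddings is what keeps the measure from increasing when a pattern is extended, which is the downward closure property the abstract refers to.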
