Information, Vol. 14, No. 11 (2023)
ARTICLE
TITLE

CoDiS: Community Detection via Distributed Seed Set Expansion on Graph Streams

Austin Anderson, Petros Potikas and Katerina Potika

Abstract

Community detection has been, and remains, an important topic in fields ranging from marketing and social networking to biology. Early research classified each node into a single community (non-overlapping communities); later work allowed a node to belong to several communities (overlapping communities). Community detection has always been time-consuming, and modern datasets are too large to process realistically with traditional methods. Recent methods have therefore turned to parallelism and to graph stream models, in which the edge list is accessed one edge at a time. Although these methods significantly reduce processing time, they still have several shortcomings. We propose a new parallel algorithm, community detection with seed sets (CoDiS), which solves the overlapping community detection problem in graph streams. Initially, the community memberships of some nodes (the seed sets) are known, and the aim is to expand these communities by processing one edge at a time. The innovation of our approach is that it splits the communities among the parallel computation workers, so that each worker updates only a subset of all the communities. This reduces the edge-processing work each worker must do and the time each worker spends on each edge. Crucially, it removes the need for every worker to have access to every community. Experimental results show a significant improvement in running time with no loss of accuracy.
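
The partitioning idea described in the abstract can be illustrated with a minimal Python sketch. It is not the CoDiS algorithm itself: the function names, the round-robin assignment of communities to workers, and the `min_links` membership rule (a node joins a community once enough streamed edges connect it to current members) are all assumptions made for illustration. Workers are simulated one after another here; in a real deployment each would run in parallel on its own share of the communities.

```python
from collections import defaultdict


def partition_communities(seed_sets, num_workers):
    """Assign each community to exactly one worker (round-robin, an assumption)."""
    parts = [dict() for _ in range(num_workers)]
    for i, cid in enumerate(sorted(seed_sets)):
        # Copy the seed set so the caller's data is not mutated.
        parts[i % num_workers][cid] = set(seed_sets[cid])
    return parts


def expand_on_stream(communities, edges, min_links=2):
    """One worker's pass over the edge stream, touching only its own communities.

    Hypothetical rule: a node joins a community after `min_links` streamed
    edges have connected it to nodes that were members at the time.
    """
    links = defaultdict(int)  # (community id, node) -> edges seen into the community
    for u, v in edges:  # edges are read one at a time
        for cid, members in communities.items():
            for inside, outside in ((u, v), (v, u)):
                if inside in members and outside not in members:
                    links[(cid, outside)] += 1
                    if links[(cid, outside)] >= min_links:
                        members.add(outside)
    return communities


def run(seed_sets, edges, num_workers=2, min_links=2):
    """Simulate the workers sequentially and merge their disjoint results."""
    edges = list(edges)
    result = {}
    for part in partition_communities(seed_sets, num_workers):
        result.update(expand_on_stream(part, edges, min_links))
    return result


if __name__ == "__main__":
    seeds = {"A": {1, 2}, "B": {5, 6}}
    stream = [(1, 3), (2, 3), (3, 4), (5, 4), (6, 4), (1, 4)]
    print(run(seeds, stream))
```

Because every community belongs to exactly one worker, the workers never write to the same community, so their results can be merged without coordination. Node 4 in the example ends up in both communities, which is the overlapping case.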

Similar articles

Marisa Magno, Ana Isabel Martins, Joana Pais, Anabela G. Silva and Nelson Pacheco Rocha    
The early detection of cognitive impairment is essential in order to initiate interventions and guarantee access to healthcare services. Digital solutions are emerging in the literature as an alternative approach to cognitive screening. Our primary goal ...
Journal: Applied Sciences

 
Jiju Guo, Wengeng Cao, Guohui Lang, Qifa Sun, Tian Nan, Xiangzhi Li, Yu Ren and Zeyan Li    
The presence of high concentrations of geogenic arsenic (As) in groundwater poses a serious threat to the health of millions of individuals globally. This paper examines the research progress of groundwater with high concentrations of geogenic As through...
Journal: Water

 
Andra Sandu, Ioana Ioanăș, Camelia Delcea, Laura-Madalina Geanta and Liviu-Adrian Cotfas
The proliferation of misinformation presents a significant challenge in today's information landscape, impacting various aspects of society. While misinformation is often confused with terms like disinformation and fake news, it is crucial to distinguish...
Journal: Information

 
Jian Huang and Yijun Gu    
Community detection is an important task in the analysis of complex networks, which is significant for mining and analyzing the organization and function of networks. As an unsupervised learning algorithm based on the particle competition mechanism, stoc...
Journal: Applied Sciences

 
Seunghyun Lee, Jiho Lee, Jae-Min Lee, Hong-Woo Chun and Janghyeok Yoon    
Social issues refer to topics that occur and become increasingly focused in various areas of society. Because of the evolutionary pattern of issues, detecting social issues requires monitoring various stories formed by members of society over time. Vario...
Journal: Applied Sciences