Inicio  /  Future Internet  /  Vol: 13 Par: 11 (2021)  /  Artículo
ARTÍCULO
TITULO

Detection of Hidden Communities in Twitter Discussions of Varying Volumes

Ivan Blekanov    
Svetlana S. Bodrunova and Askar Akhmetov    

Resumen

The community-based structure of communication on social networking sites has long been a focus of scholarly attention. However, the problem of discovery and description of hidden communities, including defining the proper level of user aggregation, remains an important problem not yet resolved. Studies of online communities have clear social implications, as they allow for assessment of preference-based user grouping and the detection of socially hazardous groups. The aim of this study is to comparatively assess the algorithms that effectively analyze large user networks and extract hidden user communities from them. The results we have obtained show the most suitable algorithms for Twitter datasets of different volumes (dozen thousands, hundred thousands, and millions of tweets). We show that the Infomap and Leiden algorithms provide for the best results overall, and we advise testing a combination of these algorithms for detecting discursive communities based on user traits or views. We also show that the generalized K-means algorithm does not apply to big datasets, while a range of other algorithms tend to prioritize the detection of just one big community instead of many that would mirror the reality better. For isolating overlapping communities, the GANXiS algorithm should be used, while OSLOM is not advised.

 Artículos similares

       
 
Ye Li and Hongxiang Ren    
The widespread of shipborne Automatic Identification System (AIS) equipment will continue to produce a large amount of spatiotemporal trajectory data. In order to explore and understand the hidden behaviour patterns in the data, an interactive visual ana... ver más

 
Xiaorong Gao, Haowen Yan, Xiaomin Lu and Pengbo Li    
The major reason that the fully automated generalization of residential areas has not been achieved to date is that it is difficult to acquire the knowledge that is required for automated generalization and for the calculation of spatial similarity degre... ver más

 
Ahmed Latif Yaser, Hamdy M. Mousa and Mahmoud Hussein    
Software-defined networking (SDN) is an innovative network paradigm, offering substantial control of network operation through a network?s architecture. SDN is an ideal platform for implementing projects involving distributed applications, security solut... ver más
Revista: Future Internet

 
Belle Fille Murorunkwere, Origene Tuyishimire, Dominique Haughton and Joseph Nzabanita    
Detecting tax fraud is a top objective for practically all tax agencies in order to maximize revenues and maintain a high level of compliance. Data mining, machine learning, and other approaches such as traditional random auditing have been used in many ... ver más
Revista: Future Internet

 
Israa Kadhim and Fanar M. Abed    
With the increasing demands to use remote sensing approaches, such as aerial photography, satellite imagery, and LiDAR in archaeological applications, there is still a limited number of studies assessing the differences between remote sensing methods in ... ver más