Redirigiendo al acceso original de articulo en 19 segundos...
Inicio  /  Informatics  /  Vol: 11 Par: 1 (2024)  /  Artículo
ARTÍCULO
TITULO

Topic Extraction: BERTopic?s Insight into the 117th Congress?s Twitterverse

Margarida Mendonça and Álvaro Figueira    

Resumen

As social media (SM) becomes increasingly prevalent, its impact on society is expected to grow accordingly. While SM has brought positive transformations, it has also amplified pre-existing issues such as misinformation, echo chambers, manipulation, and propaganda. A thorough comprehension of this impact, aided by state-of-the-art analytical tools and by an awareness of societal biases and complexities, enables us to anticipate and mitigate the potential negative effects. One such tool is BERTopic, a novel deep-learning algorithm developed for Topic Mining, which has been shown to offer significant advantages over traditional methods like Latent Dirichlet Allocation (LDA), particularly in terms of its high modularity, which allows for extensive personalization at each stage of the topic modeling process. In this study, we hypothesize that BERTopic, when optimized for Twitter data, can provide a more coherent and stable topic modeling. We began by conducting a review of the literature on topic-mining approaches for short-text data. Using this knowledge, we explored the potential for optimizing BERTopic and analyzed its effectiveness. Our focus was on Twitter data spanning the two years of the 117th US Congress. We evaluated BERTopic?s performance using coherence, perplexity, diversity, and stability scores, finding significant improvements over traditional methods and the default parameters for this tool. We discovered that improvements are possible in BERTopic?s coherence and stability. We also identified the major topics of this Congress, which include abortion, student debt, and Judge Ketanji Brown Jackson. Additionally, we describe a simple application we developed for a better visualization of Congress topics.

 Artículos similares

       
 
Saif Alzabeebee, Safaa Manfi Alshibany, Suraparb Keawsawasvong and Davide Forcellini    
Tire-derived aggregate (TDA) has been proposed in recent studies to be considered as part of backfill soil to reduce stress and strain developed in buried pipes. However, little attention is paid to checking the influence of TDA on the behavior of concre... ver más
Revista: Infrastructures

 
Md. Rashadur Rahman, Mohammad Shamsul Arefin, Sanjida Rahman, Afsana Ahmed, Tahsina Islam, Pranab Kumar Dhar and Oh-Jin Kwon    
Recent advancements in high-speed communications and high-capacity computing systems have contributed to major growth in the data volume of databases. Data mining is a crucial part of information retrieval; it is often termed as database knowledge discov... ver más
Revista: Applied Sciences

 
Marco Roveri, Sara Goidanich and Lucia Toniolo    
During the last ten years, photocatalytic nanocomposites combining titania nanoparticles with silicon-based matrices have received increasing attention in the stone conservation research field, because they offer an effective multifunctional approach to ... ver más
Revista: Coatings

 
Vuk Uskokovic    
Despite decades of research into the interaction between cells and nanoparticles, there is a lack of consensus regarding how specific physicochemical characteristics of the nanoparticles, including chemical composition, crystallinity, size, morphology, c... ver más
Revista: Applied Sciences

 
Gennaro Vessio    
Studying the effects of neurodegeneration on handwriting has emerged as an interdisciplinary research topic and has attracted considerable interest from psychologists to neuroscientists and from physicians to computer scientists. The complexity of handwr... ver más
Revista: Applied Sciences