ARTICLE
TITLE

Analysis of Geotagging Behavior: Do Geotagged Users Represent the Twitter Population?

Amir Karami    
Rachana Redd Kadari    
Lekha Panati    
Siva Prasad Nooli    
Harshini Bheemreddy and Parisa Bozorgi    

Abstract

Twitter's APIs are now the main data source for social media researchers, and a large number of studies have used Twitter data for diverse research interests. Twitter users can share their precise real-time location, and Twitter APIs can provide this information as longitude and latitude. These geotagged Twitter data can help to study human activities and movements for different applications. Compared to the mostly small-scale data samples in domains such as social science, collecting geotagged data offers large samples. A fundamental question is whether geotagged users can represent non-geotagged users. While some studies have investigated this question from different perspectives, they did not examine the profile information and tweet contents of geotagged and non-geotagged users. This empirical study addresses this limitation by applying text mining, statistical analysis, and machine learning techniques to Twitter data comprising more than 88,000 users and over 170 million tweets. Our findings show a significant difference (p-value < 0.001) between geotagged and non-geotagged users in 73% of the features obtained from the users' profiles and tweets. The features can also help to distinguish between geotagged and non-geotagged users with around 80% accuracy. This research illustrates that geotagged users do not represent the Twitter population.
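The per-feature comparison summarized in the abstract (testing each feature for a difference between the two groups, then reporting the share of features with p-value < 0.001) can be illustrated with a minimal sketch. The abstract does not name the statistical test used, so the permutation test on the difference in means below is an assumption chosen for simplicity. The function names, the feature names, and the toy values are all hypothetical and are not data from the study.

```python
import random
from statistics import mean


def permutation_p_value(group_a, group_b, n_permutations=2000, seed=0):
    """Two-sided permutation test on the difference in means.

    Assumption: a stand-in for whatever test the study actually used.
    """
    rng = random.Random(seed)
    observed = abs(mean(group_a) - mean(group_b))
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    extreme = 0
    for _ in range(n_permutations):
        # Reassign group labels at random and recompute the difference.
        rng.shuffle(pooled)
        diff = abs(mean(pooled[:n_a]) - mean(pooled[n_a:]))
        if diff >= observed:
            extreme += 1
    # Add-one correction keeps the estimate strictly positive.
    return (extreme + 1) / (n_permutations + 1)


def share_of_significant_features(geotagged, non_geotagged, alpha=0.001, **kwargs):
    """Return (fraction of features with p < alpha, list of those features)."""
    significant = [
        name
        for name in geotagged
        if permutation_p_value(geotagged[name], non_geotagged[name], **kwargs) < alpha
    ]
    return len(significant) / len(geotagged), significant


# Toy values invented purely to exercise the code (not from the study).
geotagged = {
    "followers": [100 + i for i in range(30)],
    "tweet_length": [i % 10 for i in range(30)],
}
non_geotagged = {
    "followers": [10 + i for i in range(30)],
    "tweet_length": [i % 10 for i in range(30)],
}

share, features = share_of_significant_features(geotagged, non_geotagged)
print(f"{share:.0%} of features differ at p < 0.001: {features}")
```

With 2,000 permutations the smallest reachable p-value is 1/2001, just under the 0.001 threshold. A real analysis over many features would need more permutations, or an analytic test, and likely a correction for multiple comparisons.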
