Redirigiendo al acceso original de articulo en 15 segundos...
Inicio  /  Information  /  Vol: 10 Par: 11 (2019)  /  Artículo
ARTÍCULO
TITULO

Analysis of Data Persistence in Collaborative Content Creation Systems: The Wikipedia Case

Lorenzo Bracciale    
Pierpaolo Loreti    
Andrea Detti and Nicola Blefari Melazzi    

Resumen

A very common problem in designing caching/prefetching systems, distribution networks, search engines, and web-crawlers is determining how long a given content lasts before being updated, i.e., its update frequency. Indeed, while some content is not frequently updated (e.g., videos), in other cases revisions periodically invalidate contents. In this work, we present an analysis of Wikipedia, currently the 5th most visited website in the world, evaluating the statistics of updates of its pages and their relationship with page view statistics. We discovered that the number of updates of a page follows a lognormal distribution. We provide fitting parameters as well as a goodness of fit analysis, showing the statistical significance of the model to describe the empirical data. We perform an analysis of the views?updates relationship, showing that in a time period of a month, there is a lack of evident correlation between the most updated pages and the most viewed pages. However, observing specific pages, we show that there is a strong correlation between the peaks of views and updates, and we find that in more than 50% of cases, the time difference between the two peaks is less than a week. This reflects the underlying process whereby an event causes both an update and a visit peak that occurs with different time delays. This behavior can pave the way for predictive traffic analysis applications based on content update statistics. Finally, we show how the model can be used to evaluate the performance of an in-network caching scenario.

 Artículos similares

       
 
J. Madhu Babu,K. M. M. Krishna     Pág. 68 - 78
Television in India has proven a most influential infotainment media powerful and popular among its audience. Television plays a vital role in the telecast entertaining program. Fiction has been a popular genre on Indian Television. A common habit among ... ver más

 
Michal Dominik Stasiak     Pág. 39 - 45
An exchange rate between two currencies can be described in a binary representation. The binarization algorithm transforms the exchange rate represented by tick data into a binary string. Each course change equal to a given discretization unit is assigne... ver más

 
Osareme Erhomosele     Pág. 130 - 144
AbstractInvestigations into the relationship between capital structure and firm performance over the years have consistently produced mixed results in the light of prevailing theories relevant to the concept of capital structure. The study examined the n... ver más

 
Margarita Garcia-Vila, Rodrigo Morillo-Velarde and Elias Fereres    
Process-based crop models such as AquaCrop are useful for a variety of applications but must be accurately calibrated and validated. Sugar beet is an important crop that is grown in regions under water scarcity. The discrepancies and uncertainty in past ... ver más
Revista: Water

 
Benjamin Bett Cheruiyot     Pág. 88 - 97
The focus of this study was to investigate the influence of training strategies on employee performance in public university campuses in Kericho County, Kenya. The study was motivated by concerns on employee performance in public university campuses desp... ver más