Inicio  /  Algorithms  /  Vol: 13 Par: 7 (2020)  /  Artículo
ARTÍCULO
TITULO

An Algorithm for Density Enrichment of Sparse Collaborative Filtering Datasets Using Robust Predictions as Derived Ratings

Dionisis Margaris    
Dimitris Spiliotopoulos    
Gregory Karagiorgos and Costas Vassilakis    

Resumen

Collaborative filtering algorithms formulate personalized recommendations for a user, first by analysing already entered ratings to identify other users with similar tastes to the user (termed as near neighbours), and then using the opinions of the near neighbours to predict which items the target user would like. However, in sparse datasets, too few near neighbours can be identified, resulting in low accuracy predictions and even a total inability to formulate personalized predictions. This paper addresses the sparsity problem by presenting an algorithm that uses robust predictions, that is predictions deemed as highly probable to be accurate, as derived ratings. Thus, the density of sparse datasets increases, and improved rating prediction coverage and accuracy are achieved. The proposed algorithm, termed as CFDR, is extensively evaluated using (1) seven widely-used collaborative filtering datasets, (2) the two most widely-used correlation metrics in collaborative filtering research, namely the Pearson correlation coefficient and the cosine similarity, and (3) the two most widely-used error metrics in collaborative filtering, namely the mean absolute error and the root mean square error. The evaluation results show that, by successfully increasing the density of the datasets, the capacity of collaborative filtering systems to formulate personalized and accurate recommendations is considerably improved.

 Artículos similares

       
 
Jin Li, Tao Han, Wenyang Guan and Xiaoqin Lian    
With the development and popularization of Intelligent Transportation Systems (ITS), Vehicle Ad-Hoc Networks (VANETs) have attracted extensive attention as a key technology. In order to achieve real-time monitoring, VANET technology enables vehicles to c... ver más
Revista: Applied Sciences

 
Dariush Ashtab, Mehdi Gholamalifard, Parviz Jokar, Andrey G. Kostianoy and Aleksander V. Semenov    
Protected areas are referred to around the world as the basis of conservation strategies. Designation of marine protected areas (MPAs) is to preserve marine biodiversity and protect species, habitats in the seas, and oceans. The simulated annealing algor... ver más

 
Liqiu Chen, Chongshi Gu, Sen Zheng and Yanbo Wang    
Real and effective monitoring data are crucial in assessing the structural safety of dams. Gross errors, resulting from manual mismeasurement, instrument failure, or other factors, can significantly impact the evaluation process. It is imperative to elim... ver más
Revista: Water

 
Yuting Bai, Yijie Niu, Zhiyao Zhao, Xuebo Jin and Xiaoyi Wang    
The phenomenon of algal bloom seriously affects the function of the aquatic ecosystems, damages the landscape of urban river and lakes, and threatens the safety of water use. The introduction of a multi-attribute decision-making method avoids the shortco... ver más
Revista: Water

 
Yanan Wu, Yalin Yang and May Yuan    
Conventional spatiotemporal methods take frequentist or density-based approaches to map event clusters over time. While these methods discern hotspots of varying continuity in space and time, their findings overlook locations of routine occurrences where... ver más
Revista: Information