ARTÍCULO
TITULO

A Probabilistic Data Fusion Modeling Approach for Extracting True Values from Uncertain and Conflicting Attributes

Ashraf Jaradat    
Fadi Safieddine    
Aziz Deraman    
Omar Ali    
Ahmad Al-Ahmad and Yehia Ibrahim Alzoubi    

Resumen

Real-world data obtained from integrating heterogeneous data sources are often multi-valued, uncertain, imprecise, error-prone, outdated, and have different degrees of accuracy and correctness. It is critical to resolve data uncertainty and conflicts to present quality data that reflect actual world values. This task is called data fusion. In this paper, we deal with the problem of data fusion based on probabilistic entity linkage and uncertainty management in conflict data. Data fusion has been widely explored in the research community. However, concerns such as explicit uncertainty management and on-demand data fusion, which can cope with dynamic data sources, have not been studied well. This paper proposes a new probabilistic data fusion modeling approach that attempts to find true data values under conditions of uncertain or conflicted multi-valued attributes. These attributes are generated from the probabilistic linkage and merging alternatives of multi-corresponding entities. Consequently, the paper identifies and formulates several data fusion cases and sample spaces that require further conditional computation using our computational fusion method. The identification is established to fit with a real-world data fusion problem. In the real world, there is always the possibility of heterogeneous data sources, the integration of probabilistic entities, single or multiple truth values for certain attributes, and different combinations of attribute values as alternatives for each generated entity. We validate our probabilistic data fusion approach through mathematical representation based on three data sources with different reliability scores. The validity of the approach was assessed via implementation into our probabilistic integration system to show how it can manage and resolve different cases of data conflicts and inconsistencies. The outcome showed improved accuracy in identifying true values due to the association of constructive evidence.

 Artículos similares

       
 
Yalin Yang, Yanan Wu and May Yuan    
In-person social events bring people to places, while people and places influence where and what social events occur. Knowing what people do and where they build social relationships gives insights into the distribution and availability of places for soc... ver más

 
Nunziarita Palazzolo, David J. Peres, Brunella Bonaccorso and Antonino Cancelliere    
Assessing and monitoring the spatial extent of drought is of key importance to forecasting the future evolution of drought conditions and taking timely preventive and mitigation measures. A commonly used approach in regional drought analysis involves spa... ver más
Revista: Water

 
Alessandro Bocci, Stefano Forti, Roberto Guanciale, Gian-Luigi Ferrari and Antonio Brogi    
The security of Cloud applications is a major concern for application developers and operators. Protecting users? data confidentiality requires methods to avoid leakage from vulnerable software and unreliable Cloud providers. Recently, trusted execution ... ver más
Revista: Future Internet

 
Katherine Ho and Rebecca Loraamm    
Animal movements are realizations of complex spatiotemporal processes. Central to these processes are the varied environmental contexts in which animals move, which fundamentally impact the movement trajectories of individuals at fine spatial and tempora... ver más

 
Jun-Fang Wang, Jian-Fu Lin and Yan-Long Xie    
Subjected to complex loadings from the wheel?rail interaction, turnout rail is prone to crack damage. This paper aims to develop a condition evaluation method for crack-alike damage detection of in-service turnout rail. A covariance-based structural cond... ver más
Revista: Infrastructures