Applied Sciences, Vol. 12, No. 18 (2022)
ARTICLE
TITLE

A Linked Data Application for Harmonizing Heterogeneous Biomedical Information

Nicola Capuano    
Pasquale Foggia    
Luca Greco and Pierluigi Ritrovato    

Abstract

In the biomedical field, there is an ever-increasing number of large, fragmented, and isolated data sources stored in databases and ontologies that use heterogeneous formats and poorly integrated schemes. Researchers and healthcare professionals find it extremely difficult to master this huge amount of data and extract relevant information. In this work, we propose a linked data approach, based on multilayer networks and semantic Web standards, capable of integrating and harmonizing several biomedical datasets with different schemas and semi-structured data through a multi-model database providing polyglot persistence. The domain chosen concerns the analysis and aggregation of available data on neuroendocrine neoplasms (NENs), a relatively rare type of neoplasm. Integrated information includes twelve public datasets available in heterogeneous schemas and formats including RDF, CSV, TSV, SQL, OWL, and OBO. The proposed integrated model consists of six interconnected layers representing, respectively, information on the disease, the related phenotypic alterations, the affected genes, the related biological processes, molecular functions, the involved human tissues, and drugs and compounds that show documented interactions with them. The defined scheme extends an existing three-layer model covering a subset of the mentioned aspects. A client-server application was also developed to browse and search for information on the integrated model. The main challenges of this work concern the complexity of the biomedical domain, the syntactic and semantic heterogeneity of the datasets, and the organization of the integrated model. Unlike related works, multilayer networks have been adopted to organize the model in a manageable and stratified structure, without the need to change the original datasets but by transforming their data "on the fly" to respond to user requests.
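
As a rough illustration of the layered organization described above, the following Python sketch keeps each entity in a named layer and links entities across layers. It is not the authors' implementation: the class `MultilayerNetwork`, the relation names, and every identifier such as `gene:G1` are hypothetical placeholders, and only three of the aspects named in the abstract are modelled.

```python
from collections import defaultdict


class MultilayerNetwork:
    """Toy multilayer network: each node belongs to exactly one layer."""

    def __init__(self, layers):
        self.layers = set(layers)
        self.nodes = {}                 # node id -> layer name
        self.edges = defaultdict(set)   # node id -> {(relation, neighbour id)}

    def add_node(self, node_id, layer):
        if layer not in self.layers:
            raise ValueError(f"unknown layer: {layer}")
        self.nodes[node_id] = layer

    def add_edge(self, source, relation, target):
        # Edges are stored in both directions so they can be browsed either way.
        for node_id in (source, target):
            if node_id not in self.nodes:
                raise KeyError(f"unknown node: {node_id}")
        self.edges[source].add((relation, target))
        self.edges[target].add((relation, source))

    def neighbours(self, node_id, layer=None):
        """Return neighbour ids, optionally restricted to one layer."""
        return sorted(
            other
            for _, other in self.edges[node_id]
            if layer is None or self.nodes[other] == layer
        )


# Hypothetical data: the identifiers are placeholders, not real records.
net = MultilayerNetwork(["disease", "gene", "drug"])
net.add_node("disease:NEN", "disease")
net.add_node("gene:G1", "gene")
net.add_node("drug:D1", "drug")
net.add_edge("disease:NEN", "associated_with", "gene:G1")
net.add_edge("drug:D1", "interacts_with", "gene:G1")

print(net.neighbours("gene:G1", layer="drug"))
```

Restricting a neighbour lookup to one layer is what makes the stratified structure convenient to browse: starting from a disease node, a client can move to the gene layer and from there to the drug layer without scanning unrelated entities.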
