Informatics, Vol. 5, No. 1 (2018)

Utilizing Provenance in Reusable Research Objects

Zhihao Yuan, Dai Hai Ton That, Siddhant Kothari, Gabriel Fils and Tanu Malik

Abstract

Science is conducted collaboratively, often requiring the sharing of knowledge about computational experiments. When experiments include only datasets, they can be shared using Uniform Resource Identifiers (URIs) or Digital Object Identifiers (DOIs). An experiment, however, seldom includes only datasets; more often it also includes software, its past execution, provenance, and associated documentation. The Research Object has recently emerged as a comprehensive and systematic method for aggregating and identifying the diverse elements of computational experiments. While necessary, mere aggregation is not sufficient for sharing computational experiments: other users must be able to easily recompute on these shared research objects. Computational provenance is often the key to enabling such reuse. In this paper, we show how reusable research objects can utilize provenance to correctly repeat a previous reference execution, to construct a subset of a research object for partial reuse, and to reuse existing contents of a research object for modified reuse. We describe two methods for summarizing provenance that aid in understanding the contents and past executions of a research object. The first method obtains a process view by collapsing low-level system information, and the second obtains a summary graph by grouping related nodes and edges, with the goal of producing a graph view similar to the application workflow. Through detailed experiments, we show the efficacy and efficiency of our algorithms.
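
To make the idea of a process view more concrete, the following is a minimal Python sketch of one way file-level detail could be collapsed out of a provenance graph. It is not the algorithm from the paper, whose details the abstract does not give. The edge-list representation, the `p:` / `f:` node prefixes, and the function name `process_view` are all assumptions made for illustration. Edges are assumed to point in the direction of data flow (a file points to the process that reads it, and a process points to the file it writes or the child process it spawns).

```python
from collections import defaultdict


def process_view(edges, is_process):
    """Collapse file nodes out of a provenance edge list (illustrative only).

    edges: iterable of (src, dst) pairs in data-flow direction.
    is_process: predicate telling process nodes apart from file nodes.
    Returns a set of process-to-process edges.
    """
    writers = defaultdict(set)  # file -> processes that wrote it
    readers = defaultdict(set)  # file -> processes that read it
    view = set()

    for src, dst in edges:
        if is_process(src) and is_process(dst):
            view.add((src, dst))      # direct edge, e.g. a spawn: keep it
        elif is_process(src):
            writers[dst].add(src)     # process -> file (write)
        elif is_process(dst):
            readers[src].add(dst)     # file -> process (read)
        # file -> file edges carry no process information and are dropped

    # Link each writer of a file to each distinct reader of that file.
    for file_node, file_writers in writers.items():
        for writer in file_writers:
            for reader in readers.get(file_node, ()):
                if writer != reader:
                    view.add((writer, reader))
    return view


# Hypothetical example graph; node names are made up for illustration.
example_edges = [
    ("p:run.sh", "p:prep"),       # run.sh spawns prep
    ("f:raw.csv", "p:prep"),      # prep reads raw.csv
    ("p:prep", "f:clean.csv"),    # prep writes clean.csv
    ("f:clean.csv", "p:model"),   # model reads clean.csv
    ("p:model", "f:out.txt"),     # model writes out.txt
]
collapsed = process_view(example_edges, lambda n: n.startswith("p:"))
```

In this example the intermediate file `clean.csv` disappears and is replaced by a single edge from `p:prep` to `p:model`, which is the kind of simplification a process view aims for. Input and output files that touch only one process (`raw.csv`, `out.txt`) leave no trace in the collapsed graph under this particular design choice.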
