ARTICLE
TITLE

An Advanced Big Data Quality Framework Based on Weighted Metrics

Widad Elouataoui    
Imane El Alaoui    
Saida El Mendili and Youssef Gahi    

Abstract

Big data offers numerous benefits, but its use raises new challenges in data processing, data security, and especially the degradation of data quality. Despite the growing importance of data quality for big data, quality measurement is in practice limited to a few metrics: while more than 50 data quality dimensions have been defined in the literature, only 11 dimensions are measured. This paper therefore aims to extend the measured dimensions by defining four new data quality metrics: Integrity, Accessibility, Ease of manipulation, and Security. We propose a comprehensive Big Data Quality Assessment Framework based on 12 metrics: Completeness, Timeliness, Volatility, Uniqueness, Conformity, Consistency, Ease of manipulation, Relevancy, Readability, Security, Accessibility, and Integrity. In addition, to ensure accurate data quality assessment, we apply data weights at three data unit levels: data fields, quality metrics, and quality aspects. Furthermore, we define and measure five quality aspects to provide a macro-view of data quality. Finally, an experiment is performed to implement the defined measures. The results show that the suggested methodology allows a more exhaustive and accurate big data quality assessment, defining a weighted quality score based on 12 metrics and achieving a best quality model score of 9/10.
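
The three weighting levels named in the abstract (data fields, quality metrics, quality aspects) can be illustrated with a minimal Python sketch. This is not the authors' formulation: the abstract does not give the formulas, so a plain weighted average is assumed at every level, and the sample rows, the weights, the aspect names (`aspect_a`, `aspect_b`), and the placeholder scores are all hypothetical.

```python
from typing import Mapping, Sequence


def weighted_mean(scores: Mapping[str, float], weights: Mapping[str, float]) -> float:
    """Weighted average of scores in [0, 1]; weights need not sum to 1."""
    total = sum(weights[k] for k in scores)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    return sum(scores[k] * weights[k] for k in scores) / total


def completeness(rows: Sequence[Mapping[str, object]], field: str) -> float:
    """Share of rows in which the field is present and non-empty."""
    if not rows:
        return 0.0
    filled = sum(1 for r in rows if r.get(field) not in (None, ""))
    return filled / len(rows)


# Hypothetical sample data and field weights (assumptions, not from the paper).
rows = [
    {"id": 1, "email": "a@example.com", "city": "Rabat"},
    {"id": 2, "email": "", "city": "Fes"},
    {"id": 3, "email": "c@example.com", "city": None},
    {"id": 4, "email": "d@example.com", "city": "Casablanca"},
]
field_weights = {"id": 3.0, "email": 2.0, "city": 1.0}

# Level 1: weight the per-field scores to get one metric score.
field_scores = {f: completeness(rows, f) for f in field_weights}
completeness_score = weighted_mean(field_scores, field_weights)

# Level 2: weight the metric scores inside one aspect.
# The Uniqueness score is a placeholder, not a computed value.
metric_scores = {"Completeness": completeness_score, "Uniqueness": 1.0}
metric_weights = {"Completeness": 0.6, "Uniqueness": 0.4}
aspect_score = weighted_mean(metric_scores, metric_weights)

# Level 3: weight the aspect scores to get the overall quality score.
# "aspect_b" and its score are placeholders for the remaining aspects.
aspect_scores = {"aspect_a": aspect_score, "aspect_b": 0.8}
aspect_weights = {"aspect_a": 0.7, "aspect_b": 0.3}
overall_score = weighted_mean(aspect_scores, aspect_weights)

print(f"completeness={completeness_score:.4f} "
      f"aspect={aspect_score:.4f} overall={overall_score:.4f}")
```

Normalizing by the sum of the weights at each level keeps every score in the [0, 1] range regardless of the weight scale, so field weights can be expressed as relative importance while metric and aspect weights are expressed as fractions.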

Similar articles

We have prepared a selection of other articles that may be of interest to you
Pengcheng Han, Xiaoqiong He, Yi Wang, Haijun Ren, Xu Peng and Zeliang Shu    
An advanced traction power supply system based on a single phase neutral-point-clamped (NPC) cascaded inverter is studied. The big triangular carrier equivalence method in double coordinate system is proposed, which can reduce one coordinate system, thus...
Journal: Energies
Hossein Hassani and Steve MacFeely    
With the ubiquitous use of digital technologies and the consequent data deluge, official statistics faces new challenges and opportunities. In this context, strengthening official statistics through effective data governance will be crucial to ensure rel...
Raquel Redondo, Álvaro Herrero, Emilio Corchado and Javier Sedano    
In recent years, the digital transformation has been advancing in industrial companies, supported by the Key Enabling Technologies (Big Data, IoT, etc.) of Industry 4.0. As a consequence, companies have large volumes of data and information that must be ...
Journal: Applied Sciences
Alfredo Cuzzocrea, Enzo Mumolo and Giorgio Mario Grasso    
In this paper we describe a novel algorithm, inspired by the mirror neuron discovery, to support automatic learning oriented to advanced man-machine interfaces. The algorithm introduces several points of innovation, based on complex metrics of similarity...
Journal: Algorithms
Jae-joon Chung and Hyun-Jung Kim    
This paper elucidates the development of a deep learning-based driver assistant that can prevent driving accidents arising from drowsiness. As a precursor to this assistant, the relationship between the sensation of sleep depravity among drivers during l...
Journal: Sustainability