Portada: Infraestructura para la Logística Sustentable 2050
DESTACADO | CPI Propone - Resumen Ejecutivo

Infraestructura para el desarrollo que queremos 2026-2030

Elaborado por el Consejo de Políticas de Infraestructura (CPI), este documento constituye una hoja de ruta estratégica para orientar la inversión y la gestión de infraestructura en Chile. Presenta propuestas organizadas en siete ejes estratégicos, sin centrarse en proyectos específicos, sino en influir en las decisiones de política pública para promover una infraestructura que conecte territorios, genere oportunidades y eleve la calidad de vida de la población.
ARTÍCULO
TITULO

Probabilistic Forecasting Based Joint Detection and Imputation of Clustered Bad Data in Residential Electricity Loads

Soyeong Park    
Seungwook Yoon    
Byungtak Lee    
Seokkap Ko and Euiseok Hwang    

Resumen

Residential electricity load data can include numerous types of bad data, even clustered bad data, as they that are typically captured by simple measurement instruments. For example, in the case of a time-series of Not-a-Number (NaN) errors, the values before or next to a NaN may appear as the sum of actual values during the times of the NaN series. To utilize load data that includes such erroneous data for prediction or data mining analysis, customized detection and imputation should be conducted. This study proposes a new joint detection and imputation method for handling clustered bad data in residential electricity loads. Examples of these data are known invalid data points, such as consecutive NaN or zero values followed by or being ahead of an outlier. The proposed joint detection and imputation scheme first investigates the neighbors of the invalid data points, using probabilistic forecasting techniques. These techniques are implemented by the next valid neighbors to determine whether there is an anomaly or not. Then, adaptive imputations are applied on the basis of the detection, the candidate point should be imputed simultaneously or not. To assess the potential of the newly proposed scheme to characterize the clustered bad data, we analyzed the electricity loads of 354 households. Moreover, joint detection and imputations are conducted to test with the randomly injected synthesized clustered bad data (containing NaNs of various lengths) that is followed by the summation of the actual NaN values. The proposed scheme succeeded in detecting clustered bad data with an accuracy of 95.5% and a false alarm rate of 3.6% for all households in the dataset. Outlier detection-assisted imputation schemes are evaluated for NaNs with optional outliers. Results demonstrate that these schemes improve the overall accuracy significantly compared to schemes without outlier detection.

Artículos similares

Hemos preparados una selección de otros artículos que pudieran ser de tu interés
Dong-Jiing Doong, Shien-Tsung Chen, Ying-Chih Chen and Cheng-Han Tsai    
Coastal freak waves (CFWs) are unpredictable large waves that occur suddenly in coastal areas and have been reported to cause casualties worldwide. CFW forecasting is difficult because the complex mechanisms that cause CFWs are not well understood. This ... ver más
Carla Sahori Seefoo Jarquin, Alessandro Gandelli, Francesco Grimaccia and Marco Mussetta    
Understanding how, why and when energy consumption changes provides a tool for decision makers throughout the power networks. Thus, energy forecasting provides a great service. This research proposes a probabilistic approach to capture the five inherent ... ver más
Revista: Forecasting
Yuan-Kang Wu, Cheng-Liang Huang, Quoc-Thang Phan and Yuan-Yao Li    
Solar power has rapidly become an increasingly important energy source in many countries over recent years; however, the intermittent nature of photovoltaic (PV) power generation has a significant impact on existing power systems. To reduce this uncertai... ver más
Revista: Energies
Binquan Li, Zhongmin Liang, Qingrui Chang, Wei Zhou, Huan Wang, Jun Wang and Yiming Hu    
Low-quality input data (such as sparse rainfall gauges, low spatial resolution soil type and land use maps) have limited the application of physically-based distributed hydrological models in operational practices in many data-sparse regions. It is neces... ver más
Revista: Sustainability
Antonio Bello, Javier Reneses and Antonio Muñoz    
One of the most relevant challenges that have arisen in electricity markets during the last few years is the emergence of extremely low prices. Trying to predict these events is crucial for market agents in a competitive environment. This paper proposes ... ver más
Revista: Energies