Inicio  /  Algorithms  /  Vol: 16 Par: 7 (2023)  /  Artículo
ARTÍCULO
TITULO

Probability Density Estimation through Nonparametric Adaptive Partitioning and Stitching

Zach D. Merino    
Jenny Farmer and Donald J. Jacobs    

Resumen

We present a novel nonparametric adaptive partitioning and stitching (NAPS) algorithm to estimate a probability density function (PDF) of a single variable. Sampled data is partitioned into blocks using a branching tree algorithm that minimizes deviations from a uniform density within blocks of various sample sizes arranged in a staggered format. The block sizes are constructed to balance the load in parallel computing as the PDF for each block is independently estimated using the nonparametric maximum entropy method (NMEM) previously developed for automated high throughput analysis. Once all block PDFs are calculated, they are stitched together to provide a smooth estimate throughout the sample range. Each stitch is an averaging process over weight factors based on the estimated cumulative distribution function (CDF) and a complementary CDF that characterize how data from flanking blocks overlap. Benchmarks on synthetic data show that our PDF estimates are fast and accurate for sample sizes ranging from 29" role="presentation">2929 2 9 to 227" role="presentation">227227 2 27 , across a diverse set of distributions that account for single and multi-modal distributions with heavy tails or singularities. We also generate estimates by replacing NMEM with kernel density estimation (KDE) within blocks. Our results indicate that NAPS(NMEM) is the best-performing method overall, while NAPS(KDE) improves estimates near boundaries compared to standard KDE.

 Artículos similares

       
 
Ning Jin, Linlin Song, Gabriel Jing Huang and Ke Yan    
Residential electricity consumption forecasting plays a crucial role in the rational allocation of resources reducing energy waste and enhancing the grid-connected operation of power systems. Probabilistic forecasting can provide more comprehensive infor... ver más
Revista: Information

 
Leonardo De Bona Becker, Maria do Carmo Reis Cavalcanti and Alfredo Affonso Monteiro Marques    
Tailings dam accidents emphasize the importance of an adequate understanding of the strength parameters of tailings to improve the efficiency and effectiveness of the design, construction, and operation of such structures. Usually, the tailings strength ... ver más
Revista: Infrastructures

 
Liping Chen, Hui Zhang, Wei Wang and Qiliang Zhang    
Bidirectional asymptotic structure methods have long been used to solve topological optimization problems, but are prone to being stuck in local optimal solutions. To solve this problem, this paper proposed a topology optimization method based on the Bi-... ver más
Revista: Applied Sciences

 
Liliya A. Demidova, Dmitry O. Zhukov, Elena G. Andrianova and Alexander S. Sigov    
This paper explores the social dynamics of processes in complex systems involving humans by focusing on user activity in online media outlets. The R/S analysis showed that the time series of the processes under consideration are fractal and anti-persiste... ver más
Revista: Information

 
Guo Li, Junbo Liu, Liu Yang, Huimin Zhou and Shuiting Ding    
The probabilistic damage tolerance analysis of aeroengine rotor disks is essential for determining if the disk is safe. To calculate the probability of failure, the numerical integration method is efficient if the integral formula of the probability dens... ver más
Revista: Aerospace