Inicio  /  Applied Sciences  /  Vol: 14 Par: 1 (2024)  /  Artículo
ARTÍCULO
TITULO

Modeling the Spatial Distribution of Population Based on Random Forest and Parameter Optimization Methods: A Case Study of Sichuan, China

Yunzhou Chen    
Shumin Wang    
Ziying Gu and Fan Yang    

Resumen

Spatial population distribution data is the discretization of demographic data into spatial grids, which has vital reference significance for disaster emergency response, disaster assessment, emergency rescue resource allocation, and post-disaster reconstruction. The random forest (RF) model, as a prominent method for modeling the spatial distribution of population, has been studied by many scholars, both domestically and abroad. Specifically, research has focused on aspects such as multi-source data fusion, feature selection, and data accuracy evaluation within the modeling process. However, discussions about parameter optimization methods during the modeling process and the impact of different optimization methods on modeling accuracy are relatively limited. In light of the above circumstances, this paper employs the RF model to conduct research on population spatialization with multi-source spatial information data. The study primarily explores the differences in model parameter optimization achieved through random search algorithms, grid search algorithms, genetic algorithms, simulated annealing algorithms, Bayesian optimization based on Gaussian process algorithms, and Bayesian optimization based on gradient boosting regression tree algorithms. Additionally, the study investigates the influence of different optimization algorithms on the accuracy of population spatialization modeling. Subsequently, the model with the highest accuracy is selected as the prediction model for population spatialization. Based on this model, a spatial population distribution dataset of Sichuan Province at a 1 km resolution is generated. Finally, the population dataset created in this paper is compared and validated with open datasets such as GPW, LandScan, and WorldPop. Experimental results indicate that the spatial population distribution dataset produced by the Bayesian optimization-based random forest model proposed in this paper exhibits a higher fitting accuracy with real data. The Coefficient of Determination (R2) is 0.6628, the Mean Absolute Error (MAE) is 12,459, and the Root Mean Squared Error (RMSE) is 25,037. Compared to publicly available international datasets, the dataset generated in this paper more accurately represents the spatial distribution of the population.

 Artículos similares

       
 
WoonSeong Jeong, ByungChan Kong and Sang-Guk Yum    
The demand for compact housing is on the rise, driven by the need for floor plans that accommodate stakeholders? preferences. However, clients frequently struggle to convey their spatial needs to professionals, such as architects, due to a lack of means ... ver más
Revista: Applied Sciences

 
Jonathan Stiles, Harvey Miller     Pág. 97 - 113
This study identifies built environmental factors that influence the determination of fault in urban pedestrian crashes in the United States, with implications for both safety and equity. Using data from Columbus, Ohio, we apply regression modeling, spat... ver más

 
Zeyu Xu, Wenbin Yu, Chengjun Zhang and Yadang Chen    
In the era of noisy intermediate-scale quantum (NISQ) computing, the synergistic collaboration between quantum and classical computing models has emerged as a promising solution for tackling complex computational challenges. Long short-term memory (LSTM)... ver más
Revista: Information

 
Guoliang Sun, Tingting Guo, Bao Yuan, Xiaojing Yang and Guang Wang    
The sample environment is essential to neutron scattering experiments as it induces the sample under study into a phase or state of particular interest. Various sample environments have been developed, yet the high-voltage electric field has rarely been ... ver más
Revista: Instruments

 
Chinh Lieou, Serge Jolicoeur, Thomas Guyondet, Stéphane O?Carroll and Tri Nguyen-Quang    
This study examines the hydrodynamic regimes in Shediac Bay, located in New Brunswick, Canada, with a focus on the breach in the Grande-Digue sand spit. The breach, which was developed in the mid-1980s, has raised concerns about its potential impacts on ... ver más