Inicio  /  Applied Sciences  /  Vol: 12 Par: 20 (2022)  /  Artículo
ARTÍCULO
TITULO

An Ensemble Framework to Improve the Accuracy of Prediction Using Clustered Random-Forest and Shrinkage Methods

Zari Farhadi    
Hossein Bevrani    
Mohammad-Reza Feizi-Derakhshi    
Wonjoon Kim and Muhammad Fazal Ijaz    

Resumen

Nowadays, in the topics related to prediction, in addition to increasing the accuracy of existing algorithms, the reduction of computational time is a challenging issue that has attracted much attention. Since the existing methods may not have enough efficiency and accuracy, we use a combination of machine-learning algorithms and statistical methods to solve this problem. Furthermore, we reduce the computational time in the testing model by automatically reducing the number of trees using penalized methods and ensembling the remaining trees. We call this efficient combinatorial method ?ensemble of clustered and penalized random forest (ECAPRAF)?. This method consists of four fundamental parts. In the first part, k-means clustering is used to identify homogeneous subsets of data and assign them to similar groups. In the second part, a tree-based algorithm is used within each cluster as a predictor model; in this work, random forest is selected. In the next part, penalized methods are used to reduce the number of random-forest trees and remove high-variance trees from the proposed model. This increases model accuracy and decreases the computational time in the test phase. In the last part, the remaining trees within each cluster are combined. The results of the simulation and two real datasets based on the WRMSE criterion show that our proposed method has better performance than the traditional random forest by reducing approximately 12.75%, 11.82%, 12.93%, and 11.68% and selecting 99, 106, 113, and 118 trees for the ECAPRAF?EN algorithm.

 Artículos similares

       
 
Junartho Halomoan, Kalamullah Ramli, Dodi Sudiana, Teddy Surya Gunawan and Muhammad Salman    
More than 1.3 million people are killed in traffic accidents annually. Road traffic accidents are mostly caused by human error. Therefore, an accurate driving fatigue detection system is required for drivers. Most driving fatigue detection studies concen... ver más
Revista: Information

 
Jiahui Zhao, Zhibin Li, Pan Liu     Pág. 19?41
The land-use identification process, which involves quantifying the types and intensity of human activities at a regional level, is a critical investigation step for ongoing land-use planning. One limitation of land-use identification practices is that t... ver más

 
Zhipeng Zhang and Liyi Zhang    
Electroencephalography (EEG)-based emotion recognition technologies can effectively help robots to perceive human behavior, which have attracted extensive attention in human?machine interaction (HMI). Due to the complexity of EEG data, current researcher... ver más
Revista: Applied Sciences

 
Junartho Halomoan, Kalamullah Ramli, Dodi Sudiana, Teddy Surya Gunawan and Muhammad Salman    
One of the WHO?s strategies to reduce road traffic injuries and fatalities is to enhance vehicle safety. Driving fatigue detection can be used to increase vehicle safety. Our previous study developed an ECG-based driving fatigue detection framework with ... ver más
Revista: Information

 
Elissaios Sarmas, Evangelos Spiliotis, Nikos Dimitropoulos, Vangelis Marinakis and Haris Doukas    
Energy efficiency financing is considered among the top priorities in the energy sector among several stakeholders. In this context, accurately estimating the energy savings achieved by energy efficiency actions before being approved and implemented is o... ver más
Revista: Applied Sciences