Inicio  /  Applied Sciences  /  Vol: 9 Par: 7 (2019)  /  Artículo
ARTÍCULO
TITULO

Comparison of Machine Learning Regression Algorithms for Cotton Leaf Area Index Retrieval Using Sentinel-2 Spectral Bands

Huihui Mao    
Jihua Meng    
Fujiang Ji    
Qiankun Zhang and Huiting Fang    

Resumen

Leaf area index (LAI) is a crucial crop biophysical parameter that has been widely used in a variety of fields. Five state-of-the-art machine learning regression algorithms (MLRAs), namely, artificial neural network (ANN), support vector regression (SVR), Gaussian process regression (GPR), random forest (RF) and gradient boosting regression tree (GBRT), have been used in the retrieval of cotton LAI with Sentinel-2 spectral bands. The performances of the five machine learning models are compared for better applications of MLRAs in remote sensing, since challenging problems remain in the selection of MLRAs for crop LAI retrieval, as well as the decision as to the optimal number for the training sample size and spectral bands to different MLRAs. A comprehensive evaluation was employed with respect to model accuracy, computational efficiency, sensitivity to training sample size and sensitivity to spectral bands. We conducted the comparison of five MLRAs in an agricultural area of Northwest China over three cotton seasons with the corresponding field campaigns for modeling and validation. Results show that the GBRT model outperforms the other models with respect to model accuracy in average (R2¯" role="presentation">??2?????????R2¯ R 2 ¯ = 0.854, RMSE¯" role="presentation">????????????????????????????????RMSE¯ R M S E ¯ = 0.674 and MAE¯" role="presentation">??????????????????????????MAE¯ M A E ¯ = 0.456). SVR achieves the best performance in computational efficiency, which means it is fast to train, and to validate that it has great potentials to deliver near-real-time operational products for crop management. As for sensitivity to training sample size, GBRT behaves as the most robust model, and provides the best model accuracy on the average among the variations of training sample size, compared with other models (R2¯" role="presentation">??2?????????R2¯ R 2 ¯ = 0.884, RMSE¯" role="presentation">????????????????????????????????RMSE¯ R M S E ¯ = 0.615 and MAE¯" role="presentation">??????????????????????????MAE¯ M A E ¯ = 0.452). Spectral bands sensitivity analysis with dCor (distance correlation), combined with the backward elimination approach, indicates that SVR, GPR and RF provide relatively robust performance to the spectral bands, while ANN outperforms the other models in terms of model accuracy on the average among the reduction of spectral bands (R2¯" role="presentation">??2?????????R2¯ R 2 ¯ = 0.881, RMSE¯" role="presentation">????????????????????????????????RMSE¯ R M S E ¯ = 0.625 and MAE¯" role="presentation">??????????????????????????MAE¯ M A E ¯ = 0.480). A comprehensive evaluation indicates that GBRT is an appealing alternative for cotton LAI retrieval, except for its computational efficiency. Despite the different performance of the ML models, all models exhibited considerable potential for cotton LAI retrieval, which could offer accurate crop parameters information timely and accurately for crop fields management and agricultural production decisions.

 Artículos similares

       
 
Sipho G. Thango, Georgios A. Drosopoulos, Siphesihle M. Motsa and Georgios E. Stavroulakis    
A methodology to predict key aspects of the structural response of masonry walls under blast loading using artificial neural networks (ANN) is presented in this paper. The failure patterns of masonry walls due to in and out-of-plane loading are complex d... ver más
Revista: Infrastructures

 
Fenfang Li, Zhengzhang Zhao, Li Wang and Han Deng    
Sentence Boundary Disambiguation (SBD) is crucial for building datasets for tasks such as machine translation, syntactic analysis, and semantic analysis. Currently, most automatic sentence segmentation in Tibetan adopts the methods of rule-based and stat... ver más
Revista: Applied Sciences

 
Falah Amer Abdulazeez, Ismail Taha Ahmed and Baraa Tareq Hammad    
A significant quantity of malware is created on purpose every day. Users of smartphones and computer networks now mostly worry about malware. These days, malware detection is a major concern in the cybersecurity area. Several factors can impact malware d... ver más
Revista: Applied Sciences

 
Kui Zeng, Shutan Xu, Daode Shu and Ming Chen    
Medaka (Oryzias latipes), as a crucial model organism in biomedical research, holds significant importance in fields such as cardiovascular diseases. Currently, the analysis of the medaka ventricle relies primarily on visual observation under a microscop... ver más
Revista: Applied Sciences

 
Vahid Safavi, Arash Mohammadi Vaniar, Najmeh Bazmohammadi, Juan C. Vasquez and Josep M. Guerrero    
Predicting the remaining useful life (RUL) of lithium-ion (Li-ion) batteries is crucial to preventing system failures and enhancing operational performance. Knowing the RUL of a battery enables one to perform preventative maintenance or replace the batte... ver más
Revista: Information