Oil Palm Yield Estimation Based on Vegetation and Humidity Indices Generated from Satellite Images and Machine Learning Techniques

Detalhes bibliográficos
Autor(a) principal: Watson-Hernandez, Fernando
Data de Publicação: 2022
Outros Autores: Gomez-Calderon, Natalia, Silva, Rouverson Pereira da [UNESP]
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Institucional da UNESP
Texto Completo: http://dx.doi.org/10.3390/agriengineering4010019
http://hdl.handle.net/11449/218996
Resumo: Palm oil has become one of the most consumed vegetable oils in the world, and it is a key element in profitable global value chains. In Costa Rica, oil palm cultivation is one of the three crops with the largest occupied agricultural area. The objective of this study was to explain and predict yield in safe time lags for production management by using free-access satellite images. To this end, machine learning methods were performed to a 20-year data set of an oil palm plantation located in the Central Pacific Region of Costa Rica and the corresponding vegetation indices obtained from LANDSAT satellite images. Since the best correlations corresponded to a one-year time lag, the predictive models Random Forest (RF), Least Absolute Shrinkage and Selection Operator (LASSO), Extreme Gradient Boosting (XGBoost), Recursive Partitioning and Regression Trees (RPART), and Neural Network (NN) were built for a Time-lag 1. These models were applied to all genetic material and to the predominant variety (AVROS) separately. While NN showed the best performance for multispecies information (r(2) = 0.8139, NSE = 0.8131, RMSE = 0.3437, MAE = 0.2605), RF showed a better fit for AVROS (r(2) = 0.8214, NSE = 0.8020, RMSE = 0.3452, MAE = 0.2669). The most relevant vegetation indices (NDMI, MSI) are related to water in the plant. The study also determined that data distribution must be considered for the prediction and evaluation of the oil palm yield in the area under study. The estimation methods of this study provide information on the identification of important variables (NDMI) to characterize palm oil yield. Additionally, it generates a scenario with acceptable uncertainties on the yield forecast one year in advance. This information is of direct interest to the oil palm industry.
id UNSP_535b603011c728ea2c98f87ae4ea6f54
oai_identifier_str oai:repositorio.unesp.br:11449/218996
network_acronym_str UNSP
network_name_str Repositório Institucional da UNESP
repository_id_str 2946
spelling Oil Palm Yield Estimation Based on Vegetation and Humidity Indices Generated from Satellite Images and Machine Learning Techniquescrop yieldgoogle earth engineneural networkrandom forestsimulationPalm oil has become one of the most consumed vegetable oils in the world, and it is a key element in profitable global value chains. In Costa Rica, oil palm cultivation is one of the three crops with the largest occupied agricultural area. The objective of this study was to explain and predict yield in safe time lags for production management by using free-access satellite images. To this end, machine learning methods were performed to a 20-year data set of an oil palm plantation located in the Central Pacific Region of Costa Rica and the corresponding vegetation indices obtained from LANDSAT satellite images. Since the best correlations corresponded to a one-year time lag, the predictive models Random Forest (RF), Least Absolute Shrinkage and Selection Operator (LASSO), Extreme Gradient Boosting (XGBoost), Recursive Partitioning and Regression Trees (RPART), and Neural Network (NN) were built for a Time-lag 1. These models were applied to all genetic material and to the predominant variety (AVROS) separately. While NN showed the best performance for multispecies information (r(2) = 0.8139, NSE = 0.8131, RMSE = 0.3437, MAE = 0.2605), RF showed a better fit for AVROS (r(2) = 0.8214, NSE = 0.8020, RMSE = 0.3452, MAE = 0.2669). The most relevant vegetation indices (NDMI, MSI) are related to water in the plant. The study also determined that data distribution must be considered for the prediction and evaluation of the oil palm yield in the area under study. The estimation methods of this study provide information on the identification of important variables (NDMI) to characterize palm oil yield. Additionally, it generates a scenario with acceptable uncertainties on the yield forecast one year in advance. This information is of direct interest to the oil palm industry.Vice-Rector's Office for Research and Extension of the Technological Institute of Costa RicaInst Tecnol Costa Rica, Sch Agr Engn, Cartago 30101, Costa RicaSao Paulo State Univ Unesp, Sch Agr & Veterinarian Sci, Dept Engn & Math Sci, BR-14884900 Jaboticabal, SP, BrazilSao Paulo State Univ Unesp, Sch Agr & Veterinarian Sci, Dept Engn & Math Sci, BR-14884900 Jaboticabal, SP, BrazilMdpiInst Tecnol Costa RicaUniversidade Estadual Paulista (UNESP)Watson-Hernandez, FernandoGomez-Calderon, NataliaSilva, Rouverson Pereira da [UNESP]2022-04-28T18:46:00Z2022-04-28T18:46:00Z2022-03-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/article279-291http://dx.doi.org/10.3390/agriengineering4010019Agriengineering. Basel: Mdpi, v. 4, n. 1, p. 279-291, 2022.http://hdl.handle.net/11449/21899610.3390/agriengineering4010019WOS:000775673200001Web of Sciencereponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengAgriengineeringinfo:eu-repo/semantics/openAccess2024-06-06T15:18:42Zoai:repositorio.unesp.br:11449/218996Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462024-08-05T20:49:21.416761Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false
dc.title.none.fl_str_mv Oil Palm Yield Estimation Based on Vegetation and Humidity Indices Generated from Satellite Images and Machine Learning Techniques
title Oil Palm Yield Estimation Based on Vegetation and Humidity Indices Generated from Satellite Images and Machine Learning Techniques
spellingShingle Oil Palm Yield Estimation Based on Vegetation and Humidity Indices Generated from Satellite Images and Machine Learning Techniques
Watson-Hernandez, Fernando
crop yield
google earth engine
neural network
random forest
simulation
title_short Oil Palm Yield Estimation Based on Vegetation and Humidity Indices Generated from Satellite Images and Machine Learning Techniques
title_full Oil Palm Yield Estimation Based on Vegetation and Humidity Indices Generated from Satellite Images and Machine Learning Techniques
title_fullStr Oil Palm Yield Estimation Based on Vegetation and Humidity Indices Generated from Satellite Images and Machine Learning Techniques
title_full_unstemmed Oil Palm Yield Estimation Based on Vegetation and Humidity Indices Generated from Satellite Images and Machine Learning Techniques
title_sort Oil Palm Yield Estimation Based on Vegetation and Humidity Indices Generated from Satellite Images and Machine Learning Techniques
author Watson-Hernandez, Fernando
author_facet Watson-Hernandez, Fernando
Gomez-Calderon, Natalia
Silva, Rouverson Pereira da [UNESP]
author_role author
author2 Gomez-Calderon, Natalia
Silva, Rouverson Pereira da [UNESP]
author2_role author
author
dc.contributor.none.fl_str_mv Inst Tecnol Costa Rica
Universidade Estadual Paulista (UNESP)
dc.contributor.author.fl_str_mv Watson-Hernandez, Fernando
Gomez-Calderon, Natalia
Silva, Rouverson Pereira da [UNESP]
dc.subject.por.fl_str_mv crop yield
google earth engine
neural network
random forest
simulation
topic crop yield
google earth engine
neural network
random forest
simulation
description Palm oil has become one of the most consumed vegetable oils in the world, and it is a key element in profitable global value chains. In Costa Rica, oil palm cultivation is one of the three crops with the largest occupied agricultural area. The objective of this study was to explain and predict yield in safe time lags for production management by using free-access satellite images. To this end, machine learning methods were performed to a 20-year data set of an oil palm plantation located in the Central Pacific Region of Costa Rica and the corresponding vegetation indices obtained from LANDSAT satellite images. Since the best correlations corresponded to a one-year time lag, the predictive models Random Forest (RF), Least Absolute Shrinkage and Selection Operator (LASSO), Extreme Gradient Boosting (XGBoost), Recursive Partitioning and Regression Trees (RPART), and Neural Network (NN) were built for a Time-lag 1. These models were applied to all genetic material and to the predominant variety (AVROS) separately. While NN showed the best performance for multispecies information (r(2) = 0.8139, NSE = 0.8131, RMSE = 0.3437, MAE = 0.2605), RF showed a better fit for AVROS (r(2) = 0.8214, NSE = 0.8020, RMSE = 0.3452, MAE = 0.2669). The most relevant vegetation indices (NDMI, MSI) are related to water in the plant. The study also determined that data distribution must be considered for the prediction and evaluation of the oil palm yield in the area under study. The estimation methods of this study provide information on the identification of important variables (NDMI) to characterize palm oil yield. Additionally, it generates a scenario with acceptable uncertainties on the yield forecast one year in advance. This information is of direct interest to the oil palm industry.
publishDate 2022
dc.date.none.fl_str_mv 2022-04-28T18:46:00Z
2022-04-28T18:46:00Z
2022-03-01
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://dx.doi.org/10.3390/agriengineering4010019
Agriengineering. Basel: Mdpi, v. 4, n. 1, p. 279-291, 2022.
http://hdl.handle.net/11449/218996
10.3390/agriengineering4010019
WOS:000775673200001
url http://dx.doi.org/10.3390/agriengineering4010019
http://hdl.handle.net/11449/218996
identifier_str_mv Agriengineering. Basel: Mdpi, v. 4, n. 1, p. 279-291, 2022.
10.3390/agriengineering4010019
WOS:000775673200001
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Agriengineering
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv 279-291
dc.publisher.none.fl_str_mv Mdpi
publisher.none.fl_str_mv Mdpi
dc.source.none.fl_str_mv Web of Science
reponame:Repositório Institucional da UNESP
instname:Universidade Estadual Paulista (UNESP)
instacron:UNESP
instname_str Universidade Estadual Paulista (UNESP)
instacron_str UNESP
institution UNESP
reponame_str Repositório Institucional da UNESP
collection Repositório Institucional da UNESP
repository.name.fl_str_mv Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)
repository.mail.fl_str_mv
_version_ 1808129254326534144