Predicting coffee yield based on agroclimatic data and machine learning

Detalhes bibliográficos
Autor(a) principal: de Oliveira Aparecido, Lucas Eduardo
Data de Publicação: 2022
Outros Autores: Lorençone, João Antonio, Lorençone, Pedro Antonio, Torsoni, Guilherme Botega, Lima, Rafael Fausto [UNESP], dade Silva CabralMoraes, José Reinaldo
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Institucional da UNESP
Texto Completo: http://dx.doi.org/10.1007/s00704-022-03983-z
http://hdl.handle.net/11449/230415
Resumo: Climate directly and indirectly influences agriculture, being the main responsible for low and high yields. Prior knowledge on yield helps coffee farmers in their decision-making and planning for the future harvest, avoiding unnecessary costs and losses during the harvesting process. Thus, we sought to predict coffee yield with regressive models using meteorological data of the state of Paraná, Brazil. This study was carried out in 15 localities that produce Coffea arabica in this Brazilian state. The climate data were collected using the NASA/POWER platform from 1989 to 2020, while the data of arabica coffee yield (bags/ha) were obtained by CONAB from 2003 to 2018. The Penman–Monteith method was used to calculate the reference evapotranspiration and the climatological water balance (WB) was calculated based on Thornthwaite and Mather (1955). Multiple linear regression was used in the data modeling, in which C. arabica yield was the dependent variable and air temperature, precipitation, solar radiation, water deficit, water surplus, and soil water storage were the independent variables. The comparison between the estimation models and the actual data was performed using the statistical indices RMSE (accuracy) and adjusted coefficient of determination (R2adj) (precision). Multiple linear regression models can predict arabica coffee yield in the state of Paraná 2 to 3 months before harvest. The maximum air temperature is the climate element that most influences coffee plants, especially during fruit formation (March). Maximum air temperatures of 31.01 °C in March can reduce coffee production. Wenceslau Braz, Jacarezinho, and Ibaiti presented the highest yields, with mean values of 32.5, 29.9, and 29.3 bags ha−1, respectively. The models calibrated for localities that have Argisol had the highest mean accuracy, with an RMSE of 2.68 bags ha−1. The best models were calibrated for Paranavaí (Latosol), with an RMSE of 0.78 bags ha−1 and R2adj of 0.89, and Ibaiti (Argisol), with RMSE and R2adj values of 3.09 bags ha−1 and 0.83, respectively. Paranavaí has a mean difference between the actual and estimated coffee yield of only 0.86 bags ha−1. The highest deviations were observed in Wenceslau Braz (9.17 bags ha−1) and the lowest deviations were found in Paranavaí (0.86 bags ha−1). The models can be used to predict arabica coffee yield, assisting the planning of coffee farmers in the northern region of the state of Paraná.
id UNSP_7d1855b66e8ae3b1a7ae895401af6c19
oai_identifier_str oai:repositorio.unesp.br:11449/230415
network_acronym_str UNSP
network_name_str Repositório Institucional da UNESP
repository_id_str 2946
spelling Predicting coffee yield based on agroclimatic data and machine learningClimate directly and indirectly influences agriculture, being the main responsible for low and high yields. Prior knowledge on yield helps coffee farmers in their decision-making and planning for the future harvest, avoiding unnecessary costs and losses during the harvesting process. Thus, we sought to predict coffee yield with regressive models using meteorological data of the state of Paraná, Brazil. This study was carried out in 15 localities that produce Coffea arabica in this Brazilian state. The climate data were collected using the NASA/POWER platform from 1989 to 2020, while the data of arabica coffee yield (bags/ha) were obtained by CONAB from 2003 to 2018. The Penman–Monteith method was used to calculate the reference evapotranspiration and the climatological water balance (WB) was calculated based on Thornthwaite and Mather (1955). Multiple linear regression was used in the data modeling, in which C. arabica yield was the dependent variable and air temperature, precipitation, solar radiation, water deficit, water surplus, and soil water storage were the independent variables. The comparison between the estimation models and the actual data was performed using the statistical indices RMSE (accuracy) and adjusted coefficient of determination (R2adj) (precision). Multiple linear regression models can predict arabica coffee yield in the state of Paraná 2 to 3 months before harvest. The maximum air temperature is the climate element that most influences coffee plants, especially during fruit formation (March). Maximum air temperatures of 31.01 °C in March can reduce coffee production. Wenceslau Braz, Jacarezinho, and Ibaiti presented the highest yields, with mean values of 32.5, 29.9, and 29.3 bags ha−1, respectively. The models calibrated for localities that have Argisol had the highest mean accuracy, with an RMSE of 2.68 bags ha−1. The best models were calibrated for Paranavaí (Latosol), with an RMSE of 0.78 bags ha−1 and R2adj of 0.89, and Ibaiti (Argisol), with RMSE and R2adj values of 3.09 bags ha−1 and 0.83, respectively. Paranavaí has a mean difference between the actual and estimated coffee yield of only 0.86 bags ha−1. The highest deviations were observed in Wenceslau Braz (9.17 bags ha−1) and the lowest deviations were found in Paranavaí (0.86 bags ha−1). The models can be used to predict arabica coffee yield, assisting the planning of coffee farmers in the northern region of the state of Paraná.Federal Institute of Sul de Minas Gerais (IFSULDEMINAS) – Campus Muzambinho, Minas GeraisFederal Institute of Mato Grosso Do Sul (IFMS) - Navirai, Mato Grosso Do SulGraduate Program in Agronomy (Soil Science) of the State University of Sao Paulo (FCAV/UNESP) - Jaboticabal, JaboticabalGraduate Program in Agronomy (Soil Science) of the State University of Sao Paulo (FCAV/UNESP) - Jaboticabal, JaboticabalFederal Institute of Sul de Minas Gerais (IFSULDEMINAS) – Campus MuzambinhoFederal Institute of Mato Grosso Do Sul (IFMS) - NaviraiUniversidade Estadual Paulista (UNESP)de Oliveira Aparecido, Lucas EduardoLorençone, João AntonioLorençone, Pedro AntonioTorsoni, Guilherme BotegaLima, Rafael Fausto [UNESP]dade Silva CabralMoraes, José Reinaldo2022-04-29T08:39:50Z2022-04-29T08:39:50Z2022-01-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://dx.doi.org/10.1007/s00704-022-03983-zTheoretical and Applied Climatology.1434-44830177-798Xhttp://hdl.handle.net/11449/23041510.1007/s00704-022-03983-z2-s2.0-85124839115Scopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengTheoretical and Applied Climatologyinfo:eu-repo/semantics/openAccess2024-06-07T14:24:06Zoai:repositorio.unesp.br:11449/230415Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462024-06-07T14:24:06Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false
dc.title.none.fl_str_mv Predicting coffee yield based on agroclimatic data and machine learning
title Predicting coffee yield based on agroclimatic data and machine learning
spellingShingle Predicting coffee yield based on agroclimatic data and machine learning
de Oliveira Aparecido, Lucas Eduardo
title_short Predicting coffee yield based on agroclimatic data and machine learning
title_full Predicting coffee yield based on agroclimatic data and machine learning
title_fullStr Predicting coffee yield based on agroclimatic data and machine learning
title_full_unstemmed Predicting coffee yield based on agroclimatic data and machine learning
title_sort Predicting coffee yield based on agroclimatic data and machine learning
author de Oliveira Aparecido, Lucas Eduardo
author_facet de Oliveira Aparecido, Lucas Eduardo
Lorençone, João Antonio
Lorençone, Pedro Antonio
Torsoni, Guilherme Botega
Lima, Rafael Fausto [UNESP]
dade Silva CabralMoraes, José Reinaldo
author_role author
author2 Lorençone, João Antonio
Lorençone, Pedro Antonio
Torsoni, Guilherme Botega
Lima, Rafael Fausto [UNESP]
dade Silva CabralMoraes, José Reinaldo
author2_role author
author
author
author
author
dc.contributor.none.fl_str_mv Federal Institute of Sul de Minas Gerais (IFSULDEMINAS) – Campus Muzambinho
Federal Institute of Mato Grosso Do Sul (IFMS) - Navirai
Universidade Estadual Paulista (UNESP)
dc.contributor.author.fl_str_mv de Oliveira Aparecido, Lucas Eduardo
Lorençone, João Antonio
Lorençone, Pedro Antonio
Torsoni, Guilherme Botega
Lima, Rafael Fausto [UNESP]
dade Silva CabralMoraes, José Reinaldo
description Climate directly and indirectly influences agriculture, being the main responsible for low and high yields. Prior knowledge on yield helps coffee farmers in their decision-making and planning for the future harvest, avoiding unnecessary costs and losses during the harvesting process. Thus, we sought to predict coffee yield with regressive models using meteorological data of the state of Paraná, Brazil. This study was carried out in 15 localities that produce Coffea arabica in this Brazilian state. The climate data were collected using the NASA/POWER platform from 1989 to 2020, while the data of arabica coffee yield (bags/ha) were obtained by CONAB from 2003 to 2018. The Penman–Monteith method was used to calculate the reference evapotranspiration and the climatological water balance (WB) was calculated based on Thornthwaite and Mather (1955). Multiple linear regression was used in the data modeling, in which C. arabica yield was the dependent variable and air temperature, precipitation, solar radiation, water deficit, water surplus, and soil water storage were the independent variables. The comparison between the estimation models and the actual data was performed using the statistical indices RMSE (accuracy) and adjusted coefficient of determination (R2adj) (precision). Multiple linear regression models can predict arabica coffee yield in the state of Paraná 2 to 3 months before harvest. The maximum air temperature is the climate element that most influences coffee plants, especially during fruit formation (March). Maximum air temperatures of 31.01 °C in March can reduce coffee production. Wenceslau Braz, Jacarezinho, and Ibaiti presented the highest yields, with mean values of 32.5, 29.9, and 29.3 bags ha−1, respectively. The models calibrated for localities that have Argisol had the highest mean accuracy, with an RMSE of 2.68 bags ha−1. The best models were calibrated for Paranavaí (Latosol), with an RMSE of 0.78 bags ha−1 and R2adj of 0.89, and Ibaiti (Argisol), with RMSE and R2adj values of 3.09 bags ha−1 and 0.83, respectively. Paranavaí has a mean difference between the actual and estimated coffee yield of only 0.86 bags ha−1. The highest deviations were observed in Wenceslau Braz (9.17 bags ha−1) and the lowest deviations were found in Paranavaí (0.86 bags ha−1). The models can be used to predict arabica coffee yield, assisting the planning of coffee farmers in the northern region of the state of Paraná.
publishDate 2022
dc.date.none.fl_str_mv 2022-04-29T08:39:50Z
2022-04-29T08:39:50Z
2022-01-01
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://dx.doi.org/10.1007/s00704-022-03983-z
Theoretical and Applied Climatology.
1434-4483
0177-798X
http://hdl.handle.net/11449/230415
10.1007/s00704-022-03983-z
2-s2.0-85124839115
url http://dx.doi.org/10.1007/s00704-022-03983-z
http://hdl.handle.net/11449/230415
identifier_str_mv Theoretical and Applied Climatology.
1434-4483
0177-798X
10.1007/s00704-022-03983-z
2-s2.0-85124839115
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Theoretical and Applied Climatology
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.source.none.fl_str_mv Scopus
reponame:Repositório Institucional da UNESP
instname:Universidade Estadual Paulista (UNESP)
instacron:UNESP
instname_str Universidade Estadual Paulista (UNESP)
instacron_str UNESP
institution UNESP
reponame_str Repositório Institucional da UNESP
collection Repositório Institucional da UNESP
repository.name.fl_str_mv Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)
repository.mail.fl_str_mv
_version_ 1803650090959634432