Classification of soybean genotypes for industrial traits using UAV multispectral imagery and machine learning

Detalhes bibliográficos
Autor(a) principal: Santana, Dthenifer Cordeiro [UNESP]
Data de Publicação: 2023
Outros Autores: Teodoro, Larissa Pereira Ribeiro, Baio, Fábio Henrique Rojo, Santos, Regimar Garcia dos [UNESP], Coradi, Paulo Carteri, Biduski, Bárbara, Silva Junior, Carlos Antonio da, Teodoro, Paulo Eduardo [UNESP], Shiratsuchi, Luaciano Shozo
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Institucional da UNESP
Texto Completo: http://dx.doi.org/10.1016/j.rsase.2023.100919
http://hdl.handle.net/11449/246601
Resumo: Soybean genotypes have distinct physicochemical characteristics, mainly regarding the oil and protein contents in the grains. The use of high-throughput phe-notyping technologies allied to data processing by machine learning algorithms facili-tates and can make it faster and more precise to obtain information about the charac-teristics of the grains. Thus, the objective of the study was to identify machine learning algorithms and inputs with better performance for classifying genotypes clustered based on industrial traits. The experiment was implemented in a randomized block design with two replicates. 103 F2 soybean populations were evaluated. Red, green, near-infrared, and infrared spectral bands and the vegetation indices NDVI, NDRE, GNDVI, SAVI, MSAVI, MCARI, EVI, and SCCCI were measured using UAV multispectral imagery. The industrial traits evaluated were: crude protein content, oil yield, and ash and fiber contents. Data were subjected to Pearson correlation analysis expressed by a correlation network. A genotype clustering based on industrial traits was performed using PCA and k-means algorithm, and then the clusters formed were used as output variables of the ML models, while three input configurations were tested: only spectral bands (B), only vegetation indices (VIs), and B + VIs. ML algorithms tested were: artificial neural net-work (ANN), decision tree algorithms J48 (J48), REPTree (DT), and RandomTree (Rt), random forest (RF), Support Vector Machine (SVM), and logistic regression (LR, used as control). Statistical metrics used to evaluate the accuracy of the models were per-centage of correct classification (CC) and F-score. ML algorithms that achieved the highest classification accuracies were ANN, DT and SVM. As for the inputs tested, the best results were obtained using only spectral bands.
id UNSP_f3570804cdebe33a010b1f15ee8834eb
oai_identifier_str oai:repositorio.unesp.br:11449/246601
network_acronym_str UNSP
network_name_str Repositório Institucional da UNESP
repository_id_str 2946
spelling Classification of soybean genotypes for industrial traits using UAV multispectral imagery and machine learningComputational intelligenceHigh-throughput phenotypingPrecision agri-cultureSpectral bandsVegetation indicesSoybean genotypes have distinct physicochemical characteristics, mainly regarding the oil and protein contents in the grains. The use of high-throughput phe-notyping technologies allied to data processing by machine learning algorithms facili-tates and can make it faster and more precise to obtain information about the charac-teristics of the grains. Thus, the objective of the study was to identify machine learning algorithms and inputs with better performance for classifying genotypes clustered based on industrial traits. The experiment was implemented in a randomized block design with two replicates. 103 F2 soybean populations were evaluated. Red, green, near-infrared, and infrared spectral bands and the vegetation indices NDVI, NDRE, GNDVI, SAVI, MSAVI, MCARI, EVI, and SCCCI were measured using UAV multispectral imagery. The industrial traits evaluated were: crude protein content, oil yield, and ash and fiber contents. Data were subjected to Pearson correlation analysis expressed by a correlation network. A genotype clustering based on industrial traits was performed using PCA and k-means algorithm, and then the clusters formed were used as output variables of the ML models, while three input configurations were tested: only spectral bands (B), only vegetation indices (VIs), and B + VIs. ML algorithms tested were: artificial neural net-work (ANN), decision tree algorithms J48 (J48), REPTree (DT), and RandomTree (Rt), random forest (RF), Support Vector Machine (SVM), and logistic regression (LR, used as control). Statistical metrics used to evaluate the accuracy of the models were per-centage of correct classification (CC) and F-score. ML algorithms that achieved the highest classification accuracies were ANN, DT and SVM. As for the inputs tested, the best results were obtained using only spectral bands.Universidade Federal de Mato Grosso do SulFundação de Apoio ao Desenvolvimento do Ensino, Ciência e Tecnologia do Estado de Mato Grosso do SulConselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)Department of Agronomy State University of São Paulo (UNESP), Ilha Solteira, SPFederal University of Mato Grosso do Sul (UFMS), Chapadão do Sul, MSDepartment of Agricultural Engineering Federal University of Santa Maria, Cachoeira do Sul, RSDepartment of Food Science and Technology University of Passo Fundo, RSDepartment of Geography State University of Mato Grosso (UNEMAT), MTLSU Agcenter School of Plant Environmental and Soil Sciences Louisiana State University, 307 Sturgis HallDepartment of Agronomy State University of São Paulo (UNESP), Ilha Solteira, SPFundação de Apoio ao Desenvolvimento do Ensino, Ciência e Tecnologia do Estado de Mato Grosso do Sul: 07/2022CNPq: 303767/2020-0CNPq: 306022/2021-4CNPq: 309250/2021-8Fundação de Apoio ao Desenvolvimento do Ensino, Ciência e Tecnologia do Estado de Mato Grosso do Sul: 88/2021Universidade Estadual Paulista (UNESP)Universidade Federal de Mato Grosso do Sul (UFMS)Federal University of Santa MariaUniversity of Passo FundoState University of Mato Grosso (UNEMAT)Louisiana State UniversitySantana, Dthenifer Cordeiro [UNESP]Teodoro, Larissa Pereira RibeiroBaio, Fábio Henrique RojoSantos, Regimar Garcia dos [UNESP]Coradi, Paulo CarteriBiduski, BárbaraSilva Junior, Carlos Antonio daTeodoro, Paulo Eduardo [UNESP]Shiratsuchi, Luaciano Shozo2023-07-29T12:45:25Z2023-07-29T12:45:25Z2023-01-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://dx.doi.org/10.1016/j.rsase.2023.100919Remote Sensing Applications: Society and Environment, v. 29.2352-9385http://hdl.handle.net/11449/24660110.1016/j.rsase.2023.1009192-s2.0-85145701839Scopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengRemote Sensing Applications: Society and Environmentinfo:eu-repo/semantics/openAccess2023-07-29T12:45:25Zoai:repositorio.unesp.br:11449/246601Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462024-08-05T18:27:58.050620Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false
dc.title.none.fl_str_mv Classification of soybean genotypes for industrial traits using UAV multispectral imagery and machine learning
title Classification of soybean genotypes for industrial traits using UAV multispectral imagery and machine learning
spellingShingle Classification of soybean genotypes for industrial traits using UAV multispectral imagery and machine learning
Santana, Dthenifer Cordeiro [UNESP]
Computational intelligence
High-throughput phenotyping
Precision agri-culture
Spectral bands
Vegetation indices
title_short Classification of soybean genotypes for industrial traits using UAV multispectral imagery and machine learning
title_full Classification of soybean genotypes for industrial traits using UAV multispectral imagery and machine learning
title_fullStr Classification of soybean genotypes for industrial traits using UAV multispectral imagery and machine learning
title_full_unstemmed Classification of soybean genotypes for industrial traits using UAV multispectral imagery and machine learning
title_sort Classification of soybean genotypes for industrial traits using UAV multispectral imagery and machine learning
author Santana, Dthenifer Cordeiro [UNESP]
author_facet Santana, Dthenifer Cordeiro [UNESP]
Teodoro, Larissa Pereira Ribeiro
Baio, Fábio Henrique Rojo
Santos, Regimar Garcia dos [UNESP]
Coradi, Paulo Carteri
Biduski, Bárbara
Silva Junior, Carlos Antonio da
Teodoro, Paulo Eduardo [UNESP]
Shiratsuchi, Luaciano Shozo
author_role author
author2 Teodoro, Larissa Pereira Ribeiro
Baio, Fábio Henrique Rojo
Santos, Regimar Garcia dos [UNESP]
Coradi, Paulo Carteri
Biduski, Bárbara
Silva Junior, Carlos Antonio da
Teodoro, Paulo Eduardo [UNESP]
Shiratsuchi, Luaciano Shozo
author2_role author
author
author
author
author
author
author
author
dc.contributor.none.fl_str_mv Universidade Estadual Paulista (UNESP)
Universidade Federal de Mato Grosso do Sul (UFMS)
Federal University of Santa Maria
University of Passo Fundo
State University of Mato Grosso (UNEMAT)
Louisiana State University
dc.contributor.author.fl_str_mv Santana, Dthenifer Cordeiro [UNESP]
Teodoro, Larissa Pereira Ribeiro
Baio, Fábio Henrique Rojo
Santos, Regimar Garcia dos [UNESP]
Coradi, Paulo Carteri
Biduski, Bárbara
Silva Junior, Carlos Antonio da
Teodoro, Paulo Eduardo [UNESP]
Shiratsuchi, Luaciano Shozo
dc.subject.por.fl_str_mv Computational intelligence
High-throughput phenotyping
Precision agri-culture
Spectral bands
Vegetation indices
topic Computational intelligence
High-throughput phenotyping
Precision agri-culture
Spectral bands
Vegetation indices
description Soybean genotypes have distinct physicochemical characteristics, mainly regarding the oil and protein contents in the grains. The use of high-throughput phe-notyping technologies allied to data processing by machine learning algorithms facili-tates and can make it faster and more precise to obtain information about the charac-teristics of the grains. Thus, the objective of the study was to identify machine learning algorithms and inputs with better performance for classifying genotypes clustered based on industrial traits. The experiment was implemented in a randomized block design with two replicates. 103 F2 soybean populations were evaluated. Red, green, near-infrared, and infrared spectral bands and the vegetation indices NDVI, NDRE, GNDVI, SAVI, MSAVI, MCARI, EVI, and SCCCI were measured using UAV multispectral imagery. The industrial traits evaluated were: crude protein content, oil yield, and ash and fiber contents. Data were subjected to Pearson correlation analysis expressed by a correlation network. A genotype clustering based on industrial traits was performed using PCA and k-means algorithm, and then the clusters formed were used as output variables of the ML models, while three input configurations were tested: only spectral bands (B), only vegetation indices (VIs), and B + VIs. ML algorithms tested were: artificial neural net-work (ANN), decision tree algorithms J48 (J48), REPTree (DT), and RandomTree (Rt), random forest (RF), Support Vector Machine (SVM), and logistic regression (LR, used as control). Statistical metrics used to evaluate the accuracy of the models were per-centage of correct classification (CC) and F-score. ML algorithms that achieved the highest classification accuracies were ANN, DT and SVM. As for the inputs tested, the best results were obtained using only spectral bands.
publishDate 2023
dc.date.none.fl_str_mv 2023-07-29T12:45:25Z
2023-07-29T12:45:25Z
2023-01-01
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://dx.doi.org/10.1016/j.rsase.2023.100919
Remote Sensing Applications: Society and Environment, v. 29.
2352-9385
http://hdl.handle.net/11449/246601
10.1016/j.rsase.2023.100919
2-s2.0-85145701839
url http://dx.doi.org/10.1016/j.rsase.2023.100919
http://hdl.handle.net/11449/246601
identifier_str_mv Remote Sensing Applications: Society and Environment, v. 29.
2352-9385
10.1016/j.rsase.2023.100919
2-s2.0-85145701839
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Remote Sensing Applications: Society and Environment
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.source.none.fl_str_mv Scopus
reponame:Repositório Institucional da UNESP
instname:Universidade Estadual Paulista (UNESP)
instacron:UNESP
instname_str Universidade Estadual Paulista (UNESP)
instacron_str UNESP
institution UNESP
reponame_str Repositório Institucional da UNESP
collection Repositório Institucional da UNESP
repository.name.fl_str_mv Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)
repository.mail.fl_str_mv
_version_ 1808128936328036352