Classification of soybean genotypes for industrial traits using UAV multispectral imagery and machine learning
Autor(a) principal: | |
---|---|
Data de Publicação: | 2023 |
Outros Autores: | , , , , , , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Institucional da UNESP |
Texto Completo: | http://dx.doi.org/10.1016/j.rsase.2023.100919 http://hdl.handle.net/11449/246601 |
Resumo: | Soybean genotypes have distinct physicochemical characteristics, mainly regarding the oil and protein contents in the grains. The use of high-throughput phe-notyping technologies allied to data processing by machine learning algorithms facili-tates and can make it faster and more precise to obtain information about the charac-teristics of the grains. Thus, the objective of the study was to identify machine learning algorithms and inputs with better performance for classifying genotypes clustered based on industrial traits. The experiment was implemented in a randomized block design with two replicates. 103 F2 soybean populations were evaluated. Red, green, near-infrared, and infrared spectral bands and the vegetation indices NDVI, NDRE, GNDVI, SAVI, MSAVI, MCARI, EVI, and SCCCI were measured using UAV multispectral imagery. The industrial traits evaluated were: crude protein content, oil yield, and ash and fiber contents. Data were subjected to Pearson correlation analysis expressed by a correlation network. A genotype clustering based on industrial traits was performed using PCA and k-means algorithm, and then the clusters formed were used as output variables of the ML models, while three input configurations were tested: only spectral bands (B), only vegetation indices (VIs), and B + VIs. ML algorithms tested were: artificial neural net-work (ANN), decision tree algorithms J48 (J48), REPTree (DT), and RandomTree (Rt), random forest (RF), Support Vector Machine (SVM), and logistic regression (LR, used as control). Statistical metrics used to evaluate the accuracy of the models were per-centage of correct classification (CC) and F-score. ML algorithms that achieved the highest classification accuracies were ANN, DT and SVM. As for the inputs tested, the best results were obtained using only spectral bands. |
id |
UNSP_f3570804cdebe33a010b1f15ee8834eb |
---|---|
oai_identifier_str |
oai:repositorio.unesp.br:11449/246601 |
network_acronym_str |
UNSP |
network_name_str |
Repositório Institucional da UNESP |
repository_id_str |
2946 |
spelling |
Classification of soybean genotypes for industrial traits using UAV multispectral imagery and machine learningComputational intelligenceHigh-throughput phenotypingPrecision agri-cultureSpectral bandsVegetation indicesSoybean genotypes have distinct physicochemical characteristics, mainly regarding the oil and protein contents in the grains. The use of high-throughput phe-notyping technologies allied to data processing by machine learning algorithms facili-tates and can make it faster and more precise to obtain information about the charac-teristics of the grains. Thus, the objective of the study was to identify machine learning algorithms and inputs with better performance for classifying genotypes clustered based on industrial traits. The experiment was implemented in a randomized block design with two replicates. 103 F2 soybean populations were evaluated. Red, green, near-infrared, and infrared spectral bands and the vegetation indices NDVI, NDRE, GNDVI, SAVI, MSAVI, MCARI, EVI, and SCCCI were measured using UAV multispectral imagery. The industrial traits evaluated were: crude protein content, oil yield, and ash and fiber contents. Data were subjected to Pearson correlation analysis expressed by a correlation network. A genotype clustering based on industrial traits was performed using PCA and k-means algorithm, and then the clusters formed were used as output variables of the ML models, while three input configurations were tested: only spectral bands (B), only vegetation indices (VIs), and B + VIs. ML algorithms tested were: artificial neural net-work (ANN), decision tree algorithms J48 (J48), REPTree (DT), and RandomTree (Rt), random forest (RF), Support Vector Machine (SVM), and logistic regression (LR, used as control). Statistical metrics used to evaluate the accuracy of the models were per-centage of correct classification (CC) and F-score. ML algorithms that achieved the highest classification accuracies were ANN, DT and SVM. As for the inputs tested, the best results were obtained using only spectral bands.Universidade Federal de Mato Grosso do SulFundação de Apoio ao Desenvolvimento do Ensino, Ciência e Tecnologia do Estado de Mato Grosso do SulConselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)Department of Agronomy State University of São Paulo (UNESP), Ilha Solteira, SPFederal University of Mato Grosso do Sul (UFMS), Chapadão do Sul, MSDepartment of Agricultural Engineering Federal University of Santa Maria, Cachoeira do Sul, RSDepartment of Food Science and Technology University of Passo Fundo, RSDepartment of Geography State University of Mato Grosso (UNEMAT), MTLSU Agcenter School of Plant Environmental and Soil Sciences Louisiana State University, 307 Sturgis HallDepartment of Agronomy State University of São Paulo (UNESP), Ilha Solteira, SPFundação de Apoio ao Desenvolvimento do Ensino, Ciência e Tecnologia do Estado de Mato Grosso do Sul: 07/2022CNPq: 303767/2020-0CNPq: 306022/2021-4CNPq: 309250/2021-8Fundação de Apoio ao Desenvolvimento do Ensino, Ciência e Tecnologia do Estado de Mato Grosso do Sul: 88/2021Universidade Estadual Paulista (UNESP)Universidade Federal de Mato Grosso do Sul (UFMS)Federal University of Santa MariaUniversity of Passo FundoState University of Mato Grosso (UNEMAT)Louisiana State UniversitySantana, Dthenifer Cordeiro [UNESP]Teodoro, Larissa Pereira RibeiroBaio, Fábio Henrique RojoSantos, Regimar Garcia dos [UNESP]Coradi, Paulo CarteriBiduski, BárbaraSilva Junior, Carlos Antonio daTeodoro, Paulo Eduardo [UNESP]Shiratsuchi, Luaciano Shozo2023-07-29T12:45:25Z2023-07-29T12:45:25Z2023-01-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://dx.doi.org/10.1016/j.rsase.2023.100919Remote Sensing Applications: Society and Environment, v. 29.2352-9385http://hdl.handle.net/11449/24660110.1016/j.rsase.2023.1009192-s2.0-85145701839Scopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengRemote Sensing Applications: Society and Environmentinfo:eu-repo/semantics/openAccess2023-07-29T12:45:25Zoai:repositorio.unesp.br:11449/246601Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462024-08-05T18:27:58.050620Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false |
dc.title.none.fl_str_mv |
Classification of soybean genotypes for industrial traits using UAV multispectral imagery and machine learning |
title |
Classification of soybean genotypes for industrial traits using UAV multispectral imagery and machine learning |
spellingShingle |
Classification of soybean genotypes for industrial traits using UAV multispectral imagery and machine learning Santana, Dthenifer Cordeiro [UNESP] Computational intelligence High-throughput phenotyping Precision agri-culture Spectral bands Vegetation indices |
title_short |
Classification of soybean genotypes for industrial traits using UAV multispectral imagery and machine learning |
title_full |
Classification of soybean genotypes for industrial traits using UAV multispectral imagery and machine learning |
title_fullStr |
Classification of soybean genotypes for industrial traits using UAV multispectral imagery and machine learning |
title_full_unstemmed |
Classification of soybean genotypes for industrial traits using UAV multispectral imagery and machine learning |
title_sort |
Classification of soybean genotypes for industrial traits using UAV multispectral imagery and machine learning |
author |
Santana, Dthenifer Cordeiro [UNESP] |
author_facet |
Santana, Dthenifer Cordeiro [UNESP] Teodoro, Larissa Pereira Ribeiro Baio, Fábio Henrique Rojo Santos, Regimar Garcia dos [UNESP] Coradi, Paulo Carteri Biduski, Bárbara Silva Junior, Carlos Antonio da Teodoro, Paulo Eduardo [UNESP] Shiratsuchi, Luaciano Shozo |
author_role |
author |
author2 |
Teodoro, Larissa Pereira Ribeiro Baio, Fábio Henrique Rojo Santos, Regimar Garcia dos [UNESP] Coradi, Paulo Carteri Biduski, Bárbara Silva Junior, Carlos Antonio da Teodoro, Paulo Eduardo [UNESP] Shiratsuchi, Luaciano Shozo |
author2_role |
author author author author author author author author |
dc.contributor.none.fl_str_mv |
Universidade Estadual Paulista (UNESP) Universidade Federal de Mato Grosso do Sul (UFMS) Federal University of Santa Maria University of Passo Fundo State University of Mato Grosso (UNEMAT) Louisiana State University |
dc.contributor.author.fl_str_mv |
Santana, Dthenifer Cordeiro [UNESP] Teodoro, Larissa Pereira Ribeiro Baio, Fábio Henrique Rojo Santos, Regimar Garcia dos [UNESP] Coradi, Paulo Carteri Biduski, Bárbara Silva Junior, Carlos Antonio da Teodoro, Paulo Eduardo [UNESP] Shiratsuchi, Luaciano Shozo |
dc.subject.por.fl_str_mv |
Computational intelligence High-throughput phenotyping Precision agri-culture Spectral bands Vegetation indices |
topic |
Computational intelligence High-throughput phenotyping Precision agri-culture Spectral bands Vegetation indices |
description |
Soybean genotypes have distinct physicochemical characteristics, mainly regarding the oil and protein contents in the grains. The use of high-throughput phe-notyping technologies allied to data processing by machine learning algorithms facili-tates and can make it faster and more precise to obtain information about the charac-teristics of the grains. Thus, the objective of the study was to identify machine learning algorithms and inputs with better performance for classifying genotypes clustered based on industrial traits. The experiment was implemented in a randomized block design with two replicates. 103 F2 soybean populations were evaluated. Red, green, near-infrared, and infrared spectral bands and the vegetation indices NDVI, NDRE, GNDVI, SAVI, MSAVI, MCARI, EVI, and SCCCI were measured using UAV multispectral imagery. The industrial traits evaluated were: crude protein content, oil yield, and ash and fiber contents. Data were subjected to Pearson correlation analysis expressed by a correlation network. A genotype clustering based on industrial traits was performed using PCA and k-means algorithm, and then the clusters formed were used as output variables of the ML models, while three input configurations were tested: only spectral bands (B), only vegetation indices (VIs), and B + VIs. ML algorithms tested were: artificial neural net-work (ANN), decision tree algorithms J48 (J48), REPTree (DT), and RandomTree (Rt), random forest (RF), Support Vector Machine (SVM), and logistic regression (LR, used as control). Statistical metrics used to evaluate the accuracy of the models were per-centage of correct classification (CC) and F-score. ML algorithms that achieved the highest classification accuracies were ANN, DT and SVM. As for the inputs tested, the best results were obtained using only spectral bands. |
publishDate |
2023 |
dc.date.none.fl_str_mv |
2023-07-29T12:45:25Z 2023-07-29T12:45:25Z 2023-01-01 |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://dx.doi.org/10.1016/j.rsase.2023.100919 Remote Sensing Applications: Society and Environment, v. 29. 2352-9385 http://hdl.handle.net/11449/246601 10.1016/j.rsase.2023.100919 2-s2.0-85145701839 |
url |
http://dx.doi.org/10.1016/j.rsase.2023.100919 http://hdl.handle.net/11449/246601 |
identifier_str_mv |
Remote Sensing Applications: Society and Environment, v. 29. 2352-9385 10.1016/j.rsase.2023.100919 2-s2.0-85145701839 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
Remote Sensing Applications: Society and Environment |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.source.none.fl_str_mv |
Scopus reponame:Repositório Institucional da UNESP instname:Universidade Estadual Paulista (UNESP) instacron:UNESP |
instname_str |
Universidade Estadual Paulista (UNESP) |
instacron_str |
UNESP |
institution |
UNESP |
reponame_str |
Repositório Institucional da UNESP |
collection |
Repositório Institucional da UNESP |
repository.name.fl_str_mv |
Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP) |
repository.mail.fl_str_mv |
|
_version_ |
1808128936328036352 |