Marker effects and heritability estimates using additive-dominance genomic architectures via artificial neural networks in Coffea canephora.

Detalhes bibliográficos
Autor(a) principal: SOUSA, I. C. de
Data de Publicação: 2022
Outros Autores: NASCIMENTO, M., SANT’ANNA, I. de C., CAIXETA, E. T., AZEVEDO, C. F., CRUZ, C. D., SILVA, F. L. da, ALKIMIM, E. R., NASCIMENTO, A. C. C., SERÃO, N. V. L.
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
Texto Completo: http://www.alice.cnptia.embrapa.br/alice/handle/doc/1143026
https://doi.org/10.1371/journal.pone.0262055
Resumo: Many methodologies are used to predict the genetic merit in animals and plants, but some of them require priori assumptions that may increase the complexity of the model. Artificial neural network (ANN) has advantage to not require priori assumptions about the relationships between inputs and the output allowing great flexibility to handle different types of complex non-additive effects, such as dominance and epistasis. Despite this advantage, the biological interpretability of ANNs is still limited. The aim of this research was to estimate the heritability and markers effects for two traits in Coffea canephora using an additive-dominance architecture ANN and to compare it with genomic best linear unbiased prediction (GBLUP). The data used consists of 51 clones of C. canephora varietal Conilon, 32 of varietal group Robusta and 82 intervarietal hybrids. From this, 165 phenotyped individuals were genotyped for 14,387 SNPs. Due to the high computational cost of ANNs, we used Bagging decision tree to reduce the dimensionality of the data, selecting the markers that accumulated 70% of the total importance. An ANN with three hidden layers was run, each varying from 1 to 40 neurons summing 64,000 neural networks. The network architectures with the best predictive ability were selected. The best architectures were composed by 4, 15, and 33 neurons in the first, second and third hidden layers, respectively, for yield, and by 13, 20, and 24 neurons, respectively for rust resistance. The predictive ability was greater when using ANN with three hidden layers than using one hidden layer and GBLUP, with 0.72 and 0.88 for yield and coffee leaf rust resistance, respectively. The concordance rate (CR) of the 10% larger markers effects among the methods varied between 10% and 13.8%, for additive effects and between 5.4% and 11.9% for dominance effects. The narrow-sense (h2a ) and dominance-only (h2a ) heritability estimates were 0.25 and 0.06, respectively, for yield, and 0.67 and 0.03, respectively for rust resistance. The ANN was able to estimate the heritabilities from an additive-dominance genomic architectures and the ANN with three hidden layers obtained best predictive ability when compared with those obtained from GBLUP and ANN with one hidden layer.
id EMBR_2a650d85b4ea8b3d6453b8769dcdaaff
oai_identifier_str oai:www.alice.cnptia.embrapa.br:doc/1143026
network_acronym_str EMBR
network_name_str Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
repository_id_str 2154
spelling Marker effects and heritability estimates using additive-dominance genomic architectures via artificial neural networks in Coffea canephora.Rede neural artificialMarcador GenéticoCoffea CanephoraNeural networksDominance (genetics)Many methodologies are used to predict the genetic merit in animals and plants, but some of them require priori assumptions that may increase the complexity of the model. Artificial neural network (ANN) has advantage to not require priori assumptions about the relationships between inputs and the output allowing great flexibility to handle different types of complex non-additive effects, such as dominance and epistasis. Despite this advantage, the biological interpretability of ANNs is still limited. The aim of this research was to estimate the heritability and markers effects for two traits in Coffea canephora using an additive-dominance architecture ANN and to compare it with genomic best linear unbiased prediction (GBLUP). The data used consists of 51 clones of C. canephora varietal Conilon, 32 of varietal group Robusta and 82 intervarietal hybrids. From this, 165 phenotyped individuals were genotyped for 14,387 SNPs. Due to the high computational cost of ANNs, we used Bagging decision tree to reduce the dimensionality of the data, selecting the markers that accumulated 70% of the total importance. An ANN with three hidden layers was run, each varying from 1 to 40 neurons summing 64,000 neural networks. The network architectures with the best predictive ability were selected. The best architectures were composed by 4, 15, and 33 neurons in the first, second and third hidden layers, respectively, for yield, and by 13, 20, and 24 neurons, respectively for rust resistance. The predictive ability was greater when using ANN with three hidden layers than using one hidden layer and GBLUP, with 0.72 and 0.88 for yield and coffee leaf rust resistance, respectively. The concordance rate (CR) of the 10% larger markers effects among the methods varied between 10% and 13.8%, for additive effects and between 5.4% and 11.9% for dominance effects. The narrow-sense (h2a ) and dominance-only (h2a ) heritability estimates were 0.25 and 0.06, respectively, for yield, and 0.67 and 0.03, respectively for rust resistance. The ANN was able to estimate the heritabilities from an additive-dominance genomic architectures and the ANN with three hidden layers obtained best predictive ability when compared with those obtained from GBLUP and ANN with one hidden layer.ITHALO COELHO DE SOUSA, IOWA STATE UNIVERSITYMOYSÉS NASCIMENTO, UFVISABELA DE CASTRO SANT’ANNA, IACEVELINE TEIXEIRA CAIXETA MOURA, CNPCaCAMILA FERREIRA AZEVEDO, UFVCOSME DAMIÃO CRUZ, UFVFELIPE LOPES DA SILVA, UFVEMILLY RUAS ALKIMIM, UFMTANA CAROLINA CAMPANA NASCIMENTO, UFVNICK VERGARA LOPES SERÃO, IOWA STATE UNIVERSITY.SOUSA, I. C. deNASCIMENTO, M.SANT’ANNA, I. de C.CAIXETA, E. T.AZEVEDO, C. F.CRUZ, C. D.SILVA, F. L. daALKIMIM, E. R.NASCIMENTO, A. C. C.SERÃO, N. V. L.2022-05-16T15:13:40Z2022-05-16T15:13:40Z2022-05-162022info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlePlos One, v. 17, n.1, e0262055, 2022.http://www.alice.cnptia.embrapa.br/alice/handle/doc/1143026https://doi.org/10.1371/journal.pone.0262055enginfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)instacron:EMBRAPA2022-05-16T15:13:49Zoai:www.alice.cnptia.embrapa.br:doc/1143026Repositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestopendoar:21542022-05-16T15:13:49falseRepositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestcg-riaa@embrapa.bropendoar:21542022-05-16T15:13:49Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)false
dc.title.none.fl_str_mv Marker effects and heritability estimates using additive-dominance genomic architectures via artificial neural networks in Coffea canephora.
title Marker effects and heritability estimates using additive-dominance genomic architectures via artificial neural networks in Coffea canephora.
spellingShingle Marker effects and heritability estimates using additive-dominance genomic architectures via artificial neural networks in Coffea canephora.
SOUSA, I. C. de
Rede neural artificial
Marcador Genético
Coffea Canephora
Neural networks
Dominance (genetics)
title_short Marker effects and heritability estimates using additive-dominance genomic architectures via artificial neural networks in Coffea canephora.
title_full Marker effects and heritability estimates using additive-dominance genomic architectures via artificial neural networks in Coffea canephora.
title_fullStr Marker effects and heritability estimates using additive-dominance genomic architectures via artificial neural networks in Coffea canephora.
title_full_unstemmed Marker effects and heritability estimates using additive-dominance genomic architectures via artificial neural networks in Coffea canephora.
title_sort Marker effects and heritability estimates using additive-dominance genomic architectures via artificial neural networks in Coffea canephora.
author SOUSA, I. C. de
author_facet SOUSA, I. C. de
NASCIMENTO, M.
SANT’ANNA, I. de C.
CAIXETA, E. T.
AZEVEDO, C. F.
CRUZ, C. D.
SILVA, F. L. da
ALKIMIM, E. R.
NASCIMENTO, A. C. C.
SERÃO, N. V. L.
author_role author
author2 NASCIMENTO, M.
SANT’ANNA, I. de C.
CAIXETA, E. T.
AZEVEDO, C. F.
CRUZ, C. D.
SILVA, F. L. da
ALKIMIM, E. R.
NASCIMENTO, A. C. C.
SERÃO, N. V. L.
author2_role author
author
author
author
author
author
author
author
author
dc.contributor.none.fl_str_mv ITHALO COELHO DE SOUSA, IOWA STATE UNIVERSITY
MOYSÉS NASCIMENTO, UFV
ISABELA DE CASTRO SANT’ANNA, IAC
EVELINE TEIXEIRA CAIXETA MOURA, CNPCa
CAMILA FERREIRA AZEVEDO, UFV
COSME DAMIÃO CRUZ, UFV
FELIPE LOPES DA SILVA, UFV
EMILLY RUAS ALKIMIM, UFMT
ANA CAROLINA CAMPANA NASCIMENTO, UFV
NICK VERGARA LOPES SERÃO, IOWA STATE UNIVERSITY.
dc.contributor.author.fl_str_mv SOUSA, I. C. de
NASCIMENTO, M.
SANT’ANNA, I. de C.
CAIXETA, E. T.
AZEVEDO, C. F.
CRUZ, C. D.
SILVA, F. L. da
ALKIMIM, E. R.
NASCIMENTO, A. C. C.
SERÃO, N. V. L.
dc.subject.por.fl_str_mv Rede neural artificial
Marcador Genético
Coffea Canephora
Neural networks
Dominance (genetics)
topic Rede neural artificial
Marcador Genético
Coffea Canephora
Neural networks
Dominance (genetics)
description Many methodologies are used to predict the genetic merit in animals and plants, but some of them require priori assumptions that may increase the complexity of the model. Artificial neural network (ANN) has advantage to not require priori assumptions about the relationships between inputs and the output allowing great flexibility to handle different types of complex non-additive effects, such as dominance and epistasis. Despite this advantage, the biological interpretability of ANNs is still limited. The aim of this research was to estimate the heritability and markers effects for two traits in Coffea canephora using an additive-dominance architecture ANN and to compare it with genomic best linear unbiased prediction (GBLUP). The data used consists of 51 clones of C. canephora varietal Conilon, 32 of varietal group Robusta and 82 intervarietal hybrids. From this, 165 phenotyped individuals were genotyped for 14,387 SNPs. Due to the high computational cost of ANNs, we used Bagging decision tree to reduce the dimensionality of the data, selecting the markers that accumulated 70% of the total importance. An ANN with three hidden layers was run, each varying from 1 to 40 neurons summing 64,000 neural networks. The network architectures with the best predictive ability were selected. The best architectures were composed by 4, 15, and 33 neurons in the first, second and third hidden layers, respectively, for yield, and by 13, 20, and 24 neurons, respectively for rust resistance. The predictive ability was greater when using ANN with three hidden layers than using one hidden layer and GBLUP, with 0.72 and 0.88 for yield and coffee leaf rust resistance, respectively. The concordance rate (CR) of the 10% larger markers effects among the methods varied between 10% and 13.8%, for additive effects and between 5.4% and 11.9% for dominance effects. The narrow-sense (h2a ) and dominance-only (h2a ) heritability estimates were 0.25 and 0.06, respectively, for yield, and 0.67 and 0.03, respectively for rust resistance. The ANN was able to estimate the heritabilities from an additive-dominance genomic architectures and the ANN with three hidden layers obtained best predictive ability when compared with those obtained from GBLUP and ANN with one hidden layer.
publishDate 2022
dc.date.none.fl_str_mv 2022-05-16T15:13:40Z
2022-05-16T15:13:40Z
2022-05-16
2022
dc.type.driver.fl_str_mv info:eu-repo/semantics/publishedVersion
info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv Plos One, v. 17, n.1, e0262055, 2022.
http://www.alice.cnptia.embrapa.br/alice/handle/doc/1143026
https://doi.org/10.1371/journal.pone.0262055
identifier_str_mv Plos One, v. 17, n.1, e0262055, 2022.
url http://www.alice.cnptia.embrapa.br/alice/handle/doc/1143026
https://doi.org/10.1371/journal.pone.0262055
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.source.none.fl_str_mv reponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
instacron:EMBRAPA
instname_str Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
instacron_str EMBRAPA
institution EMBRAPA
reponame_str Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
collection Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
repository.name.fl_str_mv Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
repository.mail.fl_str_mv cg-riaa@embrapa.br
_version_ 1794503522716221440