Triple categorical regression for genomic selection: application to cassava breeding

Detalhes bibliográficos
Autor(a) principal: Lima, Leísa Pires
Data de Publicação: 2019
Outros Autores: Azevedo, Camila Ferreira, Resende, Marcos Deon Vilela de, Silva, Fabyano Fonseca e, Viana, José Marcelo Soriano, Oliveira, Eder Jorge de
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Scientia Agrícola (Online)
Texto Completo: https://www.revistas.usp.br/sa/article/view/160600
Resumo: Genome-wide selection (GWS) is currently a technique of great importance in plant breeding, since it improves efficiency of genetic evaluations by increasing genetic gains. The process is based on genomic estimated breeding values (GEBVs) obtained through phenotypic and dense marker genomic information. In this context, GEBVs of N individuals are calculated through appropriate models, which estimate the effect of each marker on phenotypes, allowing the early identification of genetically superior individuals. However, GWS leads to statistical challenges, due to high dimensionality and multicollinearity problems. These challenges require the use of statistical methods to approach the regularization of the estimation process. Therefore, we aimed to propose a method denominated as triple categorical regression (TCR) and compare it with the genomic best linear unbiased predictor (G-BLUP) and Bayesian least absolute shrinkage and selection operator (BLASSO) methods that have been widely applied to GWS. The methods were evaluated in simulated populations considering four different scenarios. Additionally, a modification of the G-BLUP method was proposed based on the TCR-estimated (TCR/G-BLUP) results. All methods were applied to real data of cassava (Manihot esculenta) with to increase efficiency of a current breeding program. The methods were compared through independent validation and efficiency measures, such as prediction accuracy, bias, and recovered genomic heritability. The TCR method was suitable to estimate variance components and heritability, and the TCR/G-BLUP method provided efficient GEBV predictions. Thus, the proposed methods provide new insights for GWS.
id USP-18_71f1a0e76af3a9e17da4d77fde7798e6
oai_identifier_str oai:revistas.usp.br:article/160600
network_acronym_str USP-18
network_name_str Scientia Agrícola (Online)
repository_id_str
spelling Triple categorical regression for genomic selection: application to cassava breedingG-BLUPBLASSOgenomic predictiongenomic heritabilityGenome-wide selection (GWS) is currently a technique of great importance in plant breeding, since it improves efficiency of genetic evaluations by increasing genetic gains. The process is based on genomic estimated breeding values (GEBVs) obtained through phenotypic and dense marker genomic information. In this context, GEBVs of N individuals are calculated through appropriate models, which estimate the effect of each marker on phenotypes, allowing the early identification of genetically superior individuals. However, GWS leads to statistical challenges, due to high dimensionality and multicollinearity problems. These challenges require the use of statistical methods to approach the regularization of the estimation process. Therefore, we aimed to propose a method denominated as triple categorical regression (TCR) and compare it with the genomic best linear unbiased predictor (G-BLUP) and Bayesian least absolute shrinkage and selection operator (BLASSO) methods that have been widely applied to GWS. The methods were evaluated in simulated populations considering four different scenarios. Additionally, a modification of the G-BLUP method was proposed based on the TCR-estimated (TCR/G-BLUP) results. All methods were applied to real data of cassava (Manihot esculenta) with to increase efficiency of a current breeding program. The methods were compared through independent validation and efficiency measures, such as prediction accuracy, bias, and recovered genomic heritability. The TCR method was suitable to estimate variance components and heritability, and the TCR/G-BLUP method provided efficient GEBV predictions. Thus, the proposed methods provide new insights for GWS.Universidade de São Paulo. Escola Superior de Agricultura Luiz de Queiroz2019-08-01info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionapplication/pdfhttps://www.revistas.usp.br/sa/article/view/16060010.1590/1678-992x-2017-0369Scientia Agricola; v. 76 n. 5 (2019); 368-375Scientia Agricola; Vol. 76 Núm. 5 (2019); 368-375Scientia Agricola; Vol. 76 No. 5 (2019); 368-3751678-992X0103-9016reponame:Scientia Agrícola (Online)instname:Universidade de São Paulo (USP)instacron:USPenghttps://www.revistas.usp.br/sa/article/view/160600/154862Copyright (c) 2019 Scientia Agricolainfo:eu-repo/semantics/openAccessLima, Leísa PiresAzevedo, Camila FerreiraResende, Marcos Deon Vilela deSilva, Fabyano Fonseca eViana, José Marcelo SorianoOliveira, Eder Jorge de2019-08-02T11:25:22Zoai:revistas.usp.br:article/160600Revistahttp://revistas.usp.br/sa/indexPUBhttps://old.scielo.br/oai/scielo-oai.phpscientia@usp.br||alleoni@usp.br1678-992X0103-9016opendoar:2019-08-02T11:25:22Scientia Agrícola (Online) - Universidade de São Paulo (USP)false
dc.title.none.fl_str_mv Triple categorical regression for genomic selection: application to cassava breeding
title Triple categorical regression for genomic selection: application to cassava breeding
spellingShingle Triple categorical regression for genomic selection: application to cassava breeding
Lima, Leísa Pires
G-BLUP
BLASSO
genomic prediction
genomic heritability
title_short Triple categorical regression for genomic selection: application to cassava breeding
title_full Triple categorical regression for genomic selection: application to cassava breeding
title_fullStr Triple categorical regression for genomic selection: application to cassava breeding
title_full_unstemmed Triple categorical regression for genomic selection: application to cassava breeding
title_sort Triple categorical regression for genomic selection: application to cassava breeding
author Lima, Leísa Pires
author_facet Lima, Leísa Pires
Azevedo, Camila Ferreira
Resende, Marcos Deon Vilela de
Silva, Fabyano Fonseca e
Viana, José Marcelo Soriano
Oliveira, Eder Jorge de
author_role author
author2 Azevedo, Camila Ferreira
Resende, Marcos Deon Vilela de
Silva, Fabyano Fonseca e
Viana, José Marcelo Soriano
Oliveira, Eder Jorge de
author2_role author
author
author
author
author
dc.contributor.author.fl_str_mv Lima, Leísa Pires
Azevedo, Camila Ferreira
Resende, Marcos Deon Vilela de
Silva, Fabyano Fonseca e
Viana, José Marcelo Soriano
Oliveira, Eder Jorge de
dc.subject.por.fl_str_mv G-BLUP
BLASSO
genomic prediction
genomic heritability
topic G-BLUP
BLASSO
genomic prediction
genomic heritability
description Genome-wide selection (GWS) is currently a technique of great importance in plant breeding, since it improves efficiency of genetic evaluations by increasing genetic gains. The process is based on genomic estimated breeding values (GEBVs) obtained through phenotypic and dense marker genomic information. In this context, GEBVs of N individuals are calculated through appropriate models, which estimate the effect of each marker on phenotypes, allowing the early identification of genetically superior individuals. However, GWS leads to statistical challenges, due to high dimensionality and multicollinearity problems. These challenges require the use of statistical methods to approach the regularization of the estimation process. Therefore, we aimed to propose a method denominated as triple categorical regression (TCR) and compare it with the genomic best linear unbiased predictor (G-BLUP) and Bayesian least absolute shrinkage and selection operator (BLASSO) methods that have been widely applied to GWS. The methods were evaluated in simulated populations considering four different scenarios. Additionally, a modification of the G-BLUP method was proposed based on the TCR-estimated (TCR/G-BLUP) results. All methods were applied to real data of cassava (Manihot esculenta) with to increase efficiency of a current breeding program. The methods were compared through independent validation and efficiency measures, such as prediction accuracy, bias, and recovered genomic heritability. The TCR method was suitable to estimate variance components and heritability, and the TCR/G-BLUP method provided efficient GEBV predictions. Thus, the proposed methods provide new insights for GWS.
publishDate 2019
dc.date.none.fl_str_mv 2019-08-01
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://www.revistas.usp.br/sa/article/view/160600
10.1590/1678-992x-2017-0369
url https://www.revistas.usp.br/sa/article/view/160600
identifier_str_mv 10.1590/1678-992x-2017-0369
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv https://www.revistas.usp.br/sa/article/view/160600/154862
dc.rights.driver.fl_str_mv Copyright (c) 2019 Scientia Agricola
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Copyright (c) 2019 Scientia Agricola
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Universidade de São Paulo. Escola Superior de Agricultura Luiz de Queiroz
publisher.none.fl_str_mv Universidade de São Paulo. Escola Superior de Agricultura Luiz de Queiroz
dc.source.none.fl_str_mv Scientia Agricola; v. 76 n. 5 (2019); 368-375
Scientia Agricola; Vol. 76 Núm. 5 (2019); 368-375
Scientia Agricola; Vol. 76 No. 5 (2019); 368-375
1678-992X
0103-9016
reponame:Scientia Agrícola (Online)
instname:Universidade de São Paulo (USP)
instacron:USP
instname_str Universidade de São Paulo (USP)
instacron_str USP
institution USP
reponame_str Scientia Agrícola (Online)
collection Scientia Agrícola (Online)
repository.name.fl_str_mv Scientia Agrícola (Online) - Universidade de São Paulo (USP)
repository.mail.fl_str_mv scientia@usp.br||alleoni@usp.br
_version_ 1800222793985425408