Building the Embrapa rice breeding dataset for efficient data reuse.

Detalhes bibliográficos
Autor(a) principal: BRESEGHELLO, F.
Data de Publicação: 2021
Outros Autores: MELLO, R. N. de, PINHEIRO, P. V., SOARES, D. M., LOPES JUNIOR, S., RANGEL, P. H. N., GUIMARÃES, E. P., CASTRO, A. P. de, COLOMBARI FILHO, J. M., MAGALHÃES JUNIOR, A. M. de, FAGUNDES, P. R. R., NEVES, P. de C. F., FURTINI, I. V., UTUMI, M. M., PEREIRA, J. A., CORDEIRO, A. C. C., SILVEIRA FILHO, A., ABREU, G. B., MOURA NETO, F. P., PIETRAGALLA, J., VARGAS HERNÁNDEZ, M., CROSSA, J.
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
Texto Completo: http://www.alice.cnptia.embrapa.br/alice/handle/doc/1132588
https://doi.org/10.1002/csc2.20550
Resumo: Embrapa has led breeding programs for irrigated and upland rice (Oryza sativa L.) since 1977, generating a large amount of pedigree and phenotypic data. However, there were no systematic standards for data recording nor long-term data preservation and reuse strategies. With the new aim of making data reuse practical, we recovered all data available and structured it into the Embrapa Rice Breeding Dataset (ERBD). In its current version, the ERBD includes 20,504 crosses involving 9,974 parents, the pedigrees of most of the 4,532 inbred lines that took part in advanced field trials, and phenotypic data from 2,711 field trials (1,118 irrigated, 1,593 upland trials), representing 226,458 field plots. Those trials were conducted over 38 years (1982-2019), in 247 locations, in latitudes ranging from 3°N to 33°S. Phenotypic traits included grain yield, days to flowering, plant height, canopy lodging, and five important fungal diseases: leaf blast, panicle blast, brown spot, leaf scald, and grain discoloration. The total number of data points surpasses 1.27 million. Descriptive statistics were computed over the dataset, split by cropping systems (irrigated or upland). The mean heritability of grain yield was high for both systems, at around .7, whereas the mean coefficient of variation was 13.9% for irrigated trials and 18.7% for upland trials. The ERBD offers the possibility of conducting studies on different aspects of rice breeding and genetics, including genetic gain, G×E analysis, genome-wide association studies and genomic prediction.
id EMBR_6acc4cd5601a618c79674951db2456be
oai_identifier_str oai:www.alice.cnptia.embrapa.br:doc/1132588
network_acronym_str EMBR
network_name_str Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
repository_id_str 2154
spelling Building the Embrapa rice breeding dataset for efficient data reuse.Banco de dadosArrozOryza SativaMelhoramento Genético VegetalFenótipoRicePlant breedingDatabasesGeneticsEmbrapa has led breeding programs for irrigated and upland rice (Oryza sativa L.) since 1977, generating a large amount of pedigree and phenotypic data. However, there were no systematic standards for data recording nor long-term data preservation and reuse strategies. With the new aim of making data reuse practical, we recovered all data available and structured it into the Embrapa Rice Breeding Dataset (ERBD). In its current version, the ERBD includes 20,504 crosses involving 9,974 parents, the pedigrees of most of the 4,532 inbred lines that took part in advanced field trials, and phenotypic data from 2,711 field trials (1,118 irrigated, 1,593 upland trials), representing 226,458 field plots. Those trials were conducted over 38 years (1982-2019), in 247 locations, in latitudes ranging from 3°N to 33°S. Phenotypic traits included grain yield, days to flowering, plant height, canopy lodging, and five important fungal diseases: leaf blast, panicle blast, brown spot, leaf scald, and grain discoloration. The total number of data points surpasses 1.27 million. Descriptive statistics were computed over the dataset, split by cropping systems (irrigated or upland). The mean heritability of grain yield was high for both systems, at around .7, whereas the mean coefficient of variation was 13.9% for irrigated trials and 18.7% for upland trials. The ERBD offers the possibility of conducting studies on different aspects of rice breeding and genetics, including genetic gain, G×E analysis, genome-wide association studies and genomic prediction.FLAVIO BRESEGHELLO, CNPAF; RAQUEL NEVES DE MELLO, CNPAF; PATRICIA VALLE PINHEIRO, CNPAF; DINO MAGALHAES SOARES, CNPAF; SERGIO LOPES JUNIOR, CNPAF; PAULO HIDEO NAKANO RANGEL, CNPAF; ELCIO PERPETUO GUIMARAES, CNPAF; ADRIANO PEREIRA DE CASTRO, CNPAF; JOSE MANOEL COLOMBARI FILHO, CNPAF; ARIANO MARTINS DE MAGALHAES JUNIOR, CPACT; PAULO RICARDO REIS FAGUNDES, CPACT; PERICLES DE CARVALHO FERREIRA NEVES, CNPAF; ISABELA VOLPI FURTINI, CNPAF; MARLEY MARICO UTUMI, CPAF-RO; JOSE ALMEIDA PEREIRA, CPAMN; ANTONIO CARLOS CENTENO CORDEIRO, CPAF-RR; AUSTRELINO SILVEIRA FILHO, CPATU; GUILHERME BARBOSA ABREU, CPACP; FRANCISCO PEREIRA MOURA NETO, CNPAF; JULIAN PIETRAGALLA, INTEGRATED BREEDING PLATFORM, Texcoco, Mexico; MATEO VARGAS HERNÁNDEZ, CIMMYT, Texcoco-Mexico; JOSE CROSSA, CIMMYT, Texcoco-Mexico.BRESEGHELLO, F.MELLO, R. N. dePINHEIRO, P. V.SOARES, D. M.LOPES JUNIOR, S.RANGEL, P. H. N.GUIMARÃES, E. P.CASTRO, A. P. deCOLOMBARI FILHO, J. M.MAGALHÃES JUNIOR, A. M. deFAGUNDES, P. R. R.NEVES, P. de C. F.FURTINI, I. V.UTUMI, M. M.PEREIRA, J. A.CORDEIRO, A. C. C.SILVEIRA FILHO, A.ABREU, G. B.MOURA NETO, F. P.PIETRAGALLA, J.VARGAS HERNÁNDEZ, M.CROSSA, J.2021-11-30T12:00:26Z2021-11-30T12:00:26Z2021-06-282021info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleCrop Science, v. 61, n. 5, p. 3445-3457, Sept./Oct. 2021.0011-183Xhttp://www.alice.cnptia.embrapa.br/alice/handle/doc/1132588https://doi.org/10.1002/csc2.20550enginfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)instacron:EMBRAPA2021-11-30T12:00:37Zoai:www.alice.cnptia.embrapa.br:doc/1132588Repositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestopendoar:21542021-11-30T12:00:37falseRepositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestcg-riaa@embrapa.bropendoar:21542021-11-30T12:00:37Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)false
dc.title.none.fl_str_mv Building the Embrapa rice breeding dataset for efficient data reuse.
title Building the Embrapa rice breeding dataset for efficient data reuse.
spellingShingle Building the Embrapa rice breeding dataset for efficient data reuse.
BRESEGHELLO, F.
Banco de dados
Arroz
Oryza Sativa
Melhoramento Genético Vegetal
Fenótipo
Rice
Plant breeding
Databases
Genetics
title_short Building the Embrapa rice breeding dataset for efficient data reuse.
title_full Building the Embrapa rice breeding dataset for efficient data reuse.
title_fullStr Building the Embrapa rice breeding dataset for efficient data reuse.
title_full_unstemmed Building the Embrapa rice breeding dataset for efficient data reuse.
title_sort Building the Embrapa rice breeding dataset for efficient data reuse.
author BRESEGHELLO, F.
author_facet BRESEGHELLO, F.
MELLO, R. N. de
PINHEIRO, P. V.
SOARES, D. M.
LOPES JUNIOR, S.
RANGEL, P. H. N.
GUIMARÃES, E. P.
CASTRO, A. P. de
COLOMBARI FILHO, J. M.
MAGALHÃES JUNIOR, A. M. de
FAGUNDES, P. R. R.
NEVES, P. de C. F.
FURTINI, I. V.
UTUMI, M. M.
PEREIRA, J. A.
CORDEIRO, A. C. C.
SILVEIRA FILHO, A.
ABREU, G. B.
MOURA NETO, F. P.
PIETRAGALLA, J.
VARGAS HERNÁNDEZ, M.
CROSSA, J.
author_role author
author2 MELLO, R. N. de
PINHEIRO, P. V.
SOARES, D. M.
LOPES JUNIOR, S.
RANGEL, P. H. N.
GUIMARÃES, E. P.
CASTRO, A. P. de
COLOMBARI FILHO, J. M.
MAGALHÃES JUNIOR, A. M. de
FAGUNDES, P. R. R.
NEVES, P. de C. F.
FURTINI, I. V.
UTUMI, M. M.
PEREIRA, J. A.
CORDEIRO, A. C. C.
SILVEIRA FILHO, A.
ABREU, G. B.
MOURA NETO, F. P.
PIETRAGALLA, J.
VARGAS HERNÁNDEZ, M.
CROSSA, J.
author2_role author
author
author
author
author
author
author
author
author
author
author
author
author
author
author
author
author
author
author
author
author
dc.contributor.none.fl_str_mv FLAVIO BRESEGHELLO, CNPAF; RAQUEL NEVES DE MELLO, CNPAF; PATRICIA VALLE PINHEIRO, CNPAF; DINO MAGALHAES SOARES, CNPAF; SERGIO LOPES JUNIOR, CNPAF; PAULO HIDEO NAKANO RANGEL, CNPAF; ELCIO PERPETUO GUIMARAES, CNPAF; ADRIANO PEREIRA DE CASTRO, CNPAF; JOSE MANOEL COLOMBARI FILHO, CNPAF; ARIANO MARTINS DE MAGALHAES JUNIOR, CPACT; PAULO RICARDO REIS FAGUNDES, CPACT; PERICLES DE CARVALHO FERREIRA NEVES, CNPAF; ISABELA VOLPI FURTINI, CNPAF; MARLEY MARICO UTUMI, CPAF-RO; JOSE ALMEIDA PEREIRA, CPAMN; ANTONIO CARLOS CENTENO CORDEIRO, CPAF-RR; AUSTRELINO SILVEIRA FILHO, CPATU; GUILHERME BARBOSA ABREU, CPACP; FRANCISCO PEREIRA MOURA NETO, CNPAF; JULIAN PIETRAGALLA, INTEGRATED BREEDING PLATFORM, Texcoco, Mexico; MATEO VARGAS HERNÁNDEZ, CIMMYT, Texcoco-Mexico; JOSE CROSSA, CIMMYT, Texcoco-Mexico.
dc.contributor.author.fl_str_mv BRESEGHELLO, F.
MELLO, R. N. de
PINHEIRO, P. V.
SOARES, D. M.
LOPES JUNIOR, S.
RANGEL, P. H. N.
GUIMARÃES, E. P.
CASTRO, A. P. de
COLOMBARI FILHO, J. M.
MAGALHÃES JUNIOR, A. M. de
FAGUNDES, P. R. R.
NEVES, P. de C. F.
FURTINI, I. V.
UTUMI, M. M.
PEREIRA, J. A.
CORDEIRO, A. C. C.
SILVEIRA FILHO, A.
ABREU, G. B.
MOURA NETO, F. P.
PIETRAGALLA, J.
VARGAS HERNÁNDEZ, M.
CROSSA, J.
dc.subject.por.fl_str_mv Banco de dados
Arroz
Oryza Sativa
Melhoramento Genético Vegetal
Fenótipo
Rice
Plant breeding
Databases
Genetics
topic Banco de dados
Arroz
Oryza Sativa
Melhoramento Genético Vegetal
Fenótipo
Rice
Plant breeding
Databases
Genetics
description Embrapa has led breeding programs for irrigated and upland rice (Oryza sativa L.) since 1977, generating a large amount of pedigree and phenotypic data. However, there were no systematic standards for data recording nor long-term data preservation and reuse strategies. With the new aim of making data reuse practical, we recovered all data available and structured it into the Embrapa Rice Breeding Dataset (ERBD). In its current version, the ERBD includes 20,504 crosses involving 9,974 parents, the pedigrees of most of the 4,532 inbred lines that took part in advanced field trials, and phenotypic data from 2,711 field trials (1,118 irrigated, 1,593 upland trials), representing 226,458 field plots. Those trials were conducted over 38 years (1982-2019), in 247 locations, in latitudes ranging from 3°N to 33°S. Phenotypic traits included grain yield, days to flowering, plant height, canopy lodging, and five important fungal diseases: leaf blast, panicle blast, brown spot, leaf scald, and grain discoloration. The total number of data points surpasses 1.27 million. Descriptive statistics were computed over the dataset, split by cropping systems (irrigated or upland). The mean heritability of grain yield was high for both systems, at around .7, whereas the mean coefficient of variation was 13.9% for irrigated trials and 18.7% for upland trials. The ERBD offers the possibility of conducting studies on different aspects of rice breeding and genetics, including genetic gain, G×E analysis, genome-wide association studies and genomic prediction.
publishDate 2021
dc.date.none.fl_str_mv 2021-11-30T12:00:26Z
2021-11-30T12:00:26Z
2021-06-28
2021
dc.type.driver.fl_str_mv info:eu-repo/semantics/publishedVersion
info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv Crop Science, v. 61, n. 5, p. 3445-3457, Sept./Oct. 2021.
0011-183X
http://www.alice.cnptia.embrapa.br/alice/handle/doc/1132588
https://doi.org/10.1002/csc2.20550
identifier_str_mv Crop Science, v. 61, n. 5, p. 3445-3457, Sept./Oct. 2021.
0011-183X
url http://www.alice.cnptia.embrapa.br/alice/handle/doc/1132588
https://doi.org/10.1002/csc2.20550
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.source.none.fl_str_mv reponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
instacron:EMBRAPA
instname_str Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
instacron_str EMBRAPA
institution EMBRAPA
reponame_str Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
collection Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
repository.name.fl_str_mv Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
repository.mail.fl_str_mv cg-riaa@embrapa.br
_version_ 1794503512943493120