A phased diploid genome assembly for the forage grass urochloa ruziziensis based on single-molecule real-time sequencing.
Main Author: | |
---|---|
Publication Date: | 2019 |
Other Authors: | , , , |
Language: | eng |
Source: | Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) |
Download full: | http://www.alice.cnptia.embrapa.br/alice/handle/doc/1107378 |
Summary: | Ruzigrass (Urochloa ruziziensis) is a diploid, tropical forage grass native to Africa, widely planted in Brazil, known for its high nutritional quality. It is closely related to important forage species of Urochloa, and plays a crucial role in the breeding program of brachiaria grasses, mostly focused on inter-specific hybirds. Previous studies from our group based on shallow Illumina sequencing resulted in the development of the first molecular markers for the species (Silva et al., 2013), as well as in assessments of germplasm diversity and structure (Pessoa-Filho et al., 2015). Assembly and analysis of complete plastid genomes for four Urochloa species allowed the characterization of their phylogenetic divergence (Pessoa-Filho et al., 2017). Here, we present a near-complete phased diploid genome assembly of the heterozygous ruzigrass clone C69. We used PacBio Sequel to generate over 13.3 million long reads (mean size 6.5 kbp), adding up to 87.5 Gbp of raw data (~142x coverage). The current diploid assembly using FALCON-Unzip contains ~603 Mbp in 3,539 primary contigs, with NG50 of 286 kbp, and covers 98.2% of the estimated haploid genome size of 615 Mbp for ruzigrass. In addition, 82% of the assembly could be phased as separate haplotypes, with ~500 Mbp resolved as haplotigs. Assessment of assembly completeness showed 95.2% BUSCO matches as complete, and 83.3% as complete single-copy. Ongoing research includes transcriptome assembly to aid gene prediction and annotation, anchoring of contigs in linkage maps, and Hi-C scaffolding. A high-quality, chromosome-scale genome assembly for ruzigrass will aid research groups in the development and application of genomic tools in breeding and genetics of brachiaria grasses. |
id |
EMBR_d6439b62edd718a9baf2a9ccdfa2ecd5 |
---|---|
oai_identifier_str |
oai:www.alice.cnptia.embrapa.br:doc/1107378 |
network_acronym_str |
EMBR |
network_name_str |
Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) |
repository_id_str |
2154 |
spelling |
A phased diploid genome assembly for the forage grass urochloa ruziziensis based on single-molecule real-time sequencing.BrasilGenomaCapim UrochloaRuzigrass (Urochloa ruziziensis) is a diploid, tropical forage grass native to Africa, widely planted in Brazil, known for its high nutritional quality. It is closely related to important forage species of Urochloa, and plays a crucial role in the breeding program of brachiaria grasses, mostly focused on inter-specific hybirds. Previous studies from our group based on shallow Illumina sequencing resulted in the development of the first molecular markers for the species (Silva et al., 2013), as well as in assessments of germplasm diversity and structure (Pessoa-Filho et al., 2015). Assembly and analysis of complete plastid genomes for four Urochloa species allowed the characterization of their phylogenetic divergence (Pessoa-Filho et al., 2017). Here, we present a near-complete phased diploid genome assembly of the heterozygous ruzigrass clone C69. We used PacBio Sequel to generate over 13.3 million long reads (mean size 6.5 kbp), adding up to 87.5 Gbp of raw data (~142x coverage). The current diploid assembly using FALCON-Unzip contains ~603 Mbp in 3,539 primary contigs, with NG50 of 286 kbp, and covers 98.2% of the estimated haploid genome size of 615 Mbp for ruzigrass. In addition, 82% of the assembly could be phased as separate haplotypes, with ~500 Mbp resolved as haplotigs. Assessment of assembly completeness showed 95.2% BUSCO matches as complete, and 83.3% as complete single-copy. Ongoing research includes transcriptome assembly to aid gene prediction and annotation, anchoring of contigs in linkage maps, and Hi-C scaffolding. A high-quality, chromosome-scale genome assembly for ruzigrass will aid research groups in the development and application of genomic tools in breeding and genetics of brachiaria grasses.Plant and Animal Genome XXVII Conference (PAG).MARCO AURELIO CALDAS DE PINHO PESSO, CPAC; FAUSTO DE SOUZA SOBRINHO, CNPGL; RODRIGO DA ROCHA FRAGOSO, CPAC; ORZENIL BONFIM DA SILVA JUNIOR, Cenargen; MARCIO ELIAS FERREIRA, Cenargen.PESSOA FILHO, M. A. C. de P.SOUZA SOBRINHO, F. deFRAGOSO, R. da R.SILVA JUNIOR, O. B. daFERREIRA, M. E.2019-03-24T00:29:22Z2019-03-24T00:29:22Z2019-03-2120192019-03-24T00:29:22ZResumo em anais e proceedingsinfo:eu-repo/semantics/publishedVersionIn: PLANT AND ANIMAL GENOME CONFERENCE, 27., 2019, San Diego. Proceedings... Livingston, NJ: Scherago, 2019http://www.alice.cnptia.embrapa.br/alice/handle/doc/1107378enginfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)instacron:EMBRAPA2019-03-24T00:29:27Zoai:www.alice.cnptia.embrapa.br:doc/1107378Repositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestcg-riaa@embrapa.bropendoar:21542019-03-24T00:29:27Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)false |
dc.title.none.fl_str_mv |
A phased diploid genome assembly for the forage grass urochloa ruziziensis based on single-molecule real-time sequencing. |
title |
A phased diploid genome assembly for the forage grass urochloa ruziziensis based on single-molecule real-time sequencing. |
spellingShingle |
A phased diploid genome assembly for the forage grass urochloa ruziziensis based on single-molecule real-time sequencing. PESSOA FILHO, M. A. C. de P. Brasil Genoma Capim Urochloa |
title_short |
A phased diploid genome assembly for the forage grass urochloa ruziziensis based on single-molecule real-time sequencing. |
title_full |
A phased diploid genome assembly for the forage grass urochloa ruziziensis based on single-molecule real-time sequencing. |
title_fullStr |
A phased diploid genome assembly for the forage grass urochloa ruziziensis based on single-molecule real-time sequencing. |
title_full_unstemmed |
A phased diploid genome assembly for the forage grass urochloa ruziziensis based on single-molecule real-time sequencing. |
title_sort |
A phased diploid genome assembly for the forage grass urochloa ruziziensis based on single-molecule real-time sequencing. |
author |
PESSOA FILHO, M. A. C. de P. |
author_facet |
PESSOA FILHO, M. A. C. de P. SOUZA SOBRINHO, F. de FRAGOSO, R. da R. SILVA JUNIOR, O. B. da FERREIRA, M. E. |
author_role |
author |
author2 |
SOUZA SOBRINHO, F. de FRAGOSO, R. da R. SILVA JUNIOR, O. B. da FERREIRA, M. E. |
author2_role |
author author author author |
dc.contributor.none.fl_str_mv |
MARCO AURELIO CALDAS DE PINHO PESSO, CPAC; FAUSTO DE SOUZA SOBRINHO, CNPGL; RODRIGO DA ROCHA FRAGOSO, CPAC; ORZENIL BONFIM DA SILVA JUNIOR, Cenargen; MARCIO ELIAS FERREIRA, Cenargen. |
dc.contributor.author.fl_str_mv |
PESSOA FILHO, M. A. C. de P. SOUZA SOBRINHO, F. de FRAGOSO, R. da R. SILVA JUNIOR, O. B. da FERREIRA, M. E. |
dc.subject.por.fl_str_mv |
Brasil Genoma Capim Urochloa |
topic |
Brasil Genoma Capim Urochloa |
description |
Ruzigrass (Urochloa ruziziensis) is a diploid, tropical forage grass native to Africa, widely planted in Brazil, known for its high nutritional quality. It is closely related to important forage species of Urochloa, and plays a crucial role in the breeding program of brachiaria grasses, mostly focused on inter-specific hybirds. Previous studies from our group based on shallow Illumina sequencing resulted in the development of the first molecular markers for the species (Silva et al., 2013), as well as in assessments of germplasm diversity and structure (Pessoa-Filho et al., 2015). Assembly and analysis of complete plastid genomes for four Urochloa species allowed the characterization of their phylogenetic divergence (Pessoa-Filho et al., 2017). Here, we present a near-complete phased diploid genome assembly of the heterozygous ruzigrass clone C69. We used PacBio Sequel to generate over 13.3 million long reads (mean size 6.5 kbp), adding up to 87.5 Gbp of raw data (~142x coverage). The current diploid assembly using FALCON-Unzip contains ~603 Mbp in 3,539 primary contigs, with NG50 of 286 kbp, and covers 98.2% of the estimated haploid genome size of 615 Mbp for ruzigrass. In addition, 82% of the assembly could be phased as separate haplotypes, with ~500 Mbp resolved as haplotigs. Assessment of assembly completeness showed 95.2% BUSCO matches as complete, and 83.3% as complete single-copy. Ongoing research includes transcriptome assembly to aid gene prediction and annotation, anchoring of contigs in linkage maps, and Hi-C scaffolding. A high-quality, chromosome-scale genome assembly for ruzigrass will aid research groups in the development and application of genomic tools in breeding and genetics of brachiaria grasses. |
publishDate |
2019 |
dc.date.none.fl_str_mv |
2019-03-24T00:29:22Z 2019-03-24T00:29:22Z 2019-03-21 2019 2019-03-24T00:29:22Z |
dc.type.driver.fl_str_mv |
Resumo em anais e proceedings |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
In: PLANT AND ANIMAL GENOME CONFERENCE, 27., 2019, San Diego. Proceedings... Livingston, NJ: Scherago, 2019 http://www.alice.cnptia.embrapa.br/alice/handle/doc/1107378 |
identifier_str_mv |
In: PLANT AND ANIMAL GENOME CONFERENCE, 27., 2019, San Diego. Proceedings... Livingston, NJ: Scherago, 2019 |
url |
http://www.alice.cnptia.embrapa.br/alice/handle/doc/1107378 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.source.none.fl_str_mv |
reponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa) instacron:EMBRAPA |
instname_str |
Empresa Brasileira de Pesquisa Agropecuária (Embrapa) |
instacron_str |
EMBRAPA |
institution |
EMBRAPA |
reponame_str |
Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) |
collection |
Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) |
repository.name.fl_str_mv |
Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa) |
repository.mail.fl_str_mv |
cg-riaa@embrapa.br |
_version_ |
1822721391752904704 |