Unraveling the complex genome of Saccharum spontaneum using Polyploid Gene Assembler

Bibliographic details
Main author: Nascimento, Leandro Costa
Publication date: 2019
Other authors: Yanagui, Karina; José, Juliana; Camargo, Eduardo L. O.; Grassi, Maria Carolina B.; Cunha, Camila P.; Bressiani, José Antônio; Carvalho, Guilherme M. A.; Prado, Paula F.; Mieczkowski, Piotr; Pereira, Gonçalo A. G.; Carazzolle, Marcelo F.; Carvalho, Carlos Roberto
Document type: Article
Language: eng
Source title: LOCUS Repositório Institucional da UFV
Full text: http://dx.doi.org/10.1093/dnares/dsz001
http://www.locus.ufv.br/handle/123456789/24480
Abstract: The Polyploid Gene Assembler (PGA), developed and tested in this study, represents a new strategy to perform gene-space assembly from complex genomes using low coverage DNA sequencing. The pipeline integrates reference-assisted loci and de novo assembly strategies to construct high-quality sequences focused on gene content. Pipeline validation was conducted with wheat (Triticum aestivum), a hexaploid species, using barley (Hordeum vulgare) as reference, which resulted in the identification of more than 90% of genes and several new genes. Moreover, PGA was used to assemble the gene content of Saccharum spontaneum, a parental lineage of hybrid sugarcane cultivars. The Saccharum spontaneum gene sequence obtained was used for reference-guided transcriptome analysis of six different tissues. A total of 39,234 genes were identified, of which 60.4% clustered into known grass gene families. Thirty-seven gene families were expanded when compared with other grasses, three of them highlighted by the number of gene copies potentially involved in initial development and stress response. In addition, 3,108 promoters (many showing tissue specificity) were identified in this work. In summary, PGA can reconstruct high-quality gene sequences from polyploid genomes, as shown for wheat and S. spontaneum, and it is more efficient than conventional genome assemblers using low coverage DNA sequencing.
id UFV_11eddf0993380ac56e57635069b937b9
oai_identifier_str oai:locus.ufv.br:123456789/24480
network_acronym_str UFV
network_name_str LOCUS Repositório Institucional da UFV
repository_id_str 2145
dc.title.en.fl_str_mv Unraveling the complex genome of Saccharum spontaneum using Polyploid Gene Assembler
title Unraveling the complex genome of Saccharum spontaneum using Polyploid Gene Assembler
spellingShingle Unraveling the complex genome of Saccharum spontaneum using Polyploid Gene Assembler
Nascimento, Leandro Costa
Sugarcane
Genome assembly
Transcriptome
Gene discovery
New assembler
title_short Unraveling the complex genome of Saccharum spontaneum using Polyploid Gene Assembler
title_full Unraveling the complex genome of Saccharum spontaneum using Polyploid Gene Assembler
title_fullStr Unraveling the complex genome of Saccharum spontaneum using Polyploid Gene Assembler
title_full_unstemmed Unraveling the complex genome of Saccharum spontaneum using Polyploid Gene Assembler
title_sort Unraveling the complex genome of Saccharum spontaneum using Polyploid Gene Assembler
author Nascimento, Leandro Costa
author_facet Nascimento, Leandro Costa
Yanagui, Karina
José, Juliana
Camargo, Eduardo L. O.
Grassi, Maria Carolina B.
Cunha, Camila P.
Bressiani, José Antônio
Carvalho, Guilherme M. A.
Prado, Paula F.
Mieczkowski, Piotr
Pereira, Gonçalo A. G.
Carazzolle, Marcelo F.
Carvalho, Carlos Roberto
author_role author
author2 Yanagui, Karina
José, Juliana
Camargo, Eduardo L. O.
Grassi, Maria Carolina B.
Cunha, Camila P.
Bressiani, José Antônio
Carvalho, Guilherme M. A.
Prado, Paula F.
Mieczkowski, Piotr
Pereira, Gonçalo A. G.
Carazzolle, Marcelo F.
Carvalho, Carlos Roberto
author2_role author
author
author
author
author
author
author
author
author
author
author
author
dc.contributor.author.fl_str_mv Nascimento, Leandro Costa
Yanagui, Karina
José, Juliana
Camargo, Eduardo L. O.
Grassi, Maria Carolina B.
Cunha, Camila P.
Bressiani, José Antônio
Carvalho, Guilherme M. A.
Prado, Paula F.
Mieczkowski, Piotr
Pereira, Gonçalo A. G.
Carazzolle, Marcelo F.
Carvalho, Carlos Roberto
dc.subject.pt-BR.fl_str_mv Sugarcane
Genome assembly
Transcriptome
Gene discovery
New assembler
topic Sugarcane
Genome assembly
Transcriptome
Gene discovery
New assembler
description The Polyploid Gene Assembler (PGA), developed and tested in this study, represents a new strategy to perform gene-space assembly from complex genomes using low coverage DNA sequencing. The pipeline integrates reference-assisted loci and de novo assembly strategies to construct high-quality sequences focused on gene content. Pipeline validation was conducted with wheat (Triticum aestivum), a hexaploid species, using barley (Hordeum vulgare) as reference, which resulted in the identification of more than 90% of genes and several new genes. Moreover, PGA was used to assemble the gene content of Saccharum spontaneum, a parental lineage of hybrid sugarcane cultivars. The Saccharum spontaneum gene sequence obtained was used for reference-guided transcriptome analysis of six different tissues. A total of 39,234 genes were identified, of which 60.4% clustered into known grass gene families. Thirty-seven gene families were expanded when compared with other grasses, three of them highlighted by the number of gene copies potentially involved in initial development and stress response. In addition, 3,108 promoters (many showing tissue specificity) were identified in this work. In summary, PGA can reconstruct high-quality gene sequences from polyploid genomes, as shown for wheat and S. spontaneum, and it is more efficient than conventional genome assemblers using low coverage DNA sequencing.
publishDate 2019
dc.date.accessioned.fl_str_mv 2019-04-11T14:21:04Z
dc.date.available.fl_str_mv 2019-04-11T14:21:04Z
dc.date.issued.fl_str_mv 2019-02
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://dx.doi.org/10.1093/dnares/dsz001
http://www.locus.ufv.br/handle/123456789/24480
dc.identifier.issn.none.fl_str_mv 1756-1663
identifier_str_mv 1756-1663
url http://dx.doi.org/10.1093/dnares/dsz001
http://www.locus.ufv.br/handle/123456789/24480
dc.language.iso.fl_str_mv eng
language eng
dc.relation.ispartofseries.pt-BR.fl_str_mv p. 1-12, Feb. 2019
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv DNA Research
publisher.none.fl_str_mv DNA Research
dc.source.none.fl_str_mv reponame:LOCUS Repositório Institucional da UFV
instname:Universidade Federal de Viçosa (UFV)
instacron:UFV
instname_str Universidade Federal de Viçosa (UFV)
instacron_str UFV
institution UFV
reponame_str LOCUS Repositório Institucional da UFV
collection LOCUS Repositório Institucional da UFV
bitstream.url.fl_str_mv https://locus.ufv.br//bitstream/123456789/24480/1/artigo.pdf
https://locus.ufv.br//bitstream/123456789/24480/2/license.txt
bitstream.checksum.fl_str_mv 119cef019a56e8504ab3d915b29d6012
8a4605be74aa9ea9d79846c1fba20a33
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
repository.name.fl_str_mv LOCUS Repositório Institucional da UFV - Universidade Federal de Viçosa (UFV)
repository.mail.fl_str_mv fabiojreis@ufv.br
_version_ 1801212913093967872