Embracing heterogeneity: Coalescing the tree of life and the future of phylogenomics

Detalhes bibliográficos
Autor(a) principal: Bravo, Gustavo A.
Data de Publicação: 2019
Outros Autores: Antonelli, Alexandre, Bacon, Christine D., Bartoszek, Krzysztof, Blom, Mozes P.K., Huynh, Stella, Jones, Graham R., Lacey Knowles, L., Lamichhaney, Sangeet, Marcussen, Thomas, Morlon, Hélène, Nakhleh, Luay K., Oxelman, Bengt, Pfeil, Bernard E., Schliep, Alexander, Wahlberg, Niklas, Werneck, F. P., Wiedenhoeft, John, Willows-Munro, Sandi, Edwards, Scott V.
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Institucional do INPA
Texto Completo: https://repositorio.inpa.gov.br/handle/1/15348
Resumo: Building the Tree of Life (ToL) is a major challenge of modern biology, requiring advances in cyberinfrastructure, data collection, theory, and more. Here, we argue that phylogenomics stands to benefit by embracing the many heterogeneous genomic signals emerging from the first decade of large-scale phylogenetic analysis spawned by high-throughput sequencing (HTS). Such signals include those most commonly encountered in phylogenomic datasets, such as incomplete lineage sorting, but also those reticulate processes emerging with greater frequency, such as recombination and introgression. Here we focus specifically on how phylogenetic methods can accommodate the heterogeneity incurred by such population genetic processes; we do not discuss phylogenetic methods that ignore such processes, such as concatenation or supermatrix approaches or supertrees. We suggest that methods of data acquisition and the types of markers used in phylogenomics will remain restricted until a posteriori methods of marker choice are made possible with routine whole-genome sequencing of taxa of interest. We discuss limitations and potential extensions of a model supporting innovation in phylogenomics today, the multispecies coalescent model (MSC). Macroevolutionary models that use phylogenies, such as character mapping, often ignore the heterogeneity on which building phylogenies increasingly rely and suggest that assimilating such heterogeneity is an important goal moving forward. Finally, we argue that an integrative cyberinfrastructure linking all steps of the process of building the ToL, from specimen acquisition in the field to publication and tracking of phylogenomic data, as well as a culture that values contributors at each step, are essential for progress. © 2019 Bravo et al.
id INPA-2_8c75030935d7c2e7ff995a04c987c842
oai_identifier_str oai:repositorio:1/15348
network_acronym_str INPA-2
network_name_str Repositório Institucional do INPA
repository_id_str
spelling Bravo, Gustavo A.Antonelli, AlexandreBacon, Christine D.Bartoszek, KrzysztofBlom, Mozes P.K.Huynh, StellaJones, Graham R.Lacey Knowles, L.Lamichhaney, SangeetMarcussen, ThomasMorlon, HélèneNakhleh, Luay K.Oxelman, BengtPfeil, Bernard E.Schliep, AlexanderWahlberg, NiklasWerneck, F. P.Wiedenhoeft, JohnWillows-Munro, SandiEdwards, Scott V.2020-05-08T20:19:07Z2020-05-08T20:19:07Z2019https://repositorio.inpa.gov.br/handle/1/1534810.7717/peerj.6399Building the Tree of Life (ToL) is a major challenge of modern biology, requiring advances in cyberinfrastructure, data collection, theory, and more. Here, we argue that phylogenomics stands to benefit by embracing the many heterogeneous genomic signals emerging from the first decade of large-scale phylogenetic analysis spawned by high-throughput sequencing (HTS). Such signals include those most commonly encountered in phylogenomic datasets, such as incomplete lineage sorting, but also those reticulate processes emerging with greater frequency, such as recombination and introgression. Here we focus specifically on how phylogenetic methods can accommodate the heterogeneity incurred by such population genetic processes; we do not discuss phylogenetic methods that ignore such processes, such as concatenation or supermatrix approaches or supertrees. We suggest that methods of data acquisition and the types of markers used in phylogenomics will remain restricted until a posteriori methods of marker choice are made possible with routine whole-genome sequencing of taxa of interest. We discuss limitations and potential extensions of a model supporting innovation in phylogenomics today, the multispecies coalescent model (MSC). Macroevolutionary models that use phylogenies, such as character mapping, often ignore the heterogeneity on which building phylogenies increasingly rely and suggest that assimilating such heterogeneity is an important goal moving forward. Finally, we argue that an integrative cyberinfrastructure linking all steps of the process of building the ToL, from specimen acquisition in the field to publication and tracking of phylogenomic data, as well as a culture that values contributors at each step, are essential for progress. © 2019 Bravo et al.Volume 2019, Número 2Attribution-NonCommercial-NoDerivs 3.0 Brazilhttp://creativecommons.org/licenses/by-nc-nd/3.0/br/info:eu-repo/semantics/openAccessAncient DnaTranscriptomeArabidopsis ThalianaBiogeographyCopy Number VariationGene DuplicationGene FlowGene LocusGene LossGene MappingGene OrderGenetic ParametersRecombination, GeneticGenotypeHeredityHeterozygosityHigh Throughput SequencingGene Transfer, HorizontalHumanHybridizationIntrogressionPhylogenomicsPhylogenyPhylogeographyPolymerase Chain ReactionPolyploidySanger SequencingPolymorphism, Single NucleotideSpecies DifferentiationSyntenyTree Of LifeWhole Genome SequencingEmbracing heterogeneity: Coalescing the tree of life and the future of phylogenomicsinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlePeerJengreponame:Repositório Institucional do INPAinstname:Instituto Nacional de Pesquisas da Amazônia (INPA)instacron:INPAORIGINALartigo-inpa.pdfartigo-inpa.pdfapplication/pdf3900230https://repositorio.inpa.gov.br/bitstream/1/15348/1/artigo-inpa.pdf7957f6c8760297dd6eb20c323fcad4b2MD511/153482020-07-14 11:07:25.281oai:repositorio:1/15348Repositório de PublicaçõesPUBhttps://repositorio.inpa.gov.br/oai/requestopendoar:2020-07-14T15:07:25Repositório Institucional do INPA - Instituto Nacional de Pesquisas da Amazônia (INPA)false
dc.title.en.fl_str_mv Embracing heterogeneity: Coalescing the tree of life and the future of phylogenomics
title Embracing heterogeneity: Coalescing the tree of life and the future of phylogenomics
spellingShingle Embracing heterogeneity: Coalescing the tree of life and the future of phylogenomics
Bravo, Gustavo A.
Ancient Dna
Transcriptome
Arabidopsis Thaliana
Biogeography
Copy Number Variation
Gene Duplication
Gene Flow
Gene Locus
Gene Loss
Gene Mapping
Gene Order
Genetic Parameters
Recombination, Genetic
Genotype
Heredity
Heterozygosity
High Throughput Sequencing
Gene Transfer, Horizontal
Human
Hybridization
Introgression
Phylogenomics
Phylogeny
Phylogeography
Polymerase Chain Reaction
Polyploidy
Sanger Sequencing
Polymorphism, Single Nucleotide
Species Differentiation
Synteny
Tree Of Life
Whole Genome Sequencing
title_short Embracing heterogeneity: Coalescing the tree of life and the future of phylogenomics
title_full Embracing heterogeneity: Coalescing the tree of life and the future of phylogenomics
title_fullStr Embracing heterogeneity: Coalescing the tree of life and the future of phylogenomics
title_full_unstemmed Embracing heterogeneity: Coalescing the tree of life and the future of phylogenomics
title_sort Embracing heterogeneity: Coalescing the tree of life and the future of phylogenomics
author Bravo, Gustavo A.
author_facet Bravo, Gustavo A.
Antonelli, Alexandre
Bacon, Christine D.
Bartoszek, Krzysztof
Blom, Mozes P.K.
Huynh, Stella
Jones, Graham R.
Lacey Knowles, L.
Lamichhaney, Sangeet
Marcussen, Thomas
Morlon, Hélène
Nakhleh, Luay K.
Oxelman, Bengt
Pfeil, Bernard E.
Schliep, Alexander
Wahlberg, Niklas
Werneck, F. P.
Wiedenhoeft, John
Willows-Munro, Sandi
Edwards, Scott V.
author_role author
author2 Antonelli, Alexandre
Bacon, Christine D.
Bartoszek, Krzysztof
Blom, Mozes P.K.
Huynh, Stella
Jones, Graham R.
Lacey Knowles, L.
Lamichhaney, Sangeet
Marcussen, Thomas
Morlon, Hélène
Nakhleh, Luay K.
Oxelman, Bengt
Pfeil, Bernard E.
Schliep, Alexander
Wahlberg, Niklas
Werneck, F. P.
Wiedenhoeft, John
Willows-Munro, Sandi
Edwards, Scott V.
author2_role author
author
author
author
author
author
author
author
author
author
author
author
author
author
author
author
author
author
author
dc.contributor.author.fl_str_mv Bravo, Gustavo A.
Antonelli, Alexandre
Bacon, Christine D.
Bartoszek, Krzysztof
Blom, Mozes P.K.
Huynh, Stella
Jones, Graham R.
Lacey Knowles, L.
Lamichhaney, Sangeet
Marcussen, Thomas
Morlon, Hélène
Nakhleh, Luay K.
Oxelman, Bengt
Pfeil, Bernard E.
Schliep, Alexander
Wahlberg, Niklas
Werneck, F. P.
Wiedenhoeft, John
Willows-Munro, Sandi
Edwards, Scott V.
dc.subject.eng.fl_str_mv Ancient Dna
Transcriptome
Arabidopsis Thaliana
Biogeography
Copy Number Variation
Gene Duplication
Gene Flow
Gene Locus
Gene Loss
Gene Mapping
Gene Order
Genetic Parameters
Recombination, Genetic
Genotype
Heredity
Heterozygosity
High Throughput Sequencing
Gene Transfer, Horizontal
Human
Hybridization
Introgression
Phylogenomics
Phylogeny
Phylogeography
Polymerase Chain Reaction
Polyploidy
Sanger Sequencing
Polymorphism, Single Nucleotide
Species Differentiation
Synteny
Tree Of Life
Whole Genome Sequencing
topic Ancient Dna
Transcriptome
Arabidopsis Thaliana
Biogeography
Copy Number Variation
Gene Duplication
Gene Flow
Gene Locus
Gene Loss
Gene Mapping
Gene Order
Genetic Parameters
Recombination, Genetic
Genotype
Heredity
Heterozygosity
High Throughput Sequencing
Gene Transfer, Horizontal
Human
Hybridization
Introgression
Phylogenomics
Phylogeny
Phylogeography
Polymerase Chain Reaction
Polyploidy
Sanger Sequencing
Polymorphism, Single Nucleotide
Species Differentiation
Synteny
Tree Of Life
Whole Genome Sequencing
description Building the Tree of Life (ToL) is a major challenge of modern biology, requiring advances in cyberinfrastructure, data collection, theory, and more. Here, we argue that phylogenomics stands to benefit by embracing the many heterogeneous genomic signals emerging from the first decade of large-scale phylogenetic analysis spawned by high-throughput sequencing (HTS). Such signals include those most commonly encountered in phylogenomic datasets, such as incomplete lineage sorting, but also those reticulate processes emerging with greater frequency, such as recombination and introgression. Here we focus specifically on how phylogenetic methods can accommodate the heterogeneity incurred by such population genetic processes; we do not discuss phylogenetic methods that ignore such processes, such as concatenation or supermatrix approaches or supertrees. We suggest that methods of data acquisition and the types of markers used in phylogenomics will remain restricted until a posteriori methods of marker choice are made possible with routine whole-genome sequencing of taxa of interest. We discuss limitations and potential extensions of a model supporting innovation in phylogenomics today, the multispecies coalescent model (MSC). Macroevolutionary models that use phylogenies, such as character mapping, often ignore the heterogeneity on which building phylogenies increasingly rely and suggest that assimilating such heterogeneity is an important goal moving forward. Finally, we argue that an integrative cyberinfrastructure linking all steps of the process of building the ToL, from specimen acquisition in the field to publication and tracking of phylogenomic data, as well as a culture that values contributors at each step, are essential for progress. © 2019 Bravo et al.
publishDate 2019
dc.date.issued.fl_str_mv 2019
dc.date.accessioned.fl_str_mv 2020-05-08T20:19:07Z
dc.date.available.fl_str_mv 2020-05-08T20:19:07Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://repositorio.inpa.gov.br/handle/1/15348
dc.identifier.doi.none.fl_str_mv 10.7717/peerj.6399
url https://repositorio.inpa.gov.br/handle/1/15348
identifier_str_mv 10.7717/peerj.6399
dc.language.iso.fl_str_mv eng
language eng
dc.relation.ispartof.pt_BR.fl_str_mv Volume 2019, Número 2
dc.rights.driver.fl_str_mv Attribution-NonCommercial-NoDerivs 3.0 Brazil
http://creativecommons.org/licenses/by-nc-nd/3.0/br/
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Attribution-NonCommercial-NoDerivs 3.0 Brazil
http://creativecommons.org/licenses/by-nc-nd/3.0/br/
eu_rights_str_mv openAccess
dc.publisher.none.fl_str_mv PeerJ
publisher.none.fl_str_mv PeerJ
dc.source.none.fl_str_mv reponame:Repositório Institucional do INPA
instname:Instituto Nacional de Pesquisas da Amazônia (INPA)
instacron:INPA
instname_str Instituto Nacional de Pesquisas da Amazônia (INPA)
instacron_str INPA
institution INPA
reponame_str Repositório Institucional do INPA
collection Repositório Institucional do INPA
bitstream.url.fl_str_mv https://repositorio.inpa.gov.br/bitstream/1/15348/1/artigo-inpa.pdf
bitstream.checksum.fl_str_mv 7957f6c8760297dd6eb20c323fcad4b2
bitstream.checksumAlgorithm.fl_str_mv MD5
repository.name.fl_str_mv Repositório Institucional do INPA - Instituto Nacional de Pesquisas da Amazônia (INPA)
repository.mail.fl_str_mv
_version_ 1809928904880685056