Distance-based phylogenetic inference from typing data: a unifying view

Detalhes bibliográficos
Autor(a) principal: Vaz, Cátia
Data de Publicação: 2020
Outros Autores: Nascimento, Marta, Carrico, Joao, Rocher, Tatiana, Francisco, Alexandre P.
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10400.21/13314
Resumo: Typing methods are widely used in the surveillance of infectious diseases, outbreaks investigation and studies of the natural history of an infection. Moreover, their use is becoming standard, in particular with the introduction of high-throughput sequencing. On the other hand, the data being generated are massive and many algorithms have been proposed for a phylogenetic analysis of typing data, addressing both correctness and scalability issues. Most of the distance-based algorithms for inferring phylogenetic trees follow the closest pair joining scheme. This is one of the approaches used in hierarchical clustering. Moreover, although phylogenetic inference algorithms may seem rather different, the main difference among them resides on how one defines cluster proximity and on which optimization criterion is used. Both cluster proximity and optimization criteria rely often on a model of evolution. In this work, we review, and we provide a unified view of these algorithms. This is an important step not only to better understand such algorithms but also to identify possible computational bottlenecks and improvements, important to deal with large data sets.
id RCAP_4c464ddc03e8bfe0dd0704a98da2896e
oai_identifier_str oai:repositorio.ipl.pt:10400.21/13314
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Distance-based phylogenetic inference from typing data: a unifying viewPhylogenetic inferenceClustering methodsTree search algorithmsTyping methods are widely used in the surveillance of infectious diseases, outbreaks investigation and studies of the natural history of an infection. Moreover, their use is becoming standard, in particular with the introduction of high-throughput sequencing. On the other hand, the data being generated are massive and many algorithms have been proposed for a phylogenetic analysis of typing data, addressing both correctness and scalability issues. Most of the distance-based algorithms for inferring phylogenetic trees follow the closest pair joining scheme. This is one of the approaches used in hierarchical clustering. Moreover, although phylogenetic inference algorithms may seem rather different, the main difference among them resides on how one defines cluster proximity and on which optimization criterion is used. Both cluster proximity and optimization criteria rely often on a model of evolution. In this work, we review, and we provide a unified view of these algorithms. This is an important step not only to better understand such algorithms but also to identify possible computational bottlenecks and improvements, important to deal with large data sets.Oxford AcademicRCIPLVaz, CátiaNascimento, MartaCarrico, JoaoRocher, TatianaFrancisco, Alexandre P.2021-05-10T10:17:03Z20202020-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10400.21/13314engVAZ, Cátia; [et al] – Distance-based phylogenetic inference from typing data: a unifying view. Briefings in Bioinformatics. eISSN 1477-4054. Vol. 00, N.º 0 (2020), pp. 1-1610.1093/bib/bbaa1471477-4054metadata only accessinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-08-03T10:07:50Zoai:repositorio.ipl.pt:10400.21/13314Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T20:21:17.382180Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Distance-based phylogenetic inference from typing data: a unifying view
title Distance-based phylogenetic inference from typing data: a unifying view
spellingShingle Distance-based phylogenetic inference from typing data: a unifying view
Vaz, Cátia
Phylogenetic inference
Clustering methods
Tree search algorithms
title_short Distance-based phylogenetic inference from typing data: a unifying view
title_full Distance-based phylogenetic inference from typing data: a unifying view
title_fullStr Distance-based phylogenetic inference from typing data: a unifying view
title_full_unstemmed Distance-based phylogenetic inference from typing data: a unifying view
title_sort Distance-based phylogenetic inference from typing data: a unifying view
author Vaz, Cátia
author_facet Vaz, Cátia
Nascimento, Marta
Carrico, Joao
Rocher, Tatiana
Francisco, Alexandre P.
author_role author
author2 Nascimento, Marta
Carrico, Joao
Rocher, Tatiana
Francisco, Alexandre P.
author2_role author
author
author
author
dc.contributor.none.fl_str_mv RCIPL
dc.contributor.author.fl_str_mv Vaz, Cátia
Nascimento, Marta
Carrico, Joao
Rocher, Tatiana
Francisco, Alexandre P.
dc.subject.por.fl_str_mv Phylogenetic inference
Clustering methods
Tree search algorithms
topic Phylogenetic inference
Clustering methods
Tree search algorithms
description Typing methods are widely used in the surveillance of infectious diseases, outbreaks investigation and studies of the natural history of an infection. Moreover, their use is becoming standard, in particular with the introduction of high-throughput sequencing. On the other hand, the data being generated are massive and many algorithms have been proposed for a phylogenetic analysis of typing data, addressing both correctness and scalability issues. Most of the distance-based algorithms for inferring phylogenetic trees follow the closest pair joining scheme. This is one of the approaches used in hierarchical clustering. Moreover, although phylogenetic inference algorithms may seem rather different, the main difference among them resides on how one defines cluster proximity and on which optimization criterion is used. Both cluster proximity and optimization criteria rely often on a model of evolution. In this work, we review, and we provide a unified view of these algorithms. This is an important step not only to better understand such algorithms but also to identify possible computational bottlenecks and improvements, important to deal with large data sets.
publishDate 2020
dc.date.none.fl_str_mv 2020
2020-01-01T00:00:00Z
2021-05-10T10:17:03Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10400.21/13314
url http://hdl.handle.net/10400.21/13314
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv VAZ, Cátia; [et al] – Distance-based phylogenetic inference from typing data: a unifying view. Briefings in Bioinformatics. eISSN 1477-4054. Vol. 00, N.º 0 (2020), pp. 1-16
10.1093/bib/bbaa147
1477-4054
dc.rights.driver.fl_str_mv metadata only access
info:eu-repo/semantics/openAccess
rights_invalid_str_mv metadata only access
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Oxford Academic
publisher.none.fl_str_mv Oxford Academic
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799133483930484736