Next-generation sequencing reveals large connected networks of intra-host HCV variants

Detalhes bibliográficos
Autor(a) principal: Campo, David S.
Data de Publicação: 2014
Outros Autores: Dimitrova, Zoya, Yamasaki, Lilian [UNESP], Skums, Pavel, Lau, Daryl T. Y., Vaughan, Gilberto, Forbi, Joseph C., Teo, Chong-Gee, Khudyakov, Yury
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Institucional da UNESP
Texto Completo: http://dx.doi.org/10.1186/1471-2164-15-S5-S4
http://hdl.handle.net/11449/117309
Resumo: Background: Next-generation sequencing (NGS) allows for sampling numerous viral variants from infected patients. This provides a novel opportunity to represent and study the mutational landscape of Hepatitis C Virus (HCV) within a single host.Results: Intra-host variants of the HCV E1/E2 region were extensively sampled from 58 chronically infected patients. After NGS error correction, the average number of reads and variants obtained from each sample were 3202 and 464, respectively. The distance between each pair of variants was calculated and networks were created for each patient, where each node is a variant and two nodes are connected by a link if the nucleotide distance between them is 1. The work focused on large components having > 5% of all reads, which in average account for 93.7% of all reads found in a patient. The distance between any two variants calculated over the component correlated strongly with nucleotide distances (r = 0.9499; p = 0.0001), a better correlation than the one obtained with Neighbour-Joining trees (r = 0.7624; p = 0.0001). In each patient, components were well separated, with the average distance between (6.53%) being 10 times greater than within each component (0.68%). The ratio of nonsynonymous to synonymous changes was calculated and some patients (6.9%) showed a mixture of networks under strong negative and positive selection. All components were robust to in silico stochastic sampling; even after randomly removing 85% of all reads, the largest connected component in the new subsample still involved 82.4% of remaining nodes. In vitro sampling showed that 93.02% of components present in the original sample were also found in experimental replicas, with 81.6% of reads found in both. When syringe-sharing transmission events were simulated, 91.2% of all simulated transmission events seeded all components present in the source.Conclusions: Most intra-host variants are organized into distinct single-mutation components that are: well separated from each other, represent genetic distances between viral variants, robust to sampling, reproducible and likely seeded during transmission events. Facilitated by NGS, large components offer a novel evolutionary framework for genetic analysis of intra-host viral populations and understanding transmission, immune escape and drug resistance.
id UNSP_090891fc5b2bab958eb9cf2dc9e6eccc
oai_identifier_str oai:repositorio.unesp.br:11449/117309
network_acronym_str UNSP
network_name_str Repositório Institucional da UNESP
repository_id_str 2946
spelling Next-generation sequencing reveals large connected networks of intra-host HCV variantsBackground: Next-generation sequencing (NGS) allows for sampling numerous viral variants from infected patients. This provides a novel opportunity to represent and study the mutational landscape of Hepatitis C Virus (HCV) within a single host.Results: Intra-host variants of the HCV E1/E2 region were extensively sampled from 58 chronically infected patients. After NGS error correction, the average number of reads and variants obtained from each sample were 3202 and 464, respectively. The distance between each pair of variants was calculated and networks were created for each patient, where each node is a variant and two nodes are connected by a link if the nucleotide distance between them is 1. The work focused on large components having > 5% of all reads, which in average account for 93.7% of all reads found in a patient. The distance between any two variants calculated over the component correlated strongly with nucleotide distances (r = 0.9499; p = 0.0001), a better correlation than the one obtained with Neighbour-Joining trees (r = 0.7624; p = 0.0001). In each patient, components were well separated, with the average distance between (6.53%) being 10 times greater than within each component (0.68%). The ratio of nonsynonymous to synonymous changes was calculated and some patients (6.9%) showed a mixture of networks under strong negative and positive selection. All components were robust to in silico stochastic sampling; even after randomly removing 85% of all reads, the largest connected component in the new subsample still involved 82.4% of remaining nodes. In vitro sampling showed that 93.02% of components present in the original sample were also found in experimental replicas, with 81.6% of reads found in both. When syringe-sharing transmission events were simulated, 91.2% of all simulated transmission events seeded all components present in the source.Conclusions: Most intra-host variants are organized into distinct single-mutation components that are: well separated from each other, represent genetic distances between viral variants, robust to sampling, reproducible and likely seeded during transmission events. Facilitated by NGS, large components offer a novel evolutionary framework for genetic analysis of intra-host viral populations and understanding transmission, immune escape and drug resistance.Centers for Disease Control and PreventionCtr Dis Control & Prevent, Div Viral Hepatitis, Atlanta, GA 30333 USAUNESP Sao Paulo State Univ, Dept Biol, Lab Genom Studies, Sao Paulo, BrazilHarvard Univ, Beth Israel Deaconess Med Ctr, Sch Med, Liver Ctr,Div Gastroenterol, Cambridge, MA 02138 USAUNESP Sao Paulo State Univ, Dept Biol, Lab Genom Studies, Sao Paulo, BrazilBiomed Central LtdCtr Dis Control & PreventUniversidade Estadual Paulista (Unesp)Harvard UnivCampo, David S.Dimitrova, ZoyaYamasaki, Lilian [UNESP]Skums, PavelLau, Daryl T. Y.Vaughan, GilbertoForbi, Joseph C.Teo, Chong-GeeKhudyakov, Yury2015-03-18T15:55:48Z2015-03-18T15:55:48Z2014-07-14info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/article9application/pdfhttp://dx.doi.org/10.1186/1471-2164-15-S5-S4Bmc Genomics. London: Biomed Central Ltd, v. 15, 9 p., 2014.1471-2164http://hdl.handle.net/11449/11730910.1186/1471-2164-15-S5-S4WOS:000345682800004WOS000345682800004.pdfWeb of Sciencereponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengBmc Genomics3.7302,110info:eu-repo/semantics/openAccess2023-12-30T06:16:22Zoai:repositorio.unesp.br:11449/117309Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462024-08-05T21:40:34.288996Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false
dc.title.none.fl_str_mv Next-generation sequencing reveals large connected networks of intra-host HCV variants
title Next-generation sequencing reveals large connected networks of intra-host HCV variants
spellingShingle Next-generation sequencing reveals large connected networks of intra-host HCV variants
Campo, David S.
title_short Next-generation sequencing reveals large connected networks of intra-host HCV variants
title_full Next-generation sequencing reveals large connected networks of intra-host HCV variants
title_fullStr Next-generation sequencing reveals large connected networks of intra-host HCV variants
title_full_unstemmed Next-generation sequencing reveals large connected networks of intra-host HCV variants
title_sort Next-generation sequencing reveals large connected networks of intra-host HCV variants
author Campo, David S.
author_facet Campo, David S.
Dimitrova, Zoya
Yamasaki, Lilian [UNESP]
Skums, Pavel
Lau, Daryl T. Y.
Vaughan, Gilberto
Forbi, Joseph C.
Teo, Chong-Gee
Khudyakov, Yury
author_role author
author2 Dimitrova, Zoya
Yamasaki, Lilian [UNESP]
Skums, Pavel
Lau, Daryl T. Y.
Vaughan, Gilberto
Forbi, Joseph C.
Teo, Chong-Gee
Khudyakov, Yury
author2_role author
author
author
author
author
author
author
author
dc.contributor.none.fl_str_mv Ctr Dis Control & Prevent
Universidade Estadual Paulista (Unesp)
Harvard Univ
dc.contributor.author.fl_str_mv Campo, David S.
Dimitrova, Zoya
Yamasaki, Lilian [UNESP]
Skums, Pavel
Lau, Daryl T. Y.
Vaughan, Gilberto
Forbi, Joseph C.
Teo, Chong-Gee
Khudyakov, Yury
description Background: Next-generation sequencing (NGS) allows for sampling numerous viral variants from infected patients. This provides a novel opportunity to represent and study the mutational landscape of Hepatitis C Virus (HCV) within a single host.Results: Intra-host variants of the HCV E1/E2 region were extensively sampled from 58 chronically infected patients. After NGS error correction, the average number of reads and variants obtained from each sample were 3202 and 464, respectively. The distance between each pair of variants was calculated and networks were created for each patient, where each node is a variant and two nodes are connected by a link if the nucleotide distance between them is 1. The work focused on large components having > 5% of all reads, which in average account for 93.7% of all reads found in a patient. The distance between any two variants calculated over the component correlated strongly with nucleotide distances (r = 0.9499; p = 0.0001), a better correlation than the one obtained with Neighbour-Joining trees (r = 0.7624; p = 0.0001). In each patient, components were well separated, with the average distance between (6.53%) being 10 times greater than within each component (0.68%). The ratio of nonsynonymous to synonymous changes was calculated and some patients (6.9%) showed a mixture of networks under strong negative and positive selection. All components were robust to in silico stochastic sampling; even after randomly removing 85% of all reads, the largest connected component in the new subsample still involved 82.4% of remaining nodes. In vitro sampling showed that 93.02% of components present in the original sample were also found in experimental replicas, with 81.6% of reads found in both. When syringe-sharing transmission events were simulated, 91.2% of all simulated transmission events seeded all components present in the source.Conclusions: Most intra-host variants are organized into distinct single-mutation components that are: well separated from each other, represent genetic distances between viral variants, robust to sampling, reproducible and likely seeded during transmission events. Facilitated by NGS, large components offer a novel evolutionary framework for genetic analysis of intra-host viral populations and understanding transmission, immune escape and drug resistance.
publishDate 2014
dc.date.none.fl_str_mv 2014-07-14
2015-03-18T15:55:48Z
2015-03-18T15:55:48Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://dx.doi.org/10.1186/1471-2164-15-S5-S4
Bmc Genomics. London: Biomed Central Ltd, v. 15, 9 p., 2014.
1471-2164
http://hdl.handle.net/11449/117309
10.1186/1471-2164-15-S5-S4
WOS:000345682800004
WOS000345682800004.pdf
url http://dx.doi.org/10.1186/1471-2164-15-S5-S4
http://hdl.handle.net/11449/117309
identifier_str_mv Bmc Genomics. London: Biomed Central Ltd, v. 15, 9 p., 2014.
1471-2164
10.1186/1471-2164-15-S5-S4
WOS:000345682800004
WOS000345682800004.pdf
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Bmc Genomics
3.730
2,110
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv 9
application/pdf
dc.publisher.none.fl_str_mv Biomed Central Ltd
publisher.none.fl_str_mv Biomed Central Ltd
dc.source.none.fl_str_mv Web of Science
reponame:Repositório Institucional da UNESP
instname:Universidade Estadual Paulista (UNESP)
instacron:UNESP
instname_str Universidade Estadual Paulista (UNESP)
instacron_str UNESP
institution UNESP
reponame_str Repositório Institucional da UNESP
collection Repositório Institucional da UNESP
repository.name.fl_str_mv Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)
repository.mail.fl_str_mv
_version_ 1808129345496023040