Next-generation sequencing reveals large connected networks of intra-host HCV variants
Autor(a) principal: | |
---|---|
Data de Publicação: | 2014 |
Outros Autores: | , , , , , , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Institucional da UNESP |
Texto Completo: | http://dx.doi.org/10.1186/1471-2164-15-S5-S4 http://hdl.handle.net/11449/117309 |
Resumo: | Background: Next-generation sequencing (NGS) allows for sampling numerous viral variants from infected patients. This provides a novel opportunity to represent and study the mutational landscape of Hepatitis C Virus (HCV) within a single host.Results: Intra-host variants of the HCV E1/E2 region were extensively sampled from 58 chronically infected patients. After NGS error correction, the average number of reads and variants obtained from each sample were 3202 and 464, respectively. The distance between each pair of variants was calculated and networks were created for each patient, where each node is a variant and two nodes are connected by a link if the nucleotide distance between them is 1. The work focused on large components having > 5% of all reads, which in average account for 93.7% of all reads found in a patient. The distance between any two variants calculated over the component correlated strongly with nucleotide distances (r = 0.9499; p = 0.0001), a better correlation than the one obtained with Neighbour-Joining trees (r = 0.7624; p = 0.0001). In each patient, components were well separated, with the average distance between (6.53%) being 10 times greater than within each component (0.68%). The ratio of nonsynonymous to synonymous changes was calculated and some patients (6.9%) showed a mixture of networks under strong negative and positive selection. All components were robust to in silico stochastic sampling; even after randomly removing 85% of all reads, the largest connected component in the new subsample still involved 82.4% of remaining nodes. In vitro sampling showed that 93.02% of components present in the original sample were also found in experimental replicas, with 81.6% of reads found in both. When syringe-sharing transmission events were simulated, 91.2% of all simulated transmission events seeded all components present in the source.Conclusions: Most intra-host variants are organized into distinct single-mutation components that are: well separated from each other, represent genetic distances between viral variants, robust to sampling, reproducible and likely seeded during transmission events. Facilitated by NGS, large components offer a novel evolutionary framework for genetic analysis of intra-host viral populations and understanding transmission, immune escape and drug resistance. |
id |
UNSP_090891fc5b2bab958eb9cf2dc9e6eccc |
---|---|
oai_identifier_str |
oai:repositorio.unesp.br:11449/117309 |
network_acronym_str |
UNSP |
network_name_str |
Repositório Institucional da UNESP |
repository_id_str |
2946 |
spelling |
Next-generation sequencing reveals large connected networks of intra-host HCV variantsBackground: Next-generation sequencing (NGS) allows for sampling numerous viral variants from infected patients. This provides a novel opportunity to represent and study the mutational landscape of Hepatitis C Virus (HCV) within a single host.Results: Intra-host variants of the HCV E1/E2 region were extensively sampled from 58 chronically infected patients. After NGS error correction, the average number of reads and variants obtained from each sample were 3202 and 464, respectively. The distance between each pair of variants was calculated and networks were created for each patient, where each node is a variant and two nodes are connected by a link if the nucleotide distance between them is 1. The work focused on large components having > 5% of all reads, which in average account for 93.7% of all reads found in a patient. The distance between any two variants calculated over the component correlated strongly with nucleotide distances (r = 0.9499; p = 0.0001), a better correlation than the one obtained with Neighbour-Joining trees (r = 0.7624; p = 0.0001). In each patient, components were well separated, with the average distance between (6.53%) being 10 times greater than within each component (0.68%). The ratio of nonsynonymous to synonymous changes was calculated and some patients (6.9%) showed a mixture of networks under strong negative and positive selection. All components were robust to in silico stochastic sampling; even after randomly removing 85% of all reads, the largest connected component in the new subsample still involved 82.4% of remaining nodes. In vitro sampling showed that 93.02% of components present in the original sample were also found in experimental replicas, with 81.6% of reads found in both. When syringe-sharing transmission events were simulated, 91.2% of all simulated transmission events seeded all components present in the source.Conclusions: Most intra-host variants are organized into distinct single-mutation components that are: well separated from each other, represent genetic distances between viral variants, robust to sampling, reproducible and likely seeded during transmission events. Facilitated by NGS, large components offer a novel evolutionary framework for genetic analysis of intra-host viral populations and understanding transmission, immune escape and drug resistance.Centers for Disease Control and PreventionCtr Dis Control & Prevent, Div Viral Hepatitis, Atlanta, GA 30333 USAUNESP Sao Paulo State Univ, Dept Biol, Lab Genom Studies, Sao Paulo, BrazilHarvard Univ, Beth Israel Deaconess Med Ctr, Sch Med, Liver Ctr,Div Gastroenterol, Cambridge, MA 02138 USAUNESP Sao Paulo State Univ, Dept Biol, Lab Genom Studies, Sao Paulo, BrazilBiomed Central LtdCtr Dis Control & PreventUniversidade Estadual Paulista (Unesp)Harvard UnivCampo, David S.Dimitrova, ZoyaYamasaki, Lilian [UNESP]Skums, PavelLau, Daryl T. Y.Vaughan, GilbertoForbi, Joseph C.Teo, Chong-GeeKhudyakov, Yury2015-03-18T15:55:48Z2015-03-18T15:55:48Z2014-07-14info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/article9application/pdfhttp://dx.doi.org/10.1186/1471-2164-15-S5-S4Bmc Genomics. London: Biomed Central Ltd, v. 15, 9 p., 2014.1471-2164http://hdl.handle.net/11449/11730910.1186/1471-2164-15-S5-S4WOS:000345682800004WOS000345682800004.pdfWeb of Sciencereponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengBmc Genomics3.7302,110info:eu-repo/semantics/openAccess2023-12-30T06:16:22Zoai:repositorio.unesp.br:11449/117309Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462024-08-05T21:40:34.288996Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false |
dc.title.none.fl_str_mv |
Next-generation sequencing reveals large connected networks of intra-host HCV variants |
title |
Next-generation sequencing reveals large connected networks of intra-host HCV variants |
spellingShingle |
Next-generation sequencing reveals large connected networks of intra-host HCV variants Campo, David S. |
title_short |
Next-generation sequencing reveals large connected networks of intra-host HCV variants |
title_full |
Next-generation sequencing reveals large connected networks of intra-host HCV variants |
title_fullStr |
Next-generation sequencing reveals large connected networks of intra-host HCV variants |
title_full_unstemmed |
Next-generation sequencing reveals large connected networks of intra-host HCV variants |
title_sort |
Next-generation sequencing reveals large connected networks of intra-host HCV variants |
author |
Campo, David S. |
author_facet |
Campo, David S. Dimitrova, Zoya Yamasaki, Lilian [UNESP] Skums, Pavel Lau, Daryl T. Y. Vaughan, Gilberto Forbi, Joseph C. Teo, Chong-Gee Khudyakov, Yury |
author_role |
author |
author2 |
Dimitrova, Zoya Yamasaki, Lilian [UNESP] Skums, Pavel Lau, Daryl T. Y. Vaughan, Gilberto Forbi, Joseph C. Teo, Chong-Gee Khudyakov, Yury |
author2_role |
author author author author author author author author |
dc.contributor.none.fl_str_mv |
Ctr Dis Control & Prevent Universidade Estadual Paulista (Unesp) Harvard Univ |
dc.contributor.author.fl_str_mv |
Campo, David S. Dimitrova, Zoya Yamasaki, Lilian [UNESP] Skums, Pavel Lau, Daryl T. Y. Vaughan, Gilberto Forbi, Joseph C. Teo, Chong-Gee Khudyakov, Yury |
description |
Background: Next-generation sequencing (NGS) allows for sampling numerous viral variants from infected patients. This provides a novel opportunity to represent and study the mutational landscape of Hepatitis C Virus (HCV) within a single host.Results: Intra-host variants of the HCV E1/E2 region were extensively sampled from 58 chronically infected patients. After NGS error correction, the average number of reads and variants obtained from each sample were 3202 and 464, respectively. The distance between each pair of variants was calculated and networks were created for each patient, where each node is a variant and two nodes are connected by a link if the nucleotide distance between them is 1. The work focused on large components having > 5% of all reads, which in average account for 93.7% of all reads found in a patient. The distance between any two variants calculated over the component correlated strongly with nucleotide distances (r = 0.9499; p = 0.0001), a better correlation than the one obtained with Neighbour-Joining trees (r = 0.7624; p = 0.0001). In each patient, components were well separated, with the average distance between (6.53%) being 10 times greater than within each component (0.68%). The ratio of nonsynonymous to synonymous changes was calculated and some patients (6.9%) showed a mixture of networks under strong negative and positive selection. All components were robust to in silico stochastic sampling; even after randomly removing 85% of all reads, the largest connected component in the new subsample still involved 82.4% of remaining nodes. In vitro sampling showed that 93.02% of components present in the original sample were also found in experimental replicas, with 81.6% of reads found in both. When syringe-sharing transmission events were simulated, 91.2% of all simulated transmission events seeded all components present in the source.Conclusions: Most intra-host variants are organized into distinct single-mutation components that are: well separated from each other, represent genetic distances between viral variants, robust to sampling, reproducible and likely seeded during transmission events. Facilitated by NGS, large components offer a novel evolutionary framework for genetic analysis of intra-host viral populations and understanding transmission, immune escape and drug resistance. |
publishDate |
2014 |
dc.date.none.fl_str_mv |
2014-07-14 2015-03-18T15:55:48Z 2015-03-18T15:55:48Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://dx.doi.org/10.1186/1471-2164-15-S5-S4 Bmc Genomics. London: Biomed Central Ltd, v. 15, 9 p., 2014. 1471-2164 http://hdl.handle.net/11449/117309 10.1186/1471-2164-15-S5-S4 WOS:000345682800004 WOS000345682800004.pdf |
url |
http://dx.doi.org/10.1186/1471-2164-15-S5-S4 http://hdl.handle.net/11449/117309 |
identifier_str_mv |
Bmc Genomics. London: Biomed Central Ltd, v. 15, 9 p., 2014. 1471-2164 10.1186/1471-2164-15-S5-S4 WOS:000345682800004 WOS000345682800004.pdf |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
Bmc Genomics 3.730 2,110 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
9 application/pdf |
dc.publisher.none.fl_str_mv |
Biomed Central Ltd |
publisher.none.fl_str_mv |
Biomed Central Ltd |
dc.source.none.fl_str_mv |
Web of Science reponame:Repositório Institucional da UNESP instname:Universidade Estadual Paulista (UNESP) instacron:UNESP |
instname_str |
Universidade Estadual Paulista (UNESP) |
instacron_str |
UNESP |
institution |
UNESP |
reponame_str |
Repositório Institucional da UNESP |
collection |
Repositório Institucional da UNESP |
repository.name.fl_str_mv |
Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP) |
repository.mail.fl_str_mv |
|
_version_ |
1808129345496023040 |