Hla-mapper: An application to optimize the mapping of HLA sequences produced by massively parallel sequencing procedures
Autor(a) principal: | |
---|---|
Data de Publicação: | 2018 |
Outros Autores: | , , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Institucional da UNESP |
Texto Completo: | http://dx.doi.org/10.1016/j.humimm.2018.06.010 http://hdl.handle.net/11449/171178 |
Resumo: | A challenging task when more than one HLA gene is evaluated together by second-generation sequencing is to achieve a reliable read mapping. The polymorphic and repetitive nature of HLA genes might bias the read mapping process, usually underestimating variability at very polymorphic segments, or overestimating variability at some segments. To overcome this issue we developed hla-mapper, which takes into account HLA sequences derived from the IPD-IMGT/HLA database and unpublished HLA sequences to apply a scoring system. This comprehends the evaluation of each read pair, addressing them to the most likely HLA gene they were derived from. Hla-mapper provides a reliable map of HLA sequences, allowing accurate downstream analysis such as variant calling, haplotype inference, and allele typing. Moreover, hla-mapper supports whole genome, exome, and targeted sequencing data. To assess the software performance in comparison with traditional mapping algorithms, we used three different simulated datasets to compare the results obtained with hla-mapper, BWA MEM, and Bowtie2. Overall, hla-mapper presented a superior performance, mainly for the classical HLA class I genes, minimizing wrong mapping and cross-mapping that are typically observed when using BWA MEM or Bowtie2 with a single reference genome. |
id |
UNSP_36b2f2de52c6a32687bb98f57a8d6e8a |
---|---|
oai_identifier_str |
oai:repositorio.unesp.br:11449/171178 |
network_acronym_str |
UNSP |
network_name_str |
Repositório Institucional da UNESP |
repository_id_str |
2946 |
spelling |
Hla-mapper: An application to optimize the mapping of HLA sequences produced by massively parallel sequencing proceduresAlignersHLAMapping toolMHCNext Generation Sequencing (NGS)PolymorphismsSecond Generation SequencingTypingVariabilityA challenging task when more than one HLA gene is evaluated together by second-generation sequencing is to achieve a reliable read mapping. The polymorphic and repetitive nature of HLA genes might bias the read mapping process, usually underestimating variability at very polymorphic segments, or overestimating variability at some segments. To overcome this issue we developed hla-mapper, which takes into account HLA sequences derived from the IPD-IMGT/HLA database and unpublished HLA sequences to apply a scoring system. This comprehends the evaluation of each read pair, addressing them to the most likely HLA gene they were derived from. Hla-mapper provides a reliable map of HLA sequences, allowing accurate downstream analysis such as variant calling, haplotype inference, and allele typing. Moreover, hla-mapper supports whole genome, exome, and targeted sequencing data. To assess the software performance in comparison with traditional mapping algorithms, we used three different simulated datasets to compare the results obtained with hla-mapper, BWA MEM, and Bowtie2. Overall, hla-mapper presented a superior performance, mainly for the classical HLA class I genes, minimizing wrong mapping and cross-mapping that are typically observed when using BWA MEM or Bowtie2 with a single reference genome.Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)São Paulo State University (UNESP) Molecular Genetics and Bioinformatics Laboratory Experimental Research Unit (UNIPEX) School of MedicineSão Paulo State University (UNESP) Pathology Department School of MedicineDepartamento de Química Faculdade de Filosofia Ciências e Letras de Ribeirão Preto Universidade de São PauloSão Paulo State University (UNESP) Molecular Genetics and Bioinformatics Laboratory Experimental Research Unit (UNIPEX) School of MedicineSão Paulo State University (UNESP) Pathology Department School of MedicineFAPESP: 2013/17084-2CNPq: 304471/2013-5CNPq: 309572/2014-2Universidade Estadual Paulista (Unesp)Universidade de São Paulo (USP)Castelli, Erick C. [UNESP]Paz, Michelle A. [UNESP]Souza, Andréia S. [UNESP]Ramalho, Jaqueline [UNESP]Mendes-Junior, Celso Teixeira2018-12-11T16:54:16Z2018-12-11T16:54:16Z2018-09-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/article678-684application/pdfhttp://dx.doi.org/10.1016/j.humimm.2018.06.010Human Immunology, v. 79, n. 9, p. 678-684, 2018.1879-11660198-8859http://hdl.handle.net/11449/17117810.1016/j.humimm.2018.06.0102-s2.0-850494800882-s2.0-85049480088.pdfScopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengHuman Immunology0,856info:eu-repo/semantics/openAccess2024-01-17T06:28:10Zoai:repositorio.unesp.br:11449/171178Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462024-08-05T23:17:04.574672Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false |
dc.title.none.fl_str_mv |
Hla-mapper: An application to optimize the mapping of HLA sequences produced by massively parallel sequencing procedures |
title |
Hla-mapper: An application to optimize the mapping of HLA sequences produced by massively parallel sequencing procedures |
spellingShingle |
Hla-mapper: An application to optimize the mapping of HLA sequences produced by massively parallel sequencing procedures Castelli, Erick C. [UNESP] Aligners HLA Mapping tool MHC Next Generation Sequencing (NGS) Polymorphisms Second Generation Sequencing Typing Variability |
title_short |
Hla-mapper: An application to optimize the mapping of HLA sequences produced by massively parallel sequencing procedures |
title_full |
Hla-mapper: An application to optimize the mapping of HLA sequences produced by massively parallel sequencing procedures |
title_fullStr |
Hla-mapper: An application to optimize the mapping of HLA sequences produced by massively parallel sequencing procedures |
title_full_unstemmed |
Hla-mapper: An application to optimize the mapping of HLA sequences produced by massively parallel sequencing procedures |
title_sort |
Hla-mapper: An application to optimize the mapping of HLA sequences produced by massively parallel sequencing procedures |
author |
Castelli, Erick C. [UNESP] |
author_facet |
Castelli, Erick C. [UNESP] Paz, Michelle A. [UNESP] Souza, Andréia S. [UNESP] Ramalho, Jaqueline [UNESP] Mendes-Junior, Celso Teixeira |
author_role |
author |
author2 |
Paz, Michelle A. [UNESP] Souza, Andréia S. [UNESP] Ramalho, Jaqueline [UNESP] Mendes-Junior, Celso Teixeira |
author2_role |
author author author author |
dc.contributor.none.fl_str_mv |
Universidade Estadual Paulista (Unesp) Universidade de São Paulo (USP) |
dc.contributor.author.fl_str_mv |
Castelli, Erick C. [UNESP] Paz, Michelle A. [UNESP] Souza, Andréia S. [UNESP] Ramalho, Jaqueline [UNESP] Mendes-Junior, Celso Teixeira |
dc.subject.por.fl_str_mv |
Aligners HLA Mapping tool MHC Next Generation Sequencing (NGS) Polymorphisms Second Generation Sequencing Typing Variability |
topic |
Aligners HLA Mapping tool MHC Next Generation Sequencing (NGS) Polymorphisms Second Generation Sequencing Typing Variability |
description |
A challenging task when more than one HLA gene is evaluated together by second-generation sequencing is to achieve a reliable read mapping. The polymorphic and repetitive nature of HLA genes might bias the read mapping process, usually underestimating variability at very polymorphic segments, or overestimating variability at some segments. To overcome this issue we developed hla-mapper, which takes into account HLA sequences derived from the IPD-IMGT/HLA database and unpublished HLA sequences to apply a scoring system. This comprehends the evaluation of each read pair, addressing them to the most likely HLA gene they were derived from. Hla-mapper provides a reliable map of HLA sequences, allowing accurate downstream analysis such as variant calling, haplotype inference, and allele typing. Moreover, hla-mapper supports whole genome, exome, and targeted sequencing data. To assess the software performance in comparison with traditional mapping algorithms, we used three different simulated datasets to compare the results obtained with hla-mapper, BWA MEM, and Bowtie2. Overall, hla-mapper presented a superior performance, mainly for the classical HLA class I genes, minimizing wrong mapping and cross-mapping that are typically observed when using BWA MEM or Bowtie2 with a single reference genome. |
publishDate |
2018 |
dc.date.none.fl_str_mv |
2018-12-11T16:54:16Z 2018-12-11T16:54:16Z 2018-09-01 |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://dx.doi.org/10.1016/j.humimm.2018.06.010 Human Immunology, v. 79, n. 9, p. 678-684, 2018. 1879-1166 0198-8859 http://hdl.handle.net/11449/171178 10.1016/j.humimm.2018.06.010 2-s2.0-85049480088 2-s2.0-85049480088.pdf |
url |
http://dx.doi.org/10.1016/j.humimm.2018.06.010 http://hdl.handle.net/11449/171178 |
identifier_str_mv |
Human Immunology, v. 79, n. 9, p. 678-684, 2018. 1879-1166 0198-8859 10.1016/j.humimm.2018.06.010 2-s2.0-85049480088 2-s2.0-85049480088.pdf |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
Human Immunology 0,856 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
678-684 application/pdf |
dc.source.none.fl_str_mv |
Scopus reponame:Repositório Institucional da UNESP instname:Universidade Estadual Paulista (UNESP) instacron:UNESP |
instname_str |
Universidade Estadual Paulista (UNESP) |
instacron_str |
UNESP |
institution |
UNESP |
reponame_str |
Repositório Institucional da UNESP |
collection |
Repositório Institucional da UNESP |
repository.name.fl_str_mv |
Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP) |
repository.mail.fl_str_mv |
|
_version_ |
1808129503846727680 |