Hla-mapper: An application to optimize the mapping of HLA sequences produced by massively parallel sequencing procedures

Detalhes bibliográficos
Autor(a) principal: Castelli, Erick C. [UNESP]
Data de Publicação: 2018
Outros Autores: Paz, Michelle A. [UNESP], Souza, Andréia S. [UNESP], Ramalho, Jaqueline [UNESP], Mendes-Junior, Celso Teixeira
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Institucional da UNESP
Texto Completo: http://dx.doi.org/10.1016/j.humimm.2018.06.010
http://hdl.handle.net/11449/171178
Resumo: A challenging task when more than one HLA gene is evaluated together by second-generation sequencing is to achieve a reliable read mapping. The polymorphic and repetitive nature of HLA genes might bias the read mapping process, usually underestimating variability at very polymorphic segments, or overestimating variability at some segments. To overcome this issue we developed hla-mapper, which takes into account HLA sequences derived from the IPD-IMGT/HLA database and unpublished HLA sequences to apply a scoring system. This comprehends the evaluation of each read pair, addressing them to the most likely HLA gene they were derived from. Hla-mapper provides a reliable map of HLA sequences, allowing accurate downstream analysis such as variant calling, haplotype inference, and allele typing. Moreover, hla-mapper supports whole genome, exome, and targeted sequencing data. To assess the software performance in comparison with traditional mapping algorithms, we used three different simulated datasets to compare the results obtained with hla-mapper, BWA MEM, and Bowtie2. Overall, hla-mapper presented a superior performance, mainly for the classical HLA class I genes, minimizing wrong mapping and cross-mapping that are typically observed when using BWA MEM or Bowtie2 with a single reference genome.
id UNSP_36b2f2de52c6a32687bb98f57a8d6e8a
oai_identifier_str oai:repositorio.unesp.br:11449/171178
network_acronym_str UNSP
network_name_str Repositório Institucional da UNESP
repository_id_str 2946
spelling Hla-mapper: An application to optimize the mapping of HLA sequences produced by massively parallel sequencing proceduresAlignersHLAMapping toolMHCNext Generation Sequencing (NGS)PolymorphismsSecond Generation SequencingTypingVariabilityA challenging task when more than one HLA gene is evaluated together by second-generation sequencing is to achieve a reliable read mapping. The polymorphic and repetitive nature of HLA genes might bias the read mapping process, usually underestimating variability at very polymorphic segments, or overestimating variability at some segments. To overcome this issue we developed hla-mapper, which takes into account HLA sequences derived from the IPD-IMGT/HLA database and unpublished HLA sequences to apply a scoring system. This comprehends the evaluation of each read pair, addressing them to the most likely HLA gene they were derived from. Hla-mapper provides a reliable map of HLA sequences, allowing accurate downstream analysis such as variant calling, haplotype inference, and allele typing. Moreover, hla-mapper supports whole genome, exome, and targeted sequencing data. To assess the software performance in comparison with traditional mapping algorithms, we used three different simulated datasets to compare the results obtained with hla-mapper, BWA MEM, and Bowtie2. Overall, hla-mapper presented a superior performance, mainly for the classical HLA class I genes, minimizing wrong mapping and cross-mapping that are typically observed when using BWA MEM or Bowtie2 with a single reference genome.Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)São Paulo State University (UNESP) Molecular Genetics and Bioinformatics Laboratory Experimental Research Unit (UNIPEX) School of MedicineSão Paulo State University (UNESP) Pathology Department School of MedicineDepartamento de Química Faculdade de Filosofia Ciências e Letras de Ribeirão Preto Universidade de São PauloSão Paulo State University (UNESP) Molecular Genetics and Bioinformatics Laboratory Experimental Research Unit (UNIPEX) School of MedicineSão Paulo State University (UNESP) Pathology Department School of MedicineFAPESP: 2013/17084-2CNPq: 304471/2013-5CNPq: 309572/2014-2Universidade Estadual Paulista (Unesp)Universidade de São Paulo (USP)Castelli, Erick C. [UNESP]Paz, Michelle A. [UNESP]Souza, Andréia S. [UNESP]Ramalho, Jaqueline [UNESP]Mendes-Junior, Celso Teixeira2018-12-11T16:54:16Z2018-12-11T16:54:16Z2018-09-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/article678-684application/pdfhttp://dx.doi.org/10.1016/j.humimm.2018.06.010Human Immunology, v. 79, n. 9, p. 678-684, 2018.1879-11660198-8859http://hdl.handle.net/11449/17117810.1016/j.humimm.2018.06.0102-s2.0-850494800882-s2.0-85049480088.pdfScopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengHuman Immunology0,856info:eu-repo/semantics/openAccess2024-01-17T06:28:10Zoai:repositorio.unesp.br:11449/171178Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462024-08-05T23:17:04.574672Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false
dc.title.none.fl_str_mv Hla-mapper: An application to optimize the mapping of HLA sequences produced by massively parallel sequencing procedures
title Hla-mapper: An application to optimize the mapping of HLA sequences produced by massively parallel sequencing procedures
spellingShingle Hla-mapper: An application to optimize the mapping of HLA sequences produced by massively parallel sequencing procedures
Castelli, Erick C. [UNESP]
Aligners
HLA
Mapping tool
MHC
Next Generation Sequencing (NGS)
Polymorphisms
Second Generation Sequencing
Typing
Variability
title_short Hla-mapper: An application to optimize the mapping of HLA sequences produced by massively parallel sequencing procedures
title_full Hla-mapper: An application to optimize the mapping of HLA sequences produced by massively parallel sequencing procedures
title_fullStr Hla-mapper: An application to optimize the mapping of HLA sequences produced by massively parallel sequencing procedures
title_full_unstemmed Hla-mapper: An application to optimize the mapping of HLA sequences produced by massively parallel sequencing procedures
title_sort Hla-mapper: An application to optimize the mapping of HLA sequences produced by massively parallel sequencing procedures
author Castelli, Erick C. [UNESP]
author_facet Castelli, Erick C. [UNESP]
Paz, Michelle A. [UNESP]
Souza, Andréia S. [UNESP]
Ramalho, Jaqueline [UNESP]
Mendes-Junior, Celso Teixeira
author_role author
author2 Paz, Michelle A. [UNESP]
Souza, Andréia S. [UNESP]
Ramalho, Jaqueline [UNESP]
Mendes-Junior, Celso Teixeira
author2_role author
author
author
author
dc.contributor.none.fl_str_mv Universidade Estadual Paulista (Unesp)
Universidade de São Paulo (USP)
dc.contributor.author.fl_str_mv Castelli, Erick C. [UNESP]
Paz, Michelle A. [UNESP]
Souza, Andréia S. [UNESP]
Ramalho, Jaqueline [UNESP]
Mendes-Junior, Celso Teixeira
dc.subject.por.fl_str_mv Aligners
HLA
Mapping tool
MHC
Next Generation Sequencing (NGS)
Polymorphisms
Second Generation Sequencing
Typing
Variability
topic Aligners
HLA
Mapping tool
MHC
Next Generation Sequencing (NGS)
Polymorphisms
Second Generation Sequencing
Typing
Variability
description A challenging task when more than one HLA gene is evaluated together by second-generation sequencing is to achieve a reliable read mapping. The polymorphic and repetitive nature of HLA genes might bias the read mapping process, usually underestimating variability at very polymorphic segments, or overestimating variability at some segments. To overcome this issue we developed hla-mapper, which takes into account HLA sequences derived from the IPD-IMGT/HLA database and unpublished HLA sequences to apply a scoring system. This comprehends the evaluation of each read pair, addressing them to the most likely HLA gene they were derived from. Hla-mapper provides a reliable map of HLA sequences, allowing accurate downstream analysis such as variant calling, haplotype inference, and allele typing. Moreover, hla-mapper supports whole genome, exome, and targeted sequencing data. To assess the software performance in comparison with traditional mapping algorithms, we used three different simulated datasets to compare the results obtained with hla-mapper, BWA MEM, and Bowtie2. Overall, hla-mapper presented a superior performance, mainly for the classical HLA class I genes, minimizing wrong mapping and cross-mapping that are typically observed when using BWA MEM or Bowtie2 with a single reference genome.
publishDate 2018
dc.date.none.fl_str_mv 2018-12-11T16:54:16Z
2018-12-11T16:54:16Z
2018-09-01
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://dx.doi.org/10.1016/j.humimm.2018.06.010
Human Immunology, v. 79, n. 9, p. 678-684, 2018.
1879-1166
0198-8859
http://hdl.handle.net/11449/171178
10.1016/j.humimm.2018.06.010
2-s2.0-85049480088
2-s2.0-85049480088.pdf
url http://dx.doi.org/10.1016/j.humimm.2018.06.010
http://hdl.handle.net/11449/171178
identifier_str_mv Human Immunology, v. 79, n. 9, p. 678-684, 2018.
1879-1166
0198-8859
10.1016/j.humimm.2018.06.010
2-s2.0-85049480088
2-s2.0-85049480088.pdf
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Human Immunology
0,856
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv 678-684
application/pdf
dc.source.none.fl_str_mv Scopus
reponame:Repositório Institucional da UNESP
instname:Universidade Estadual Paulista (UNESP)
instacron:UNESP
instname_str Universidade Estadual Paulista (UNESP)
instacron_str UNESP
institution UNESP
reponame_str Repositório Institucional da UNESP
collection Repositório Institucional da UNESP
repository.name.fl_str_mv Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)
repository.mail.fl_str_mv
_version_ 1808129503846727680