Analysis and comparison of the STR genotypes called with HipSTR, STRait Razor and toaSTR by using next generation sequencing data in a Brazilian population sample

Detalhes bibliográficos
Autor(a) principal: Valle-Silva, Guilherme
Data de Publicação: 2022
Outros Autores: Frontanilla, Tamara Soledad, Ayala, Jesús, Donadi, Eduardo Antonio, Simões, Aguinaldo Luiz, Castelli, Erick C. [UNESP], Mendes-Junior, Celso Teixeira
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Institucional da UNESP
Texto Completo: http://dx.doi.org/10.1016/j.fsigen.2022.102676
http://hdl.handle.net/11449/231614
Resumo: Short tandem repeats (STRs) are particularly difficult to genotype with rapid evolving next-generation sequencing (NGS) technology. Long amplicons containing repetitive sequences result in alignment and genotyping errors. Stutters arising from polymerase slippage often result in reads with additional or missing repeat copies. Many tools are available for analysis of STR markers from NGS data. This study has evaluated the concordance of the HipSTR, STRait Razor, and toaSTR tools for STR genotype calling; NGS data obtained from a highly genetically diverse Brazilian population sample have been used. We found that toaSTR can retrieve a larger number of genotypes (93.8%), whereas HipSTR (84.9%) and STRait Razor present much lower genotype calling (75.3%). Accuracy levels for genotype calling are very similar (identical genotypes ~95% and correct alleles ~ 97.5%) across the three methods. All the markers presenting the same genotype through the methods are in Hardy–Weinberg equilibrium. We found that combined match probability and combined exclusion power are 2.90 × 10−28 and 0.99999999982, respectively. Although toaSTR has varying locus-specific differences and better overall performance of toaSTR, the three programs are reliable genotyping tools. Notwithstanding, additional effort is necessary to improve the genotype calling accuracy of next-generation sequencing datasets.
id UNSP_7a5ff2076edc334987a26f7acd845fc7
oai_identifier_str oai:repositorio.unesp.br:11449/231614
network_acronym_str UNSP
network_name_str Repositório Institucional da UNESP
repository_id_str 2946
spelling Analysis and comparison of the STR genotypes called with HipSTR, STRait Razor and toaSTR by using next generation sequencing data in a Brazilian population sampleBioinformaticsBrazilCODISForensic geneticsMassively parallel sequencingShort tandem repeatsShort tandem repeats (STRs) are particularly difficult to genotype with rapid evolving next-generation sequencing (NGS) technology. Long amplicons containing repetitive sequences result in alignment and genotyping errors. Stutters arising from polymerase slippage often result in reads with additional or missing repeat copies. Many tools are available for analysis of STR markers from NGS data. This study has evaluated the concordance of the HipSTR, STRait Razor, and toaSTR tools for STR genotype calling; NGS data obtained from a highly genetically diverse Brazilian population sample have been used. We found that toaSTR can retrieve a larger number of genotypes (93.8%), whereas HipSTR (84.9%) and STRait Razor present much lower genotype calling (75.3%). Accuracy levels for genotype calling are very similar (identical genotypes ~95% and correct alleles ~ 97.5%) across the three methods. All the markers presenting the same genotype through the methods are in Hardy–Weinberg equilibrium. We found that combined match probability and combined exclusion power are 2.90 × 10−28 and 0.99999999982, respectively. Although toaSTR has varying locus-specific differences and better overall performance of toaSTR, the three programs are reliable genotyping tools. Notwithstanding, additional effort is necessary to improve the genotype calling accuracy of next-generation sequencing datasets.Departamento de Química Laboratório de Pesquisas Forenses e Genômicas Faculdade de Filosofia Ciências e Letras de Ribeirão Preto Universidade de São Paulo, SPDepartamento de Genética Faculdade de Medicina de Ribeirão Preto Universidade de São Paulo, SPSoftec S.R.LDivisão de Imunologia Clínica Departamento de Clínica Médica Faculdade de Medicina de Ribeirão Preto Universidade de São Paulo, SPSão Paulo State University (UNESP) Department of Pathology School of Medicine, SPSão Paulo State University (UNESP) Department of Pathology School of Medicine, SPUniversidade de São Paulo (USP)Softec S.R.LUniversidade Estadual Paulista (UNESP)Valle-Silva, GuilhermeFrontanilla, Tamara SoledadAyala, JesúsDonadi, Eduardo AntonioSimões, Aguinaldo LuizCastelli, Erick C. [UNESP]Mendes-Junior, Celso Teixeira2022-04-29T08:46:35Z2022-04-29T08:46:35Z2022-05-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://dx.doi.org/10.1016/j.fsigen.2022.102676Forensic Science International: Genetics, v. 58.1878-03261872-4973http://hdl.handle.net/11449/23161410.1016/j.fsigen.2022.1026762-s2.0-85124188207Scopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengForensic Science International: Geneticsinfo:eu-repo/semantics/openAccess2022-04-29T08:46:35Zoai:repositorio.unesp.br:11449/231614Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462024-08-05T18:49:19.516293Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false
dc.title.none.fl_str_mv Analysis and comparison of the STR genotypes called with HipSTR, STRait Razor and toaSTR by using next generation sequencing data in a Brazilian population sample
title Analysis and comparison of the STR genotypes called with HipSTR, STRait Razor and toaSTR by using next generation sequencing data in a Brazilian population sample
spellingShingle Analysis and comparison of the STR genotypes called with HipSTR, STRait Razor and toaSTR by using next generation sequencing data in a Brazilian population sample
Valle-Silva, Guilherme
Bioinformatics
Brazil
CODIS
Forensic genetics
Massively parallel sequencing
Short tandem repeats
title_short Analysis and comparison of the STR genotypes called with HipSTR, STRait Razor and toaSTR by using next generation sequencing data in a Brazilian population sample
title_full Analysis and comparison of the STR genotypes called with HipSTR, STRait Razor and toaSTR by using next generation sequencing data in a Brazilian population sample
title_fullStr Analysis and comparison of the STR genotypes called with HipSTR, STRait Razor and toaSTR by using next generation sequencing data in a Brazilian population sample
title_full_unstemmed Analysis and comparison of the STR genotypes called with HipSTR, STRait Razor and toaSTR by using next generation sequencing data in a Brazilian population sample
title_sort Analysis and comparison of the STR genotypes called with HipSTR, STRait Razor and toaSTR by using next generation sequencing data in a Brazilian population sample
author Valle-Silva, Guilherme
author_facet Valle-Silva, Guilherme
Frontanilla, Tamara Soledad
Ayala, Jesús
Donadi, Eduardo Antonio
Simões, Aguinaldo Luiz
Castelli, Erick C. [UNESP]
Mendes-Junior, Celso Teixeira
author_role author
author2 Frontanilla, Tamara Soledad
Ayala, Jesús
Donadi, Eduardo Antonio
Simões, Aguinaldo Luiz
Castelli, Erick C. [UNESP]
Mendes-Junior, Celso Teixeira
author2_role author
author
author
author
author
author
dc.contributor.none.fl_str_mv Universidade de São Paulo (USP)
Softec S.R.L
Universidade Estadual Paulista (UNESP)
dc.contributor.author.fl_str_mv Valle-Silva, Guilherme
Frontanilla, Tamara Soledad
Ayala, Jesús
Donadi, Eduardo Antonio
Simões, Aguinaldo Luiz
Castelli, Erick C. [UNESP]
Mendes-Junior, Celso Teixeira
dc.subject.por.fl_str_mv Bioinformatics
Brazil
CODIS
Forensic genetics
Massively parallel sequencing
Short tandem repeats
topic Bioinformatics
Brazil
CODIS
Forensic genetics
Massively parallel sequencing
Short tandem repeats
description Short tandem repeats (STRs) are particularly difficult to genotype with rapid evolving next-generation sequencing (NGS) technology. Long amplicons containing repetitive sequences result in alignment and genotyping errors. Stutters arising from polymerase slippage often result in reads with additional or missing repeat copies. Many tools are available for analysis of STR markers from NGS data. This study has evaluated the concordance of the HipSTR, STRait Razor, and toaSTR tools for STR genotype calling; NGS data obtained from a highly genetically diverse Brazilian population sample have been used. We found that toaSTR can retrieve a larger number of genotypes (93.8%), whereas HipSTR (84.9%) and STRait Razor present much lower genotype calling (75.3%). Accuracy levels for genotype calling are very similar (identical genotypes ~95% and correct alleles ~ 97.5%) across the three methods. All the markers presenting the same genotype through the methods are in Hardy–Weinberg equilibrium. We found that combined match probability and combined exclusion power are 2.90 × 10−28 and 0.99999999982, respectively. Although toaSTR has varying locus-specific differences and better overall performance of toaSTR, the three programs are reliable genotyping tools. Notwithstanding, additional effort is necessary to improve the genotype calling accuracy of next-generation sequencing datasets.
publishDate 2022
dc.date.none.fl_str_mv 2022-04-29T08:46:35Z
2022-04-29T08:46:35Z
2022-05-01
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://dx.doi.org/10.1016/j.fsigen.2022.102676
Forensic Science International: Genetics, v. 58.
1878-0326
1872-4973
http://hdl.handle.net/11449/231614
10.1016/j.fsigen.2022.102676
2-s2.0-85124188207
url http://dx.doi.org/10.1016/j.fsigen.2022.102676
http://hdl.handle.net/11449/231614
identifier_str_mv Forensic Science International: Genetics, v. 58.
1878-0326
1872-4973
10.1016/j.fsigen.2022.102676
2-s2.0-85124188207
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Forensic Science International: Genetics
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.source.none.fl_str_mv Scopus
reponame:Repositório Institucional da UNESP
instname:Universidade Estadual Paulista (UNESP)
instacron:UNESP
instname_str Universidade Estadual Paulista (UNESP)
instacron_str UNESP
institution UNESP
reponame_str Repositório Institucional da UNESP
collection Repositório Institucional da UNESP
repository.name.fl_str_mv Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)
repository.mail.fl_str_mv
_version_ 1808128983775051776