Revealing misassembled segments in the bovine reference genome by high resolution linkage disequilibrium scan
Autor(a) principal: | |
---|---|
Data de Publicação: | 2016 |
Outros Autores: | , , , , , , , , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Institucional da UNESP |
DOI: | 10.1186/s12864-016-3049-8 |
Texto Completo: | http://dx.doi.org/10.1186/s12864-016-3049-8 http://hdl.handle.net/11449/178256 |
Resumo: | Background: Misassembly signatures, created by shuffling the order of sequences while assembling a genome, can be detected by the unexpected behavior of marker linkage disequilibrium (LD) decay. We developed a heuristic process to identify misassembly signatures, applied it to the bovine reference genome assembly (UMDv3.1) and presented the consequences of misassemblies in two case studies. Results: We identified 2,906 single nucleotide polymorphism (SNP) markers presenting unexpected LD decay behavior in 626 putative misassembled contigs, which comprised less than 1 % of the whole genome. Although this represents a small fraction of the reference sequence, these poorly assembled segments can lead to severe implications to local genome context. For instance, we showed that one of the misassembled regions mapped to the POLL locus, which affected the annotation of positional candidate genes in a GWAS case study for polledness in Nellore (Bos indicus beef cattle). Additionally, we found that poorly performing markers in imputation mapped to putative misassembled regions, and that correction of marker positions based on LD was capable to recover imputation accuracy. Conclusions: This heuristic approach can be useful to cross validate reference assemblies and to filter out markers located at low confidence genomic regions before conducting downstream analyses. |
id |
UNSP_9ed415e275b495023d47d63d7d6ea9b1 |
---|---|
oai_identifier_str |
oai:repositorio.unesp.br:11449/178256 |
network_acronym_str |
UNSP |
network_name_str |
Repositório Institucional da UNESP |
repository_id_str |
2946 |
spelling |
Revealing misassembled segments in the bovine reference genome by high resolution linkage disequilibrium scanBos indicusBos taurusGWASImputationLinkage disequilibriumMisassemblyBackground: Misassembly signatures, created by shuffling the order of sequences while assembling a genome, can be detected by the unexpected behavior of marker linkage disequilibrium (LD) decay. We developed a heuristic process to identify misassembly signatures, applied it to the bovine reference genome assembly (UMDv3.1) and presented the consequences of misassemblies in two case studies. Results: We identified 2,906 single nucleotide polymorphism (SNP) markers presenting unexpected LD decay behavior in 626 putative misassembled contigs, which comprised less than 1 % of the whole genome. Although this represents a small fraction of the reference sequence, these poorly assembled segments can lead to severe implications to local genome context. For instance, we showed that one of the misassembled regions mapped to the POLL locus, which affected the annotation of positional candidate genes in a GWAS case study for polledness in Nellore (Bos indicus beef cattle). Additionally, we found that poorly performing markers in imputation mapped to putative misassembled regions, and that correction of marker positions based on LD was capable to recover imputation accuracy. Conclusions: This heuristic approach can be useful to cross validate reference assemblies and to filter out markers located at low confidence genomic regions before conducting downstream analyses.Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)Universidade Estadual Paulista - UNESP Faculdade de Ci�ncias Agr�rias e Veterin�rias, Campus de JaboticabalNofimaUniversidade Estadual Paulista - UNESP Faculdade de Medicina Veterin�ria de Ara�atuba, Campus de Ara�atubaAnimal Genomics and Improvement Laboratory ARS USDAUniversit� Cattolica del Sacro Cuore Institute of Zootechnics and Biodiversity and Ancient DNA Research CenterUniversit� Cattolica del Sacro Cuore Nutrigenomics and Proteomics Research Center - PRONUTRIGENBOKU - University of Natural Resources and Life Sciences Department of Sustainable Agricultural Systems Division of Livestock SciencesInternational Atomic Energy Agency (IAEA) Collaborating Centre on Animal Genomics and BioinformaticsUniversidade Estadual Paulista - UNESP Faculdade de Ci�ncias Agr�rias e Tecnol�gicas, Campus de DracenaEmbrapa Gado de LeiteUniversidade Estadual Paulista - UNESP Faculdade de Ci�ncias Agr�rias e Veterin�rias, Campus de JaboticabalUniversidade Estadual Paulista - UNESP Faculdade de Medicina Veterin�ria de Ara�atuba, Campus de Ara�atubaUniversidade Estadual Paulista - UNESP Faculdade de Ci�ncias Agr�rias e Tecnol�gicas, Campus de DracenaCNPq: 560922/2010-8Universidade Estadual Paulista (Unesp)NofimaUSDAInstitute of Zootechnics and Biodiversity and Ancient DNA Research CenterNutrigenomics and Proteomics Research Center - PRONUTRIGENDivision of Livestock SciencesInternational Atomic Energy Agency (IAEA) Collaborating Centre on Animal Genomics and BioinformaticsEmpresa Brasileira de Pesquisa Agropecuária (EMBRAPA)Utsunomiya, Adam T.H. [UNESP]Santos, Daniel J.A. [UNESP]Boison, Solomon A.Utsunomiya, Yuri T. [UNESP]Milanesi, Marco [UNESP]Bickhart, Derek M.Ajmone-Marsan, PaoloS�lkner, JohannGarcia, Jos� F. [UNESP]da Fonseca, Ricardo [UNESP]da Silva, Marcos V.G.B.2018-12-11T17:29:31Z2018-12-11T17:29:31Z2016-09-05info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://dx.doi.org/10.1186/s12864-016-3049-8BMC Genomics, v. 17, n. 1, 2016.1471-2164http://hdl.handle.net/11449/17825610.1186/s12864-016-3049-82-s2.0-849849936132-s2.0-84984993613.pdf03083523735455580000-0002-1163-6296Scopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengBMC Genomics2,110info:eu-repo/semantics/openAccess2024-05-07T13:48:05Zoai:repositorio.unesp.br:11449/178256Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462024-08-05T21:45:12.989882Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false |
dc.title.none.fl_str_mv |
Revealing misassembled segments in the bovine reference genome by high resolution linkage disequilibrium scan |
title |
Revealing misassembled segments in the bovine reference genome by high resolution linkage disequilibrium scan |
spellingShingle |
Revealing misassembled segments in the bovine reference genome by high resolution linkage disequilibrium scan Revealing misassembled segments in the bovine reference genome by high resolution linkage disequilibrium scan Utsunomiya, Adam T.H. [UNESP] Bos indicus Bos taurus GWAS Imputation Linkage disequilibrium Misassembly Utsunomiya, Adam T.H. [UNESP] Bos indicus Bos taurus GWAS Imputation Linkage disequilibrium Misassembly |
title_short |
Revealing misassembled segments in the bovine reference genome by high resolution linkage disequilibrium scan |
title_full |
Revealing misassembled segments in the bovine reference genome by high resolution linkage disequilibrium scan |
title_fullStr |
Revealing misassembled segments in the bovine reference genome by high resolution linkage disequilibrium scan Revealing misassembled segments in the bovine reference genome by high resolution linkage disequilibrium scan |
title_full_unstemmed |
Revealing misassembled segments in the bovine reference genome by high resolution linkage disequilibrium scan Revealing misassembled segments in the bovine reference genome by high resolution linkage disequilibrium scan |
title_sort |
Revealing misassembled segments in the bovine reference genome by high resolution linkage disequilibrium scan |
author |
Utsunomiya, Adam T.H. [UNESP] |
author_facet |
Utsunomiya, Adam T.H. [UNESP] Utsunomiya, Adam T.H. [UNESP] Santos, Daniel J.A. [UNESP] Boison, Solomon A. Utsunomiya, Yuri T. [UNESP] Milanesi, Marco [UNESP] Bickhart, Derek M. Ajmone-Marsan, Paolo S�lkner, Johann Garcia, Jos� F. [UNESP] da Fonseca, Ricardo [UNESP] da Silva, Marcos V.G.B. Santos, Daniel J.A. [UNESP] Boison, Solomon A. Utsunomiya, Yuri T. [UNESP] Milanesi, Marco [UNESP] Bickhart, Derek M. Ajmone-Marsan, Paolo S�lkner, Johann Garcia, Jos� F. [UNESP] da Fonseca, Ricardo [UNESP] da Silva, Marcos V.G.B. |
author_role |
author |
author2 |
Santos, Daniel J.A. [UNESP] Boison, Solomon A. Utsunomiya, Yuri T. [UNESP] Milanesi, Marco [UNESP] Bickhart, Derek M. Ajmone-Marsan, Paolo S�lkner, Johann Garcia, Jos� F. [UNESP] da Fonseca, Ricardo [UNESP] da Silva, Marcos V.G.B. |
author2_role |
author author author author author author author author author author |
dc.contributor.none.fl_str_mv |
Universidade Estadual Paulista (Unesp) Nofima USDA Institute of Zootechnics and Biodiversity and Ancient DNA Research Center Nutrigenomics and Proteomics Research Center - PRONUTRIGEN Division of Livestock Sciences International Atomic Energy Agency (IAEA) Collaborating Centre on Animal Genomics and Bioinformatics Empresa Brasileira de Pesquisa Agropecuária (EMBRAPA) |
dc.contributor.author.fl_str_mv |
Utsunomiya, Adam T.H. [UNESP] Santos, Daniel J.A. [UNESP] Boison, Solomon A. Utsunomiya, Yuri T. [UNESP] Milanesi, Marco [UNESP] Bickhart, Derek M. Ajmone-Marsan, Paolo S�lkner, Johann Garcia, Jos� F. [UNESP] da Fonseca, Ricardo [UNESP] da Silva, Marcos V.G.B. |
dc.subject.por.fl_str_mv |
Bos indicus Bos taurus GWAS Imputation Linkage disequilibrium Misassembly |
topic |
Bos indicus Bos taurus GWAS Imputation Linkage disequilibrium Misassembly |
description |
Background: Misassembly signatures, created by shuffling the order of sequences while assembling a genome, can be detected by the unexpected behavior of marker linkage disequilibrium (LD) decay. We developed a heuristic process to identify misassembly signatures, applied it to the bovine reference genome assembly (UMDv3.1) and presented the consequences of misassemblies in two case studies. Results: We identified 2,906 single nucleotide polymorphism (SNP) markers presenting unexpected LD decay behavior in 626 putative misassembled contigs, which comprised less than 1 % of the whole genome. Although this represents a small fraction of the reference sequence, these poorly assembled segments can lead to severe implications to local genome context. For instance, we showed that one of the misassembled regions mapped to the POLL locus, which affected the annotation of positional candidate genes in a GWAS case study for polledness in Nellore (Bos indicus beef cattle). Additionally, we found that poorly performing markers in imputation mapped to putative misassembled regions, and that correction of marker positions based on LD was capable to recover imputation accuracy. Conclusions: This heuristic approach can be useful to cross validate reference assemblies and to filter out markers located at low confidence genomic regions before conducting downstream analyses. |
publishDate |
2016 |
dc.date.none.fl_str_mv |
2016-09-05 2018-12-11T17:29:31Z 2018-12-11T17:29:31Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://dx.doi.org/10.1186/s12864-016-3049-8 BMC Genomics, v. 17, n. 1, 2016. 1471-2164 http://hdl.handle.net/11449/178256 10.1186/s12864-016-3049-8 2-s2.0-84984993613 2-s2.0-84984993613.pdf 0308352373545558 0000-0002-1163-6296 |
url |
http://dx.doi.org/10.1186/s12864-016-3049-8 http://hdl.handle.net/11449/178256 |
identifier_str_mv |
BMC Genomics, v. 17, n. 1, 2016. 1471-2164 10.1186/s12864-016-3049-8 2-s2.0-84984993613 2-s2.0-84984993613.pdf 0308352373545558 0000-0002-1163-6296 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
BMC Genomics 2,110 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
Scopus reponame:Repositório Institucional da UNESP instname:Universidade Estadual Paulista (UNESP) instacron:UNESP |
instname_str |
Universidade Estadual Paulista (UNESP) |
instacron_str |
UNESP |
institution |
UNESP |
reponame_str |
Repositório Institucional da UNESP |
collection |
Repositório Institucional da UNESP |
repository.name.fl_str_mv |
Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP) |
repository.mail.fl_str_mv |
|
_version_ |
1822182273897725952 |
dc.identifier.doi.none.fl_str_mv |
10.1186/s12864-016-3049-8 |