Unsupervised detection of ancestry tracks with the GHap r package

Detalhes bibliográficos
Autor(a) principal: Utsunomiya, Yuri Tani [UNESP]
Data de Publicação: 2020
Outros Autores: Milanesi, Marco [UNESP], Barbato, Mario, Utsunomiya, Adam Taiti Harth [UNESP], Sölkner, Johann, Ajmone-Marsan, Paolo, Garcia, José Fernando [UNESP]
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Institucional da UNESP
Texto Completo: http://dx.doi.org/10.1111/2041-210X.13467
http://hdl.handle.net/11449/201006
Resumo: The identification of ancestry tracks is a powerful tool to assist the inference of evolutionary events in the genomes of animals and plants. However, algorithms for ancestry track detection typically require labelled reference population data. This dependency prevents the analysis of genomic data lacking prior information on genetic structure, and may produce classification bias when samples in the reference data are inadvertently admixed. We combined heuristics with K-means clustering to deploy a method that can detect ancestry tracks without the provision of lineage labels for reference population data. The resulting algorithm uses phased genotypes to infer individual ancestry proportions and local ancestry. By piling up ancestry tracks across individuals, our method also allows for mapping loci with excess or deficit ancestry from specific lineages. Using both simulated and real genomic data, we found that the proposed method was accurate in inferring genetic structure, assigning chromosomal segments to lineages and estimating individual ancestry, especially in cases where ancestry tracks resulted from recent admixture of highly divergent lineages. The method is implemented as part of the v2 release of the GHap r package (available at https://cran.r-project.org/package=GHap and https://bitbucket.org/marcomilanesi/ghap/src/master/).
id UNSP_f863ed4bbc5488d43925833cd0dc98e8
oai_identifier_str oai:repositorio.unesp.br:11449/201006
network_acronym_str UNSP
network_name_str Repositório Institucional da UNESP
repository_id_str 2946
spelling Unsupervised detection of ancestry tracks with the GHap r packageadmixturechromosome paintingpopulation structuresingle-nucleotide polymorphismThe identification of ancestry tracks is a powerful tool to assist the inference of evolutionary events in the genomes of animals and plants. However, algorithms for ancestry track detection typically require labelled reference population data. This dependency prevents the analysis of genomic data lacking prior information on genetic structure, and may produce classification bias when samples in the reference data are inadvertently admixed. We combined heuristics with K-means clustering to deploy a method that can detect ancestry tracks without the provision of lineage labels for reference population data. The resulting algorithm uses phased genotypes to infer individual ancestry proportions and local ancestry. By piling up ancestry tracks across individuals, our method also allows for mapping loci with excess or deficit ancestry from specific lineages. Using both simulated and real genomic data, we found that the proposed method was accurate in inferring genetic structure, assigning chromosomal segments to lineages and estimating individual ancestry, especially in cases where ancestry tracks resulted from recent admixture of highly divergent lineages. The method is implemented as part of the v2 release of the GHap r package (available at https://cran.r-project.org/package=GHap and https://bitbucket.org/marcomilanesi/ghap/src/master/).Department of Support Production and Animal Health School of Veterinary Medicine of Araçatuba São Paulo State University (Unesp)International Atomic Energy Agency (IAEA) Collaborating Centre on Animal Genomics and BioinformaticsDepartment of Animal Science Food and Nutrition—DIANA and Nutrigenomics and Proteomics Research Center Università Cattolica del Sacro CuoreDivision of Livestook Sciences Department of Sustainable Agriculture System BOKU—University of Natural Resources and Life SciencesDepartment of Preventive Veterinary Medicine and Animal Reproduction School of Agricultural and Veterinarian Sciences São Paulo State University (Unesp)Department of Support Production and Animal Health School of Veterinary Medicine of Araçatuba São Paulo State University (Unesp)Department of Preventive Veterinary Medicine and Animal Reproduction School of Agricultural and Veterinarian Sciences São Paulo State University (Unesp)Universidade Estadual Paulista (Unesp)International Atomic Energy Agency (IAEA) Collaborating Centre on Animal Genomics and BioinformaticsUniversità Cattolica del Sacro CuoreBOKU—University of Natural Resources and Life SciencesUtsunomiya, Yuri Tani [UNESP]Milanesi, Marco [UNESP]Barbato, MarioUtsunomiya, Adam Taiti Harth [UNESP]Sölkner, JohannAjmone-Marsan, PaoloGarcia, José Fernando [UNESP]2020-12-12T02:21:44Z2020-12-12T02:21:44Z2020-01-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://dx.doi.org/10.1111/2041-210X.13467Methods in Ecology and Evolution.2041-210Xhttp://hdl.handle.net/11449/20100610.1111/2041-210X.134672-s2.0-85090309413Scopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengMethods in Ecology and Evolutioninfo:eu-repo/semantics/openAccess2024-09-04T19:14:55Zoai:repositorio.unesp.br:11449/201006Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestrepositoriounesp@unesp.bropendoar:29462024-09-04T19:14:55Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false
dc.title.none.fl_str_mv Unsupervised detection of ancestry tracks with the GHap r package
title Unsupervised detection of ancestry tracks with the GHap r package
spellingShingle Unsupervised detection of ancestry tracks with the GHap r package
Utsunomiya, Yuri Tani [UNESP]
admixture
chromosome painting
population structure
single-nucleotide polymorphism
title_short Unsupervised detection of ancestry tracks with the GHap r package
title_full Unsupervised detection of ancestry tracks with the GHap r package
title_fullStr Unsupervised detection of ancestry tracks with the GHap r package
title_full_unstemmed Unsupervised detection of ancestry tracks with the GHap r package
title_sort Unsupervised detection of ancestry tracks with the GHap r package
author Utsunomiya, Yuri Tani [UNESP]
author_facet Utsunomiya, Yuri Tani [UNESP]
Milanesi, Marco [UNESP]
Barbato, Mario
Utsunomiya, Adam Taiti Harth [UNESP]
Sölkner, Johann
Ajmone-Marsan, Paolo
Garcia, José Fernando [UNESP]
author_role author
author2 Milanesi, Marco [UNESP]
Barbato, Mario
Utsunomiya, Adam Taiti Harth [UNESP]
Sölkner, Johann
Ajmone-Marsan, Paolo
Garcia, José Fernando [UNESP]
author2_role author
author
author
author
author
author
dc.contributor.none.fl_str_mv Universidade Estadual Paulista (Unesp)
International Atomic Energy Agency (IAEA) Collaborating Centre on Animal Genomics and Bioinformatics
Università Cattolica del Sacro Cuore
BOKU—University of Natural Resources and Life Sciences
dc.contributor.author.fl_str_mv Utsunomiya, Yuri Tani [UNESP]
Milanesi, Marco [UNESP]
Barbato, Mario
Utsunomiya, Adam Taiti Harth [UNESP]
Sölkner, Johann
Ajmone-Marsan, Paolo
Garcia, José Fernando [UNESP]
dc.subject.por.fl_str_mv admixture
chromosome painting
population structure
single-nucleotide polymorphism
topic admixture
chromosome painting
population structure
single-nucleotide polymorphism
description The identification of ancestry tracks is a powerful tool to assist the inference of evolutionary events in the genomes of animals and plants. However, algorithms for ancestry track detection typically require labelled reference population data. This dependency prevents the analysis of genomic data lacking prior information on genetic structure, and may produce classification bias when samples in the reference data are inadvertently admixed. We combined heuristics with K-means clustering to deploy a method that can detect ancestry tracks without the provision of lineage labels for reference population data. The resulting algorithm uses phased genotypes to infer individual ancestry proportions and local ancestry. By piling up ancestry tracks across individuals, our method also allows for mapping loci with excess or deficit ancestry from specific lineages. Using both simulated and real genomic data, we found that the proposed method was accurate in inferring genetic structure, assigning chromosomal segments to lineages and estimating individual ancestry, especially in cases where ancestry tracks resulted from recent admixture of highly divergent lineages. The method is implemented as part of the v2 release of the GHap r package (available at https://cran.r-project.org/package=GHap and https://bitbucket.org/marcomilanesi/ghap/src/master/).
publishDate 2020
dc.date.none.fl_str_mv 2020-12-12T02:21:44Z
2020-12-12T02:21:44Z
2020-01-01
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://dx.doi.org/10.1111/2041-210X.13467
Methods in Ecology and Evolution.
2041-210X
http://hdl.handle.net/11449/201006
10.1111/2041-210X.13467
2-s2.0-85090309413
url http://dx.doi.org/10.1111/2041-210X.13467
http://hdl.handle.net/11449/201006
identifier_str_mv Methods in Ecology and Evolution.
2041-210X
10.1111/2041-210X.13467
2-s2.0-85090309413
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Methods in Ecology and Evolution
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.source.none.fl_str_mv Scopus
reponame:Repositório Institucional da UNESP
instname:Universidade Estadual Paulista (UNESP)
instacron:UNESP
instname_str Universidade Estadual Paulista (UNESP)
instacron_str UNESP
institution UNESP
reponame_str Repositório Institucional da UNESP
collection Repositório Institucional da UNESP
repository.name.fl_str_mv Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)
repository.mail.fl_str_mv repositoriounesp@unesp.br
_version_ 1810021360958701568