Structure_threader: an improved method for automation and parallelization of programs structure, fastStructure and MavericK on multicore CPU systems

Bibliographic details
Main author: Pina-Martins, Francisco
Publication date: 2017
Other authors: Silva, Diogo N.; Fino, Joana; Paulo, Octávio S.
Document type: Article
Language: eng
Source title: Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos)
Full text: http://hdl.handle.net/10400.18/5464
Abstract: Structure_threader is a program that parallelizes multiple runs of genetic clustering software that does not itself use multithreading (structure, fastStructure and MavericK) on multicore computers. Our approach was benchmarked across multiple systems and displayed great speed improvements relative to the single-threaded implementation, scaling very close to linearly with the number of physical cores used. Structure_threader was compared to previous software written for the same task, ParallelStructure and StrAuto, and proved to be the fastest wrapper (up to 25% faster) under all tested scenarios. Furthermore, Structure_threader can perform several automatic and convenient operations: it assists the user in assessing the most biologically likely value of 'K' via implementations such as the "Evanno" or "Thermodynamic Integration" tests, and it automatically draws the "meanQ" plots (static or interactive) for each value of K (or even combined plots). Structure_threader is written in Python 3 and licensed under the GPLv3. It can be downloaded free of charge at https://github.com/StuntsPT/Structure_threader.
id RCAP_c48bd9260d2f286a83ef9e8362919c94
oai_identifier_str oai:repositorio.insa.pt:10400.18/5464
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos)
repository_id_str 7160
Funding: This study was financed by Portuguese National Funds, through FCT (Fundação para a Ciência e a Tecnologia), within the projects UID/BIA/00329/2013, SFRH/BD/51411/2011 and SFRH/BD/86736/2012.
dc.title.none.fl_str_mv Structure_threader: an improved method for automation and parallelization of programs structure, fastStructure and MavericK on multicore CPU systems
dc.contributor.none.fl_str_mv Repositório Científico do Instituto Nacional de Saúde
dc.contributor.author.fl_str_mv Pina-Martins, Francisco
Silva, Diogo N.
Fino, Joana
Paulo, Octávio S.
dc.subject.por.fl_str_mv Bioinformatics/phyloinformatics
Clustering
Empirical
Genomics/proteomics
Molecular Evolution
Parallel Computing
Population Genetics
Genetic Diseases
publishDate 2017
dc.date.none.fl_str_mv 2017-11
2017-11-01T00:00:00Z
2018-03-22T19:39:24Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10400.18/5464
url http://hdl.handle.net/10400.18/5464
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Mol Ecol Resour. 2017 Nov;17(6):e268-e274. doi: 10.1111/1755-0998.12702. Epub 2017 Sep 16.
1755-098X
10.1111/1755-0998.12702
dc.rights.driver.fl_str_mv info:eu-repo/semantics/embargoedAccess
eu_rights_str_mv embargoedAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Wiley
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP