GATOOL - Genome Assembly Tool: uma ferramenta web para montagem de genomas bacterianos

Detalhes bibliográficos
Autor(a) principal: Oliveira, Matheus Brito de
Data de Publicação: 2017
Tipo de documento: Dissertação
Idioma: por
Título da fonte: Biblioteca Digital de Teses e Dissertações da UEFS
Texto Completo: http://localhost:8080/tede/handle/tede/513
Resumo: The assembly of bacterial genomes consists of a process of reordering fragments so that the original genome can be represented. However, to maximize the results of genome assembly, some steps are required, for instance, read quality analysis and preprocessing, repetition identification and quality check. The process of assembly of genomes is a complex step that involves the type of sequencing that was used, there are several types of sequencers which imply different characteristics for each one for example: fragments size, throughput, among others. Analyzing these characteristics requires the use of several computational tools, to assist in all the processes mentioned above, and since the range of software available is quite broad and distinct, it is necessary for the user to learn to work with this computational diversity, dominating often knowledge that is not of the biological area, implying in less time for a deepening in biological questions. Based on this context, we developed a pipeline to perform an automated fragment analysis, read preprocessing, genome assembly and orientation of contigs, having as the assembly the main objective of the pipeline and that it will be managed by a Web application called GATOOL (Genome Assembly Tool). Aiming to evaluate the performance of the application, tests were carried out with two samples of prokaryotic organisms, which are: Bacillus amyloliquefaciens and Serratia marcescens. Also perform a test with seven SRA samples. Both organisms are sequenced on the Ion PGMTM platform. The tools used to perform the assembly were SPAdes and Velvet, both assemblers use de Bruijn graph algorithm as a paradigm for the assembly of the genome, after this stage the resulting set of contigs was ordered through the CONTIGuator, which is a reference ordering. We observed that the interface GATOOL allowed a quick and easy execution of several steps and processes in the field of genome assembly, including the assembly of two prokaryotic species in an automated way, thus facilitating the use and accomplishment of such processes by any user.
id UEFS_4fb9892e0342322fa4fa9ef63c1a663d
oai_identifier_str oai:tede2.uefs.br:8080:tede/513
network_acronym_str UEFS
network_name_str Biblioteca Digital de Teses e Dissertações da UEFS
repository_id_str
spelling Queiróz, Artur Trancoso Lopo01493806580http://lattes.cnpq.br/0008785408235675Oliveira, Matheus Brito de2017-10-09T22:34:41Z2017-06-12OLIVEIRA, Matheus Brito de. GATOOL - Genome Assembly Tool: uma ferramenta web para montagem de genomas bacterianos. 2017. 95 f. Dissertação (Mestrado em Computação Aplicada)- Universidade Estadual de Feira de Santana, Feira de Santana, 2017.http://localhost:8080/tede/handle/tede/513The assembly of bacterial genomes consists of a process of reordering fragments so that the original genome can be represented. However, to maximize the results of genome assembly, some steps are required, for instance, read quality analysis and preprocessing, repetition identification and quality check. The process of assembly of genomes is a complex step that involves the type of sequencing that was used, there are several types of sequencers which imply different characteristics for each one for example: fragments size, throughput, among others. Analyzing these characteristics requires the use of several computational tools, to assist in all the processes mentioned above, and since the range of software available is quite broad and distinct, it is necessary for the user to learn to work with this computational diversity, dominating often knowledge that is not of the biological area, implying in less time for a deepening in biological questions. Based on this context, we developed a pipeline to perform an automated fragment analysis, read preprocessing, genome assembly and orientation of contigs, having as the assembly the main objective of the pipeline and that it will be managed by a Web application called GATOOL (Genome Assembly Tool). Aiming to evaluate the performance of the application, tests were carried out with two samples of prokaryotic organisms, which are: Bacillus amyloliquefaciens and Serratia marcescens. Also perform a test with seven SRA samples. Both organisms are sequenced on the Ion PGMTM platform. The tools used to perform the assembly were SPAdes and Velvet, both assemblers use de Bruijn graph algorithm as a paradigm for the assembly of the genome, after this stage the resulting set of contigs was ordered through the CONTIGuator, which is a reference ordering. We observed that the interface GATOOL allowed a quick and easy execution of several steps and processes in the field of genome assembly, including the assembly of two prokaryotic species in an automated way, thus facilitating the use and accomplishment of such processes by any user.A montagem de genomas bacterianos é um processo de reordenação de fragmentos, de forma que se possa representar o genoma original. Entretanto, para que a montagem de um genoma seja realizada visando maximizar os resultados, é preciso que algumas etapas sejam cumpridas, por exemplo: a análise dos fragmentos, o pré-processamento destes fragmentos e novamente uma repetição do processo de análise, para verificar a eficácia do pré-processamento realizado. O processo de montagem de genomas é uma etapa complexa, que envolve o tipo de sequenciamento que foi utilizado. Existem diversos tipos de sequenciadores, o que implica características distintas em cada um, como por exemplo: tamanho dos fragmentos, quantidade de fragmentos gerados por corrida, dentre outros. Analisando essas características, faz-se necessária a utilização de diversas ferramentas computacionais para auxiliar a todos os processos citados anteriormente e, como a gama de softwares disponíveis é bem ampla e distinta, é importante que o usuário domine essa diversidade computacional, contendo muitas vezes conhecimentos que não são da área biológica, implicando menos tempo para um aprofundamento das questões biológicas. Com base neste contexto, propõem-se um pipeline para a realização da análise de fragmentos, pré-processamento dos fragmentos, montagem de genomas e orientação de contigs, tendo como a montagem o objetivo principal do pipeline e este será gerenciado por uma aplicação web chamada GATOOL (Genome Assembly Tool). Visando avaliar o desempenho da aplicação, foram feitos testes com duas amostras de organismos procariontes, que são: Bacillus amyloliquefaciens e Serratia marcescens. Também foram realizados testes com sete amostras SRA. Ambos os organismos estão sequenciados na plataforma Ion PGMTM. Os montadores usados foram o SPAdes e o Velvet, ambos montadores, utilizam o algorítmo grafo de Bruijn como paradigma para a montagem do genoma; após esta etapa, o conjunto de contigs resultante foi ordenado através do CONTIGuator, que é uma ordenação por referência. Observamos que a interface GATOOL permitiu uma execução rápida e fácil de diversas etapas e processos no campo da montagem de genomas, inclusive realizando a montagem de duas espécies procariontes de maneira automatizada, facilitando assim a utilização e realização de tais processos por qualquer usuário.Submitted by Ricardo Cedraz Duque Moliterno (ricardo.moliterno@uefs.br) on 2017-10-09T22:34:41Z No. of bitstreams: 1 MATHUES BRITO DE OLIVEIRA Dissertaçãov.pdf: 5287293 bytes, checksum: 8d3e3b854b5799f16c0b61b6a5d33f1c (MD5)Made available in DSpace on 2017-10-09T22:34:41Z (GMT). No. of bitstreams: 1 MATHUES BRITO DE OLIVEIRA Dissertaçãov.pdf: 5287293 bytes, checksum: 8d3e3b854b5799f16c0b61b6a5d33f1c (MD5) Previous issue date: 2017-06-12application/pdfporUniversidade Estadual de Feira de SantanaMestrado em Computação AplicadaUEFSBrasilDEPARTAMENTO DE CIÊNCIAS EXATASGenome assemblyBacterialNGSPipelineMontagem de genomaBactériaCIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAOGATOOL - Genome Assembly Tool: uma ferramenta web para montagem de genomas bacterianosinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesis303317282311144204600600600-54868328166115062113671711205811204509info:eu-repo/semantics/openAccessreponame:Biblioteca Digital de Teses e Dissertações da UEFSinstname:Universidade Estadual de Feira de Santana (UEFS)instacron:UEFSORIGINALMATHUES BRITO DE OLIVEIRA Dissertaçãov.pdfMATHUES BRITO DE OLIVEIRA Dissertaçãov.pdfapplication/pdf5287293http://tede2.uefs.br:8080/bitstream/tede/513/2/MATHUES+BRITO+DE+OLIVEIRA+Disserta%C3%A7%C3%A3ov.pdf8d3e3b854b5799f16c0b61b6a5d33f1cMD52LICENSElicense.txtlicense.txttext/plain; charset=utf-82089http://tede2.uefs.br:8080/bitstream/tede/513/1/license.txt7b5ba3d2445355f386edab96125d42b7MD51tede/5132017-10-09 19:34:41.181oai:tede2.uefs.br:8080:tede/513Tk9UQTogQ09MT1FVRSBBUVVJIEEgU1VBIFBSP1BSSUEgTElDRU4/QQpFc3RhIGxpY2VuP2EgZGUgZXhlbXBsbyA/IGZvcm5lY2lkYSBhcGVuYXMgcGFyYSBmaW5zIGluZm9ybWF0aXZvcy4KCkxJQ0VOP0EgREUgRElTVFJJQlVJPz9PIE4/Ty1FWENMVVNJVkEKCkNvbSBhIGFwcmVzZW50YT8/byBkZXN0YSBsaWNlbj9hLCB2b2M/IChvIGF1dG9yIChlcykgb3UgbyB0aXR1bGFyIGRvcyBkaXJlaXRvcyBkZSBhdXRvcikgY29uY2VkZSA/IFVuaXZlcnNpZGFkZSAKWFhYIChTaWdsYSBkYSBVbml2ZXJzaWRhZGUpIG8gZGlyZWl0byBuP28tZXhjbHVzaXZvIGRlIHJlcHJvZHV6aXIsICB0cmFkdXppciAoY29uZm9ybWUgZGVmaW5pZG8gYWJhaXhvKSwgZS9vdSAKZGlzdHJpYnVpciBhIHN1YSB0ZXNlIG91IGRpc3NlcnRhPz9vIChpbmNsdWluZG8gbyByZXN1bW8pIHBvciB0b2RvIG8gbXVuZG8gbm8gZm9ybWF0byBpbXByZXNzbyBlIGVsZXRyP25pY28gZSAKZW0gcXVhbHF1ZXIgbWVpbywgaW5jbHVpbmRvIG9zIGZvcm1hdG9zID91ZGlvIG91IHY/ZGVvLgoKVm9jPyBjb25jb3JkYSBxdWUgYSBTaWdsYSBkZSBVbml2ZXJzaWRhZGUgcG9kZSwgc2VtIGFsdGVyYXIgbyBjb250ZT9kbywgdHJhbnNwb3IgYSBzdWEgdGVzZSBvdSBkaXNzZXJ0YT8/byAKcGFyYSBxdWFscXVlciBtZWlvIG91IGZvcm1hdG8gcGFyYSBmaW5zIGRlIHByZXNlcnZhPz9vLgoKVm9jPyB0YW1iP20gY29uY29yZGEgcXVlIGEgU2lnbGEgZGUgVW5pdmVyc2lkYWRlIHBvZGUgbWFudGVyIG1haXMgZGUgdW1hIGM/cGlhIGEgc3VhIHRlc2Ugb3UgCmRpc3NlcnRhPz9vIHBhcmEgZmlucyBkZSBzZWd1cmFuP2EsIGJhY2stdXAgZSBwcmVzZXJ2YT8/by4KClZvYz8gZGVjbGFyYSBxdWUgYSBzdWEgdGVzZSBvdSBkaXNzZXJ0YT8/byA/IG9yaWdpbmFsIGUgcXVlIHZvYz8gdGVtIG8gcG9kZXIgZGUgY29uY2VkZXIgb3MgZGlyZWl0b3MgY29udGlkb3MgCm5lc3RhIGxpY2VuP2EuIFZvYz8gdGFtYj9tIGRlY2xhcmEgcXVlIG8gZGVwP3NpdG8gZGEgc3VhIHRlc2Ugb3UgZGlzc2VydGE/P28gbj9vLCBxdWUgc2VqYSBkZSBzZXUgCmNvbmhlY2ltZW50bywgaW5mcmluZ2UgZGlyZWl0b3MgYXV0b3JhaXMgZGUgbmluZ3U/bS4KCkNhc28gYSBzdWEgdGVzZSBvdSBkaXNzZXJ0YT8/byBjb250ZW5oYSBtYXRlcmlhbCBxdWUgdm9jPyBuP28gcG9zc3VpIGEgdGl0dWxhcmlkYWRlIGRvcyBkaXJlaXRvcyBhdXRvcmFpcywgdm9jPyAKZGVjbGFyYSBxdWUgb2J0ZXZlIGEgcGVybWlzcz9vIGlycmVzdHJpdGEgZG8gZGV0ZW50b3IgZG9zIGRpcmVpdG9zIGF1dG9yYWlzIHBhcmEgY29uY2VkZXIgPyBTaWdsYSBkZSBVbml2ZXJzaWRhZGUgCm9zIGRpcmVpdG9zIGFwcmVzZW50YWRvcyBuZXN0YSBsaWNlbj9hLCBlIHF1ZSBlc3NlIG1hdGVyaWFsIGRlIHByb3ByaWVkYWRlIGRlIHRlcmNlaXJvcyBlc3Q/IGNsYXJhbWVudGUgCmlkZW50aWZpY2FkbyBlIHJlY29uaGVjaWRvIG5vIHRleHRvIG91IG5vIGNvbnRlP2RvIGRhIHRlc2Ugb3UgZGlzc2VydGE/P28gb3JhIGRlcG9zaXRhZGEuCgpDQVNPIEEgVEVTRSBPVSBESVNTRVJUQT8/TyBPUkEgREVQT1NJVEFEQSBURU5IQSBTSURPIFJFU1VMVEFETyBERSBVTSBQQVRST0M/TklPIE9VIApBUE9JTyBERSBVTUEgQUc/TkNJQSBERSBGT01FTlRPIE9VIE9VVFJPIE9SR0FOSVNNTyBRVUUgTj9PIFNFSkEgQSBTSUdMQSBERSAKVU5JVkVSU0lEQURFLCBWT0M/IERFQ0xBUkEgUVVFIFJFU1BFSVRPVSBUT0RPUyBFIFFVQUlTUVVFUiBESVJFSVRPUyBERSBSRVZJUz9PIENPTU8gClRBTUI/TSBBUyBERU1BSVMgT0JSSUdBPz9FUyBFWElHSURBUyBQT1IgQ09OVFJBVE8gT1UgQUNPUkRPLgoKQSBTaWdsYSBkZSBVbml2ZXJzaWRhZGUgc2UgY29tcHJvbWV0ZSBhIGlkZW50aWZpY2FyIGNsYXJhbWVudGUgbyBzZXUgbm9tZSAocykgb3UgbyhzKSBub21lKHMpIGRvKHMpIApkZXRlbnRvcihlcykgZG9zIGRpcmVpdG9zIGF1dG9yYWlzIGRhIHRlc2Ugb3UgZGlzc2VydGE/P28sIGUgbj9vIGZhcj8gcXVhbHF1ZXIgYWx0ZXJhPz9vLCBhbD9tIGRhcXVlbGFzIApjb25jZWRpZGFzIHBvciBlc3RhIGxpY2VuP2EuCg==Biblioteca Digital de Teses e Dissertaçõeshttp://tede2.uefs.br:8080/PUBhttp://tede2.uefs.br:8080/oai/requestbcuefs@uefs.br|| bcref@uefs.br||bcuefs@uefs.bropendoar:2017-10-09T22:34:41Biblioteca Digital de Teses e Dissertações da UEFS - Universidade Estadual de Feira de Santana (UEFS)false
dc.title.por.fl_str_mv GATOOL - Genome Assembly Tool: uma ferramenta web para montagem de genomas bacterianos
title GATOOL - Genome Assembly Tool: uma ferramenta web para montagem de genomas bacterianos
spellingShingle GATOOL - Genome Assembly Tool: uma ferramenta web para montagem de genomas bacterianos
Oliveira, Matheus Brito de
Genome assembly
Bacterial
NGS
Pipeline
Montagem de genoma
Bactéria
CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
title_short GATOOL - Genome Assembly Tool: uma ferramenta web para montagem de genomas bacterianos
title_full GATOOL - Genome Assembly Tool: uma ferramenta web para montagem de genomas bacterianos
title_fullStr GATOOL - Genome Assembly Tool: uma ferramenta web para montagem de genomas bacterianos
title_full_unstemmed GATOOL - Genome Assembly Tool: uma ferramenta web para montagem de genomas bacterianos
title_sort GATOOL - Genome Assembly Tool: uma ferramenta web para montagem de genomas bacterianos
author Oliveira, Matheus Brito de
author_facet Oliveira, Matheus Brito de
author_role author
dc.contributor.advisor1.fl_str_mv Queiróz, Artur Trancoso Lopo
dc.contributor.authorID.fl_str_mv 01493806580
dc.contributor.authorLattes.fl_str_mv http://lattes.cnpq.br/0008785408235675
dc.contributor.author.fl_str_mv Oliveira, Matheus Brito de
contributor_str_mv Queiróz, Artur Trancoso Lopo
dc.subject.eng.fl_str_mv Genome assembly
Bacterial
topic Genome assembly
Bacterial
NGS
Pipeline
Montagem de genoma
Bactéria
CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
dc.subject.por.fl_str_mv NGS
Pipeline
Montagem de genoma
Bactéria
dc.subject.cnpq.fl_str_mv CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
description The assembly of bacterial genomes consists of a process of reordering fragments so that the original genome can be represented. However, to maximize the results of genome assembly, some steps are required, for instance, read quality analysis and preprocessing, repetition identification and quality check. The process of assembly of genomes is a complex step that involves the type of sequencing that was used, there are several types of sequencers which imply different characteristics for each one for example: fragments size, throughput, among others. Analyzing these characteristics requires the use of several computational tools, to assist in all the processes mentioned above, and since the range of software available is quite broad and distinct, it is necessary for the user to learn to work with this computational diversity, dominating often knowledge that is not of the biological area, implying in less time for a deepening in biological questions. Based on this context, we developed a pipeline to perform an automated fragment analysis, read preprocessing, genome assembly and orientation of contigs, having as the assembly the main objective of the pipeline and that it will be managed by a Web application called GATOOL (Genome Assembly Tool). Aiming to evaluate the performance of the application, tests were carried out with two samples of prokaryotic organisms, which are: Bacillus amyloliquefaciens and Serratia marcescens. Also perform a test with seven SRA samples. Both organisms are sequenced on the Ion PGMTM platform. The tools used to perform the assembly were SPAdes and Velvet, both assemblers use de Bruijn graph algorithm as a paradigm for the assembly of the genome, after this stage the resulting set of contigs was ordered through the CONTIGuator, which is a reference ordering. We observed that the interface GATOOL allowed a quick and easy execution of several steps and processes in the field of genome assembly, including the assembly of two prokaryotic species in an automated way, thus facilitating the use and accomplishment of such processes by any user.
publishDate 2017
dc.date.accessioned.fl_str_mv 2017-10-09T22:34:41Z
dc.date.issued.fl_str_mv 2017-06-12
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.citation.fl_str_mv OLIVEIRA, Matheus Brito de. GATOOL - Genome Assembly Tool: uma ferramenta web para montagem de genomas bacterianos. 2017. 95 f. Dissertação (Mestrado em Computação Aplicada)- Universidade Estadual de Feira de Santana, Feira de Santana, 2017.
dc.identifier.uri.fl_str_mv http://localhost:8080/tede/handle/tede/513
identifier_str_mv OLIVEIRA, Matheus Brito de. GATOOL - Genome Assembly Tool: uma ferramenta web para montagem de genomas bacterianos. 2017. 95 f. Dissertação (Mestrado em Computação Aplicada)- Universidade Estadual de Feira de Santana, Feira de Santana, 2017.
url http://localhost:8080/tede/handle/tede/513
dc.language.iso.fl_str_mv por
language por
dc.relation.program.fl_str_mv 303317282311144204
dc.relation.confidence.fl_str_mv 600
600
600
dc.relation.department.fl_str_mv -5486832816611506211
dc.relation.cnpq.fl_str_mv 3671711205811204509
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Universidade Estadual de Feira de Santana
dc.publisher.program.fl_str_mv Mestrado em Computação Aplicada
dc.publisher.initials.fl_str_mv UEFS
dc.publisher.country.fl_str_mv Brasil
dc.publisher.department.fl_str_mv DEPARTAMENTO DE CIÊNCIAS EXATAS
publisher.none.fl_str_mv Universidade Estadual de Feira de Santana
dc.source.none.fl_str_mv reponame:Biblioteca Digital de Teses e Dissertações da UEFS
instname:Universidade Estadual de Feira de Santana (UEFS)
instacron:UEFS
instname_str Universidade Estadual de Feira de Santana (UEFS)
instacron_str UEFS
institution UEFS
reponame_str Biblioteca Digital de Teses e Dissertações da UEFS
collection Biblioteca Digital de Teses e Dissertações da UEFS
bitstream.url.fl_str_mv http://tede2.uefs.br:8080/bitstream/tede/513/2/MATHUES+BRITO+DE+OLIVEIRA+Disserta%C3%A7%C3%A3ov.pdf
http://tede2.uefs.br:8080/bitstream/tede/513/1/license.txt
bitstream.checksum.fl_str_mv 8d3e3b854b5799f16c0b61b6a5d33f1c
7b5ba3d2445355f386edab96125d42b7
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
repository.name.fl_str_mv Biblioteca Digital de Teses e Dissertações da UEFS - Universidade Estadual de Feira de Santana (UEFS)
repository.mail.fl_str_mv bcuefs@uefs.br|| bcref@uefs.br||bcuefs@uefs.br
_version_ 1809288772693524480