TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes

Detalhes bibliográficos
Autor(a) principal: Oliveira, Mauro de Medeiros
Data de Publicação: 2021
Outros Autores: Bonadio, Igor, Melo, Alicia Lie de, Souza, Glaucia Mendes, Durham, Alan Mitchell
Tipo de documento: Artigo
Idioma: por
Título da fonte: Repositório Institucional da FIOCRUZ (ARCA)
Texto Completo: https://www.arca.fiocruz.br/handle/icict/47751
Resumo: Fundação Oswaldo Cruz. Instituto Carlos Chagas. Laboratório de Proteômica Estrutural e Computacional. Curitiba, PR, Brasil.
id CRUZ_3752eb5f58ec48655afffca60e56600f
oai_identifier_str oai:www.arca.fiocruz.br:icict/47751
network_acronym_str CRUZ
network_name_str Repositório Institucional da FIOCRUZ (ARCA)
repository_id_str 2135
spelling Oliveira, Mauro de MedeirosBonadio, IgorMelo, Alicia Lie deSouza, Glaucia MendesDurham, Alan Mitchell2021-06-17T17:55:30Z2021-06-17T17:55:30Z2021OLIVEIRA, Mauro de Medeiros et al. TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes. Briefings in Bioinformatics, p. 1–12, 2021.1477-4054https://www.arca.fiocruz.br/handle/icict/4775110.1093/bib/bbab198porOxford University PressTSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomesinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleFundação Oswaldo Cruz. Instituto Carlos Chagas. Laboratório de Proteômica Estrutural e Computacional. Curitiba, PR, Brasil.Technology Company Elo7. São Paulo, SP, Brasil.Universidade de São Paulo. São Paulo, SP, Brasil.Universidade de São Paulo. Instituto de Química. São Paulo, SP, Brasil.Universidade de São Paulo. São Paulo, SP, Brasil.Promoter annotation is an important task in the analysis of a genome. One of the main challenges for this task is locating the border between the promoter region and the transcribing region of the gene, the transcription start site (TSS). The TSS is the reference point to delimit the DNA sequence responsible for the assembly of the transcribing complex. As the same gene can have more than one TSS, so to delimit the promoter region, it is important to locate the closest TSS to the site of the beginning of the translation. This paper presents TSSFinder, a new software for the prediction of the TSS signal of eukaryotic genes that is significantly more accurate than other available software.We currently are the only application to offer pre-trained models for six different eukaryotic organisms: Arabidopsis thaliana, Drosophila melanogaster, Gallus gallus, Homo sapiens, Oryza sativa and Saccharomyces cerevisiae. Additionally, our software can be easily customized for specific organisms using only 125 DNA sequences with a validated TSS signal and corresponding genomic locations as a training set. TSSFinder is a valuable new tool for the annotation of genomes. TSSFinder source code and docker container can be downloaded from http://tssfinder.github.io. Alternatively, TSSFinder is also available as a web service at http://sucest-fun.org/wsapp/tssfinder/.Transcription Initiation SitePromoter Regions, Geneticannotation of genomesGenomicsconditional random fieldsSitio de Iniciación de la TranscripciónRegiones Promotoras GenéticasGenómicaSite d'initiation de la transcriptionRégions promotrices (génétique)GénomiqueSítio de Iniciação de TranscriçãoRegiões Promotoras GenéticasGenômicainfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da FIOCRUZ (ARCA)instname:Fundação Oswaldo Cruz (FIOCRUZ)instacron:FIOCRUZLICENSElicense.txtlicense.txttext/plain; charset=utf-83084https://www.arca.fiocruz.br/bitstream/icict/47751/1/license.txt783568c2893d2e25a99990b126be1772MD51ORIGINALbbab19ok.pdfbbab19ok.pdfapplication/pdf523885https://www.arca.fiocruz.br/bitstream/icict/47751/2/bbab19ok.pdf6d00c547e455d9df4b4b4d622557ed14MD52TEXTbbab19ok.pdf.txtbbab19ok.pdf.txtExtracted texttext/plain70434https://www.arca.fiocruz.br/bitstream/icict/47751/3/bbab19ok.pdf.txtd183959227d2e8754bfe500433d0d055MD53icict/477512021-06-18 02:01:07.862oai:www.arca.fiocruz.br:icict/47751Q0VTU8ODTyBOw4NPIEVYQ0xVU0lWQSBERSBESVJFSVRPUyBBVVRPUkFJUw0KDQpNYW5vZWwgQmFyYXRhLCBDUEY6IDA3MC43NjQuMzM3LTYxLCB2aW5jdWxhZG8gYSBGaW9jcnV6IFBhcmFuw6EgLSBJbnN0aXR1dG8gQ2FybG9zIENoYWdhcwoKQW8gYWNlaXRhciBvcyBURVJNT1MgZSBDT05EScOHw5VFUyBkZXN0YSBDRVNTw4NPLCBvIEFVVE9SIGUvb3UgVElUVUxBUiBkZSBkaXJlaXRvcwphdXRvcmFpcyBzb2JyZSBhIE9CUkEgZGUgcXVlIHRyYXRhIGVzdGUgZG9jdW1lbnRvOgoKKDEpIENFREUgZSBUUkFOU0ZFUkUsIHRvdGFsIGUgZ3JhdHVpdGFtZW50ZSwgw6AgRklPQ1JVWiAtIEZVTkRBw4fDg08gT1NXQUxETyBDUlVaLCBlbQpjYXLDoXRlciBwZXJtYW5lbnRlLCBpcnJldm9nw6F2ZWwgZSBOw4NPIEVYQ0xVU0lWTywgdG9kb3Mgb3MgZGlyZWl0b3MgcGF0cmltb25pYWlzIE7Dg08KQ09NRVJDSUFJUyBkZSB1dGlsaXphw6fDo28gZGEgT0JSQSBhcnTDrXN0aWNhIGUvb3UgY2llbnTDrWZpY2EgaW5kaWNhZGEgYWNpbWEsIGluY2x1c2l2ZSBvcyBkaXJlaXRvcwpkZSB2b3ogZSBpbWFnZW0gdmluY3VsYWRvcyDDoCBPQlJBLCBkdXJhbnRlIHRvZG8gbyBwcmF6byBkZSBkdXJhw6fDo28gZG9zIGRpcmVpdG9zIGF1dG9yYWlzLCBlbQpxdWFscXVlciBpZGlvbWEgZSBlbSB0b2RvcyBvcyBwYcOtc2VzOwoKKDIpIEFDRUlUQSBxdWUgYSBjZXNzw6NvIHRvdGFsIG7Do28gZXhjbHVzaXZhLCBwZXJtYW5lbnRlIGUgaXJyZXZvZ8OhdmVsIGRvcyBkaXJlaXRvcyBhdXRvcmFpcwpwYXRyaW1vbmlhaXMgbsOjbyBjb21lcmNpYWlzIGRlIHV0aWxpemHDp8OjbyBkZSBxdWUgdHJhdGEgZXN0ZSBkb2N1bWVudG8gaW5jbHVpLCBleGVtcGxpZmljYXRpdmFtZW50ZSwKb3MgZGlyZWl0b3MgZGUgZGlzcG9uaWJpbGl6YcOnw6NvIGUgY29tdW5pY2HDp8OjbyBww7pibGljYSBkYSBPQlJBLCBlbSBxdWFscXVlciBtZWlvIG91IHZlw61jdWxvLAppbmNsdXNpdmUgZW0gUmVwb3NpdMOzcmlvcyBEaWdpdGFpcywgYmVtIGNvbW8gb3MgZGlyZWl0b3MgZGUgcmVwcm9kdcOnw6NvLCBleGliacOnw6NvLCBleGVjdcOnw6NvLApkZWNsYW1hw6fDo28sIHJlY2l0YcOnw6NvLCBleHBvc2nDp8OjbywgYXJxdWl2YW1lbnRvLCBpbmNsdXPDo28gZW0gYmFuY28gZGUgZGFkb3MsIHByZXNlcnZhw6fDo28sIGRpZnVzw6NvLApkaXN0cmlidWnDp8OjbywgZGl2dWxnYcOnw6NvLCBlbXByw6lzdGltbywgdHJhZHXDp8OjbywgZHVibGFnZW0sIGxlZ2VuZGFnZW0sIGluY2x1c8OjbyBlbSBub3ZhcyBvYnJhcyBvdQpjb2xldMOibmVhcywgcmV1dGlsaXphw6fDo28sIGVkacOnw6NvLCBwcm9kdcOnw6NvIGRlIG1hdGVyaWFsIGRpZMOhdGljbyBlIGN1cnNvcyBvdSBxdWFscXVlciBmb3JtYSBkZQp1dGlsaXphw6fDo28gbsOjbyBjb21lcmNpYWw7CgooMykgUkVDT05IRUNFIHF1ZSBhIGNlc3PDo28gYXF1aSBlc3BlY2lmaWNhZGEgY29uY2VkZSDDoCBGSU9DUlVaIC0gRlVOREHDh8ODTyBPU1dBTERPCkNSVVogbyBkaXJlaXRvIGRlIGF1dG9yaXphciBxdWFscXVlciBwZXNzb2Eg4oCTIGbDrXNpY2Egb3UganVyw61kaWNhLCBww7pibGljYSBvdSBwcml2YWRhLCBuYWNpb25hbCBvdQplc3RyYW5nZWlyYSDigJMgYSBhY2Vzc2FyIGUgdXRpbGl6YXIgYW1wbGFtZW50ZSBhIE9CUkEsIHNlbSBleGNsdXNpdmlkYWRlLCBwYXJhIHF1YWlzcXVlcgpmaW5hbGlkYWRlcyBuw6NvIGNvbWVyY2lhaXM7CgooNCkgREVDTEFSQSBxdWUgYSBvYnJhIMOpIGNyaWHDp8OjbyBvcmlnaW5hbCBlIHF1ZSDDqSBvIHRpdHVsYXIgZG9zIGRpcmVpdG9zIGFxdWkgY2VkaWRvcyBlIGF1dG9yaXphZG9zLApyZXNwb25zYWJpbGl6YW5kby1zZSBpbnRlZ3JhbG1lbnRlIHBlbG8gY29udGXDumRvIGUgb3V0cm9zIGVsZW1lbnRvcyBxdWUgZmF6ZW0gcGFydGUgZGEgT0JSQSwKaW5jbHVzaXZlIG9zIGRpcmVpdG9zIGRlIHZveiBlIGltYWdlbSB2aW5jdWxhZG9zIMOgIE9CUkEsIG9icmlnYW5kby1zZSBhIGluZGVuaXphciB0ZXJjZWlyb3MgcG9yCmRhbm9zLCBiZW0gY29tbyBpbmRlbml6YXIgZSByZXNzYXJjaXIgYSBGSU9DUlVaIC0gRlVOREHDh8ODTyBPU1dBTERPIENSVVogZGUKZXZlbnR1YWlzIGRlc3Blc2FzIHF1ZSB2aWVyZW0gYSBzdXBvcnRhciwgZW0gcmF6w6NvIGRlIHF1YWxxdWVyIG9mZW5zYSBhIGRpcmVpdG9zIGF1dG9yYWlzIG91CmRpcmVpdG9zIGRlIHZveiBvdSBpbWFnZW0sIHByaW5jaXBhbG1lbnRlIG5vIHF1ZSBkaXogcmVzcGVpdG8gYSBwbMOhZ2lvIGUgdmlvbGHDp8O1ZXMgZGUgZGlyZWl0b3M7CgooNSkgQUZJUk1BIHF1ZSBjb25oZWNlIGEgUG9sw610aWNhIEluc3RpdHVjaW9uYWwgZGUgQWNlc3NvIEFiZXJ0byBkYSBGSU9DUlVaIC0gRlVOREHDh8ODTwpPU1dBTERPIENSVVogZSBhcyBkaXJldHJpemVzIHBhcmEgbyBmdW5jaW9uYW1lbnRvIGRvIHJlcG9zaXTDs3JpbyBpbnN0aXR1Y2lvbmFsIEFSQ0EuCgpBIFBvbMOtdGljYSBJbnN0aXR1Y2lvbmFsIGRlIEFjZXNzbyBBYmVydG8gZGEgRklPQ1JVWiAtIEZVTkRBw4fDg08gT1NXQUxETyBDUlVaIHJlc2VydmEKZXhjbHVzaXZhbWVudGUgYW8gQVVUT1Igb3MgZGlyZWl0b3MgbW9yYWlzIGUgb3MgdXNvcyBjb21lcmNpYWlzIHNvYnJlIGFzIG9icmFzIGRlIHN1YSBhdXRvcmlhCmUvb3UgdGl0dWxhcmlkYWRlLCBzZW5kbyBvcyB0ZXJjZWlyb3MgdXN1w6FyaW9zIHJlc3BvbnPDoXZlaXMgcGVsYSBhdHJpYnVpw6fDo28gZGUgYXV0b3JpYSBlIG1hbnV0ZW7Dp8OjbwpkYSBpbnRlZ3JpZGFkZSBkYSBPQlJBIGVtIHF1YWxxdWVyIHV0aWxpemHDp8Ojby4KCkEgUG9sw610aWNhIEluc3RpdHVjaW9uYWwgZGUgQWNlc3NvIEFiZXJ0byBkYSBGSU9DUlVaIC0gRlVOREHDh8ODTyBPU1dBTERPIENSVVoKcmVzcGVpdGEgb3MgY29udHJhdG9zIGUgYWNvcmRvcyBwcmVleGlzdGVudGVzIGRvcyBBdXRvcmVzIGNvbSB0ZXJjZWlyb3MsIGNhYmVuZG8gYW9zIEF1dG9yZXMKaW5mb3JtYXIgw6AgSW5zdGl0dWnDp8OjbyBhcyBjb25kacOnw7VlcyBlIG91dHJhcyByZXN0cmnDp8O1ZXMgaW1wb3N0YXMgcG9yIGVzdGVzIGluc3RydW1lbnRvcy4KRepositório InstitucionalPUBhttps://www.arca.fiocruz.br/oai/requestrepositorio.arca@fiocruz.bropendoar:21352021-06-18T05:01:07Repositório Institucional da FIOCRUZ (ARCA) - Fundação Oswaldo Cruz (FIOCRUZ)false
dc.title.pt_BR.fl_str_mv TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes
title TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes
spellingShingle TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes
Oliveira, Mauro de Medeiros
Transcription Initiation Site
Promoter Regions, Genetic
annotation of genomes
Genomics
conditional random fields
Sitio de Iniciación de la Transcripción
Regiones Promotoras Genéticas
Genómica
Site d'initiation de la transcription
Régions promotrices (génétique)
Génomique
Sítio de Iniciação de Transcrição
Regiões Promotoras Genéticas
Genômica
title_short TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes
title_full TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes
title_fullStr TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes
title_full_unstemmed TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes
title_sort TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes
author Oliveira, Mauro de Medeiros
author_facet Oliveira, Mauro de Medeiros
Bonadio, Igor
Melo, Alicia Lie de
Souza, Glaucia Mendes
Durham, Alan Mitchell
author_role author
author2 Bonadio, Igor
Melo, Alicia Lie de
Souza, Glaucia Mendes
Durham, Alan Mitchell
author2_role author
author
author
author
dc.contributor.author.fl_str_mv Oliveira, Mauro de Medeiros
Bonadio, Igor
Melo, Alicia Lie de
Souza, Glaucia Mendes
Durham, Alan Mitchell
dc.subject.en.pt_BR.fl_str_mv Transcription Initiation Site
Promoter Regions, Genetic
annotation of genomes
Genomics
conditional random fields
topic Transcription Initiation Site
Promoter Regions, Genetic
annotation of genomes
Genomics
conditional random fields
Sitio de Iniciación de la Transcripción
Regiones Promotoras Genéticas
Genómica
Site d'initiation de la transcription
Régions promotrices (génétique)
Génomique
Sítio de Iniciação de Transcrição
Regiões Promotoras Genéticas
Genômica
dc.subject.es.pt_BR.fl_str_mv Sitio de Iniciación de la Transcripción
Regiones Promotoras Genéticas
Genómica
dc.subject.fr.pt_BR.fl_str_mv Site d'initiation de la transcription
Régions promotrices (génétique)
Génomique
dc.subject.decs.pt_BR.fl_str_mv Sítio de Iniciação de Transcrição
Regiões Promotoras Genéticas
Genômica
description Fundação Oswaldo Cruz. Instituto Carlos Chagas. Laboratório de Proteômica Estrutural e Computacional. Curitiba, PR, Brasil.
publishDate 2021
dc.date.accessioned.fl_str_mv 2021-06-17T17:55:30Z
dc.date.available.fl_str_mv 2021-06-17T17:55:30Z
dc.date.issued.fl_str_mv 2021
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.citation.fl_str_mv OLIVEIRA, Mauro de Medeiros et al. TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes. Briefings in Bioinformatics, p. 1–12, 2021.
dc.identifier.uri.fl_str_mv https://www.arca.fiocruz.br/handle/icict/47751
dc.identifier.issn.pt_BR.fl_str_mv 1477-4054
dc.identifier.doi.none.fl_str_mv 10.1093/bib/bbab198
identifier_str_mv OLIVEIRA, Mauro de Medeiros et al. TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes. Briefings in Bioinformatics, p. 1–12, 2021.
1477-4054
10.1093/bib/bbab198
url https://www.arca.fiocruz.br/handle/icict/47751
dc.language.iso.fl_str_mv por
language por
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.publisher.none.fl_str_mv Oxford University Press
publisher.none.fl_str_mv Oxford University Press
dc.source.none.fl_str_mv reponame:Repositório Institucional da FIOCRUZ (ARCA)
instname:Fundação Oswaldo Cruz (FIOCRUZ)
instacron:FIOCRUZ
instname_str Fundação Oswaldo Cruz (FIOCRUZ)
instacron_str FIOCRUZ
institution FIOCRUZ
reponame_str Repositório Institucional da FIOCRUZ (ARCA)
collection Repositório Institucional da FIOCRUZ (ARCA)
bitstream.url.fl_str_mv https://www.arca.fiocruz.br/bitstream/icict/47751/1/license.txt
https://www.arca.fiocruz.br/bitstream/icict/47751/2/bbab19ok.pdf
https://www.arca.fiocruz.br/bitstream/icict/47751/3/bbab19ok.pdf.txt
bitstream.checksum.fl_str_mv 783568c2893d2e25a99990b126be1772
6d00c547e455d9df4b4b4d622557ed14
d183959227d2e8754bfe500433d0d055
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
repository.name.fl_str_mv Repositório Institucional da FIOCRUZ (ARCA) - Fundação Oswaldo Cruz (FIOCRUZ)
repository.mail.fl_str_mv repositorio.arca@fiocruz.br
_version_ 1798324630777233408