TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes
Main author: | Oliveira, Mauro de Medeiros |
---|---|
Publication date: | 2021 |
Other authors: | Bonadio, Igor; Melo, Alicia Lie de; Souza, Glaucia Mendes; Durham, Alan Mitchell |
Document type: | Article |
Language: | por |
Source title: | Repositório Institucional da FIOCRUZ (ARCA) |
Full text: | https://www.arca.fiocruz.br/handle/icict/47751 |
Abstract: | Fundação Oswaldo Cruz. Instituto Carlos Chagas. Laboratório de Proteômica Estrutural e Computacional. Curitiba, PR, Brasil. |
id |
CRUZ_3752eb5f58ec48655afffca60e56600f |
---|---|
oai_identifier_str |
oai:www.arca.fiocruz.br:icict/47751 |
network_acronym_str |
CRUZ |
network_name_str |
Repositório Institucional da FIOCRUZ (ARCA) |
repository_id_str |
2135 |
spelling |
Authors: Oliveira, Mauro de Medeiros; Bonadio, Igor; Melo, Alicia Lie de; Souza, Glaucia Mendes; Durham, Alan Mitchell.
Affiliations: Fundação Oswaldo Cruz. Instituto Carlos Chagas. Laboratório de Proteômica Estrutural e Computacional. Curitiba, PR, Brasil; Technology Company Elo7. São Paulo, SP, Brasil; Universidade de São Paulo. São Paulo, SP, Brasil; Universidade de São Paulo. Instituto de Química. São Paulo, SP, Brasil; Universidade de São Paulo. São Paulo, SP, Brasil.
Abstract: Promoter annotation is an important task in the analysis of a genome. One of the main challenges for this task is locating the border between the promoter region and the transcribing region of the gene, the transcription start site (TSS). The TSS is the reference point to delimit the DNA sequence responsible for the assembly of the transcribing complex. Because the same gene can have more than one TSS, delimiting the promoter region requires locating the TSS closest to the site where translation begins. This paper presents TSSFinder, a new software tool for the prediction of the TSS signal of eukaryotic genes that is significantly more accurate than other available software. TSSFinder is currently the only application to offer pre-trained models for six different eukaryotic organisms: Arabidopsis thaliana, Drosophila melanogaster, Gallus gallus, Homo sapiens, Oryza sativa and Saccharomyces cerevisiae. Additionally, the software can be easily customized for specific organisms using only 125 DNA sequences with a validated TSS signal and corresponding genomic locations as a training set. TSSFinder is a valuable new tool for the annotation of genomes. TSSFinder source code and Docker container can be downloaded from http://tssfinder.github.io. Alternatively, TSSFinder is also available as a web service at http://sucest-fun.org/wsapp/tssfinder/.
Files: license.txt (text/plain; charset=utf-8, 3084 bytes); bbab19ok.pdf (application/pdf, 523885 bytes); bbab19ok.pdf.txt (extracted text, text/plain, 70434 bytes). |
dc.title.pt_BR.fl_str_mv |
TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes |
title |
TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes |
spellingShingle |
TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes; Oliveira, Mauro de Medeiros; Transcription Initiation Site; Promoter Regions, Genetic; annotation of genomes; Genomics; conditional random fields; Sitio de Iniciación de la Transcripción; Regiones Promotoras Genéticas; Genómica; Site d'initiation de la transcription; Régions promotrices (génétique); Génomique; Sítio de Iniciação de Transcrição; Regiões Promotoras Genéticas; Genômica |
title_short |
TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes |
title_full |
TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes |
title_fullStr |
TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes |
title_full_unstemmed |
TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes |
title_sort |
TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes |
author |
Oliveira, Mauro de Medeiros |
author_facet |
Oliveira, Mauro de Medeiros; Bonadio, Igor; Melo, Alicia Lie de; Souza, Glaucia Mendes; Durham, Alan Mitchell |
author_role |
author |
author2 |
Bonadio, Igor; Melo, Alicia Lie de; Souza, Glaucia Mendes; Durham, Alan Mitchell |
author2_role |
author author author author |
dc.contributor.author.fl_str_mv |
Oliveira, Mauro de Medeiros; Bonadio, Igor; Melo, Alicia Lie de; Souza, Glaucia Mendes; Durham, Alan Mitchell |
dc.subject.en.pt_BR.fl_str_mv |
Transcription Initiation Site; Promoter Regions, Genetic; annotation of genomes; Genomics; conditional random fields |
topic |
Transcription Initiation Site; Promoter Regions, Genetic; annotation of genomes; Genomics; conditional random fields; Sitio de Iniciación de la Transcripción; Regiones Promotoras Genéticas; Genómica; Site d'initiation de la transcription; Régions promotrices (génétique); Génomique; Sítio de Iniciação de Transcrição; Regiões Promotoras Genéticas; Genômica |
dc.subject.es.pt_BR.fl_str_mv |
Sitio de Iniciación de la Transcripción; Regiones Promotoras Genéticas; Genómica |
dc.subject.fr.pt_BR.fl_str_mv |
Site d'initiation de la transcription; Régions promotrices (génétique); Génomique |
dc.subject.decs.pt_BR.fl_str_mv |
Sítio de Iniciação de Transcrição; Regiões Promotoras Genéticas; Genômica |
description |
Fundação Oswaldo Cruz. Instituto Carlos Chagas. Laboratório de Proteômica Estrutural e Computacional. Curitiba, PR, Brasil. |
publishDate |
2021 |
dc.date.accessioned.fl_str_mv |
2021-06-17T17:55:30Z |
dc.date.available.fl_str_mv |
2021-06-17T17:55:30Z |
dc.date.issued.fl_str_mv |
2021 |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.citation.fl_str_mv |
OLIVEIRA, Mauro de Medeiros et al. TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes. Briefings in Bioinformatics, p. 1–12, 2021. |
dc.identifier.uri.fl_str_mv |
https://www.arca.fiocruz.br/handle/icict/47751 |
dc.identifier.issn.pt_BR.fl_str_mv |
1477-4054 |
dc.identifier.doi.none.fl_str_mv |
10.1093/bib/bbab198 |
identifier_str_mv |
OLIVEIRA, Mauro de Medeiros et al. TSSFinder—fast and accurate ab initio prediction of the core promoter in eukaryotic genomes. Briefings in Bioinformatics, p. 1–12, 2021. 1477-4054 10.1093/bib/bbab198 |
url |
https://www.arca.fiocruz.br/handle/icict/47751 |
dc.language.iso.fl_str_mv |
por |
language |
por |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.publisher.none.fl_str_mv |
Oxford University Press |
publisher.none.fl_str_mv |
Oxford University Press |
dc.source.none.fl_str_mv |
reponame:Repositório Institucional da FIOCRUZ (ARCA) instname:Fundação Oswaldo Cruz (FIOCRUZ) instacron:FIOCRUZ |
instname_str |
Fundação Oswaldo Cruz (FIOCRUZ) |
instacron_str |
FIOCRUZ |
institution |
FIOCRUZ |
reponame_str |
Repositório Institucional da FIOCRUZ (ARCA) |
collection |
Repositório Institucional da FIOCRUZ (ARCA) |
bitstream.url.fl_str_mv |
https://www.arca.fiocruz.br/bitstream/icict/47751/1/license.txt https://www.arca.fiocruz.br/bitstream/icict/47751/2/bbab19ok.pdf https://www.arca.fiocruz.br/bitstream/icict/47751/3/bbab19ok.pdf.txt |
bitstream.checksum.fl_str_mv |
783568c2893d2e25a99990b126be1772 6d00c547e455d9df4b4b4d622557ed14 d183959227d2e8754bfe500433d0d055 |
bitstream.checksumAlgorithm.fl_str_mv |
MD5 MD5 MD5 |
repository.name.fl_str_mv |
Repositório Institucional da FIOCRUZ (ARCA) - Fundação Oswaldo Cruz (FIOCRUZ) |
repository.mail.fl_str_mv |
repositorio.arca@fiocruz.br |
_version_ |
1813008845648166912 |
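
The record lists an MD5 checksum for each bitstream (bitstream.checksum.fl_str_mv, in the same order as bitstream.url.fl_str_mv). A minimal sketch of how a downloaded copy might be checked against those values follows, using only Python's standard hashlib module. The function names md5_of_file and matches_record and the dictionary EXPECTED_MD5 are illustrative and not part of the repository; the pairing of checksums with file names assumes the two fields are listed in the same order.

```python
import hashlib

# Checksums copied from bitstream.checksum.fl_str_mv, keyed by the file names
# in bitstream.url.fl_str_mv (assumed to be listed in the same order).
EXPECTED_MD5 = {
    "license.txt": "783568c2893d2e25a99990b126be1772",
    "bbab19ok.pdf": "6d00c547e455d9df4b4b4d622557ed14",
    "bbab19ok.pdf.txt": "d183959227d2e8754bfe500433d0d055",
}


def md5_of_file(path, chunk_size=65536):
    """Return the hexadecimal MD5 digest of the file at `path`."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in chunks so large PDFs are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_record(path, name):
    """Check a local file against the checksum recorded for `name`."""
    return md5_of_file(path) == EXPECTED_MD5[name]
```

For example, after saving the PDF locally, matches_record("bbab19ok.pdf", "bbab19ok.pdf") would return True only if the local bytes match the recorded checksum. MD5 is adequate here for detecting accidental corruption, not for protecting against deliberate tampering.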