Rama: a machine learning approach for ribosomal protein prediction in plants

Detalhes bibliográficos
Autor(a) principal: Carvalho, Thales Francisco Mota
Data de Publicação: 2017
Outros Autores: Silva, José Cleydson F., Calil, Iara Pinheiro, Fontes, Elizabeth Pacheco Batista, Cerqueira, Fabio Ribeiro
Tipo de documento: Artigo
Idioma: eng
Título da fonte: LOCUS Repositório Institucional da UFV
Texto Completo: http://dx.doi.org/10.1038/s41598-017-16322-4
http://www.locus.ufv.br/handle/123456789/16389
Resumo: Ribosomal proteins (RPs) play a fundamental role within all type of cells, as they are major components of ribosomes, which are essential for translation of mRNAs. Furthermore, these proteins are involved in various physiological and pathological processes. The intrinsic biological relevance of RPs motivated advanced studies for the identification of unrevealed RPs. In this work, we propose a new computational method, termed Rama, for the prediction of RPs, based on machine learning techniques, with a particular interest in plants. To perform an effective classification, Rama uses a set of fundamental attributes of the amino acid side chains and applies a two-step procedure to classify proteins with unknown function as RPs. The evaluation of the resultant predictive models showed that Rama could achieve mean sensitivity, precision, and specificity of 0.91, 0.91, and 0.82, respectively. Furthermore, a list of proteins that have no annotation in Phytozome v.10, and are annotated as RPs in Phytozome v.12, were correctly classified by our models. Additional computational experiments have also shown that Rama presents high accuracy to differentiate ribosomal proteins from RNA-binding proteins. Finally, two novel proteins of Arabidopsis thaliana were validated in biological experiments.
id UFV_7bada93fb1213c6568dfb9711a72f328
oai_identifier_str oai:locus.ufv.br:123456789/16389
network_acronym_str UFV
network_name_str LOCUS Repositório Institucional da UFV
repository_id_str 2145
spelling Carvalho, Thales Francisco MotaSilva, José Cleydson F.Calil, Iara PinheiroFontes, Elizabeth Pacheco BatistaCerqueira, Fabio Ribeiro2018-01-16T16:58:21Z2018-01-16T16:58:21Z2017-11-242045-2322http://dx.doi.org/10.1038/s41598-017-16322-4http://www.locus.ufv.br/handle/123456789/16389Ribosomal proteins (RPs) play a fundamental role within all type of cells, as they are major components of ribosomes, which are essential for translation of mRNAs. Furthermore, these proteins are involved in various physiological and pathological processes. The intrinsic biological relevance of RPs motivated advanced studies for the identification of unrevealed RPs. In this work, we propose a new computational method, termed Rama, for the prediction of RPs, based on machine learning techniques, with a particular interest in plants. To perform an effective classification, Rama uses a set of fundamental attributes of the amino acid side chains and applies a two-step procedure to classify proteins with unknown function as RPs. The evaluation of the resultant predictive models showed that Rama could achieve mean sensitivity, precision, and specificity of 0.91, 0.91, and 0.82, respectively. Furthermore, a list of proteins that have no annotation in Phytozome v.10, and are annotated as RPs in Phytozome v.12, were correctly classified by our models. Additional computational experiments have also shown that Rama presents high accuracy to differentiate ribosomal proteins from RNA-binding proteins. Finally, two novel proteins of Arabidopsis thaliana were validated in biological experiments.engScientific Reports7: 16273, Nov. 2017RamaRibosomal proteinRama: a machine learning approach for ribosomal protein prediction in plantsinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfinfo:eu-repo/semantics/openAccessreponame:LOCUS Repositório Institucional da UFVinstname:Universidade Federal de Viçosa (UFV)instacron:UFVORIGINALartigo.pdfartigo.pdftexto completoapplication/pdf1471977https://locus.ufv.br//bitstream/123456789/16389/1/artigo.pdf30ca48981d770d77414aae3d29b113eeMD51LICENSElicense.txtlicense.txttext/plain; charset=utf-81748https://locus.ufv.br//bitstream/123456789/16389/2/license.txt8a4605be74aa9ea9d79846c1fba20a33MD52THUMBNAILartigo.pdf.jpgartigo.pdf.jpgIM Thumbnailimage/jpeg4934https://locus.ufv.br//bitstream/123456789/16389/3/artigo.pdf.jpg0feec553103704355c3522bace4586caMD53123456789/163892018-01-16 22:00:37.74oai:locus.ufv.br:123456789/16389Tk9URTogUExBQ0UgWU9VUiBPV04gTElDRU5TRSBIRVJFClRoaXMgc2FtcGxlIGxpY2Vuc2UgaXMgcHJvdmlkZWQgZm9yIGluZm9ybWF0aW9uYWwgcHVycG9zZXMgb25seS4KCk5PTi1FWENMVVNJVkUgRElTVFJJQlVUSU9OIExJQ0VOU0UKCkJ5IHNpZ25pbmcgYW5kIHN1Ym1pdHRpbmcgdGhpcyBsaWNlbnNlLCB5b3UgKHRoZSBhdXRob3Iocykgb3IgY29weXJpZ2h0Cm93bmVyKSBncmFudHMgdG8gRFNwYWNlIFVuaXZlcnNpdHkgKERTVSkgdGhlIG5vbi1leGNsdXNpdmUgcmlnaHQgdG8gcmVwcm9kdWNlLAp0cmFuc2xhdGUgKGFzIGRlZmluZWQgYmVsb3cpLCBhbmQvb3IgZGlzdHJpYnV0ZSB5b3VyIHN1Ym1pc3Npb24gKGluY2x1ZGluZwp0aGUgYWJzdHJhY3QpIHdvcmxkd2lkZSBpbiBwcmludCBhbmQgZWxlY3Ryb25pYyBmb3JtYXQgYW5kIGluIGFueSBtZWRpdW0sCmluY2x1ZGluZyBidXQgbm90IGxpbWl0ZWQgdG8gYXVkaW8gb3IgdmlkZW8uCgpZb3UgYWdyZWUgdGhhdCBEU1UgbWF5LCB3aXRob3V0IGNoYW5naW5nIHRoZSBjb250ZW50LCB0cmFuc2xhdGUgdGhlCnN1Ym1pc3Npb24gdG8gYW55IG1lZGl1bSBvciBmb3JtYXQgZm9yIHRoZSBwdXJwb3NlIG9mIHByZXNlcnZhdGlvbi4KCllvdSBhbHNvIGFncmVlIHRoYXQgRFNVIG1heSBrZWVwIG1vcmUgdGhhbiBvbmUgY29weSBvZiB0aGlzIHN1Ym1pc3Npb24gZm9yCnB1cnBvc2VzIG9mIHNlY3VyaXR5LCBiYWNrLXVwIGFuZCBwcmVzZXJ2YXRpb24uCgpZb3UgcmVwcmVzZW50IHRoYXQgdGhlIHN1Ym1pc3Npb24gaXMgeW91ciBvcmlnaW5hbCB3b3JrLCBhbmQgdGhhdCB5b3UgaGF2ZQp0aGUgcmlnaHQgdG8gZ3JhbnQgdGhlIHJpZ2h0cyBjb250YWluZWQgaW4gdGhpcyBsaWNlbnNlLiBZb3UgYWxzbyByZXByZXNlbnQKdGhhdCB5b3VyIHN1Ym1pc3Npb24gZG9lcyBub3QsIHRvIHRoZSBiZXN0IG9mIHlvdXIga25vd2xlZGdlLCBpbmZyaW5nZSB1cG9uCmFueW9uZSdzIGNvcHlyaWdodC4KCklmIHRoZSBzdWJtaXNzaW9uIGNvbnRhaW5zIG1hdGVyaWFsIGZvciB3aGljaCB5b3UgZG8gbm90IGhvbGQgY29weXJpZ2h0LAp5b3UgcmVwcmVzZW50IHRoYXQgeW91IGhhdmUgb2J0YWluZWQgdGhlIHVucmVzdHJpY3RlZCBwZXJtaXNzaW9uIG9mIHRoZQpjb3B5cmlnaHQgb3duZXIgdG8gZ3JhbnQgRFNVIHRoZSByaWdodHMgcmVxdWlyZWQgYnkgdGhpcyBsaWNlbnNlLCBhbmQgdGhhdApzdWNoIHRoaXJkLXBhcnR5IG93bmVkIG1hdGVyaWFsIGlzIGNsZWFybHkgaWRlbnRpZmllZCBhbmQgYWNrbm93bGVkZ2VkCndpdGhpbiB0aGUgdGV4dCBvciBjb250ZW50IG9mIHRoZSBzdWJtaXNzaW9uLgoKSUYgVEhFIFNVQk1JU1NJT04gSVMgQkFTRUQgVVBPTiBXT1JLIFRIQVQgSEFTIEJFRU4gU1BPTlNPUkVEIE9SIFNVUFBPUlRFRApCWSBBTiBBR0VOQ1kgT1IgT1JHQU5JWkFUSU9OIE9USEVSIFRIQU4gRFNVLCBZT1UgUkVQUkVTRU5UIFRIQVQgWU9VIEhBVkUKRlVMRklMTEVEIEFOWSBSSUdIVCBPRiBSRVZJRVcgT1IgT1RIRVIgT0JMSUdBVElPTlMgUkVRVUlSRUQgQlkgU1VDSApDT05UUkFDVCBPUiBBR1JFRU1FTlQuCgpEU1Ugd2lsbCBjbGVhcmx5IGlkZW50aWZ5IHlvdXIgbmFtZShzKSBhcyB0aGUgYXV0aG9yKHMpIG9yIG93bmVyKHMpIG9mIHRoZQpzdWJtaXNzaW9uLCBhbmQgd2lsbCBub3QgbWFrZSBhbnkgYWx0ZXJhdGlvbiwgb3RoZXIgdGhhbiBhcyBhbGxvd2VkIGJ5IHRoaXMKbGljZW5zZSwgdG8geW91ciBzdWJtaXNzaW9uLgo=Repositório InstitucionalPUBhttps://www.locus.ufv.br/oai/requestfabiojreis@ufv.bropendoar:21452018-01-17T01:00:37LOCUS Repositório Institucional da UFV - Universidade Federal de Viçosa (UFV)false
dc.title.en.fl_str_mv Rama: a machine learning approach for ribosomal protein prediction in plants
title Rama: a machine learning approach for ribosomal protein prediction in plants
spellingShingle Rama: a machine learning approach for ribosomal protein prediction in plants
Carvalho, Thales Francisco Mota
Rama
Ribosomal protein
title_short Rama: a machine learning approach for ribosomal protein prediction in plants
title_full Rama: a machine learning approach for ribosomal protein prediction in plants
title_fullStr Rama: a machine learning approach for ribosomal protein prediction in plants
title_full_unstemmed Rama: a machine learning approach for ribosomal protein prediction in plants
title_sort Rama: a machine learning approach for ribosomal protein prediction in plants
author Carvalho, Thales Francisco Mota
author_facet Carvalho, Thales Francisco Mota
Silva, José Cleydson F.
Calil, Iara Pinheiro
Fontes, Elizabeth Pacheco Batista
Cerqueira, Fabio Ribeiro
author_role author
author2 Silva, José Cleydson F.
Calil, Iara Pinheiro
Fontes, Elizabeth Pacheco Batista
Cerqueira, Fabio Ribeiro
author2_role author
author
author
author
dc.contributor.author.fl_str_mv Carvalho, Thales Francisco Mota
Silva, José Cleydson F.
Calil, Iara Pinheiro
Fontes, Elizabeth Pacheco Batista
Cerqueira, Fabio Ribeiro
dc.subject.pt-BR.fl_str_mv Rama
Ribosomal protein
topic Rama
Ribosomal protein
description Ribosomal proteins (RPs) play a fundamental role within all type of cells, as they are major components of ribosomes, which are essential for translation of mRNAs. Furthermore, these proteins are involved in various physiological and pathological processes. The intrinsic biological relevance of RPs motivated advanced studies for the identification of unrevealed RPs. In this work, we propose a new computational method, termed Rama, for the prediction of RPs, based on machine learning techniques, with a particular interest in plants. To perform an effective classification, Rama uses a set of fundamental attributes of the amino acid side chains and applies a two-step procedure to classify proteins with unknown function as RPs. The evaluation of the resultant predictive models showed that Rama could achieve mean sensitivity, precision, and specificity of 0.91, 0.91, and 0.82, respectively. Furthermore, a list of proteins that have no annotation in Phytozome v.10, and are annotated as RPs in Phytozome v.12, were correctly classified by our models. Additional computational experiments have also shown that Rama presents high accuracy to differentiate ribosomal proteins from RNA-binding proteins. Finally, two novel proteins of Arabidopsis thaliana were validated in biological experiments.
publishDate 2017
dc.date.issued.fl_str_mv 2017-11-24
dc.date.accessioned.fl_str_mv 2018-01-16T16:58:21Z
dc.date.available.fl_str_mv 2018-01-16T16:58:21Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://dx.doi.org/10.1038/s41598-017-16322-4
http://www.locus.ufv.br/handle/123456789/16389
dc.identifier.issn.none.fl_str_mv 2045-2322
identifier_str_mv 2045-2322
url http://dx.doi.org/10.1038/s41598-017-16322-4
http://www.locus.ufv.br/handle/123456789/16389
dc.language.iso.fl_str_mv eng
language eng
dc.relation.ispartofseries.pt-BR.fl_str_mv 7: 16273, Nov. 2017
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Scientific Reports
publisher.none.fl_str_mv Scientific Reports
dc.source.none.fl_str_mv reponame:LOCUS Repositório Institucional da UFV
instname:Universidade Federal de Viçosa (UFV)
instacron:UFV
instname_str Universidade Federal de Viçosa (UFV)
instacron_str UFV
institution UFV
reponame_str LOCUS Repositório Institucional da UFV
collection LOCUS Repositório Institucional da UFV
bitstream.url.fl_str_mv https://locus.ufv.br//bitstream/123456789/16389/1/artigo.pdf
https://locus.ufv.br//bitstream/123456789/16389/2/license.txt
https://locus.ufv.br//bitstream/123456789/16389/3/artigo.pdf.jpg
bitstream.checksum.fl_str_mv 30ca48981d770d77414aae3d29b113ee
8a4605be74aa9ea9d79846c1fba20a33
0feec553103704355c3522bace4586ca
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
repository.name.fl_str_mv LOCUS Repositório Institucional da UFV - Universidade Federal de Viçosa (UFV)
repository.mail.fl_str_mv fabiojreis@ufv.br
_version_ 1798053118624137216