RFMirTarget : predicting human microRNA target genes with a random forest classifier

Detalhes bibliográficos
Autor(a) principal: Recamonde-Mendoza, Mariana
Data de Publicação: 2013
Outros Autores: Fonseca, Guilherme Cordenonsi da, Morais, Guilherme Loss de, Alves, Ronnie Cley de Oliveira, Margis, Rogerio, Bazzan, Ana Lucia Cetertich
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Institucional da UFRGS
Texto Completo: http://hdl.handle.net/10183/225281
Resumo: MicroRNAs are key regulators of eukaryotic gene expression whose fundamental role has already been identified in many cell pathways. The correct identification of miRNAs targets is still a major challenge in bioinformatics and has motivated the development of several computational methods to overcome inherent limitations of experimental analysis. Indeed, the best results reported so far in terms of specificity and sensitivity are associated to machine learning-based methods for microRNA-target prediction. Following this trend, in the current paper we discuss and explore a microRNA-target prediction method based on a random forest classifier, namely RFMirTarget. Despite its well-known robustness regarding general classifying tasks, to the best of our knowledge, random forest have not been deeply explored for the specific context of predicting microRNAs targets. Our framework first analyzes alignments between candidate microRNA-target pairs and extracts a set of structural, thermodynamics, alignment, seed and position-based features, upon which classification is performed. Experiments have shown that RFMirTarget outperforms several well-known classifiers with statistical significance, and that its performance is not impaired by the class imbalance problem or features correlation. Moreover, comparing it against other algorithms for microRNA target prediction using independent test data sets from TarBase and starBase, we observe a very promising performance, with higher sensitivity in relation to other methods. Finally, tests performed with RFMirTarget show the benefits of feature selection even for a classifier with embedded feature importance analysis, and the consistency between relevant features identified and important biological properties for effective microRNA-target gene alignment.
id UFRGS-2_1701433c5a7963859f47b8f00d48b3f7
oai_identifier_str oai:www.lume.ufrgs.br:10183/225281
network_acronym_str UFRGS-2
network_name_str Repositório Institucional da UFRGS
repository_id_str
spelling Recamonde-Mendoza, MarianaFonseca, Guilherme Cordenonsi daMorais, Guilherme Loss deAlves, Ronnie Cley de OliveiraMargis, RogerioBazzan, Ana Lucia Cetertich2021-08-06T04:41:45Z20131932-6203http://hdl.handle.net/10183/225281000912859MicroRNAs are key regulators of eukaryotic gene expression whose fundamental role has already been identified in many cell pathways. The correct identification of miRNAs targets is still a major challenge in bioinformatics and has motivated the development of several computational methods to overcome inherent limitations of experimental analysis. Indeed, the best results reported so far in terms of specificity and sensitivity are associated to machine learning-based methods for microRNA-target prediction. Following this trend, in the current paper we discuss and explore a microRNA-target prediction method based on a random forest classifier, namely RFMirTarget. Despite its well-known robustness regarding general classifying tasks, to the best of our knowledge, random forest have not been deeply explored for the specific context of predicting microRNAs targets. Our framework first analyzes alignments between candidate microRNA-target pairs and extracts a set of structural, thermodynamics, alignment, seed and position-based features, upon which classification is performed. Experiments have shown that RFMirTarget outperforms several well-known classifiers with statistical significance, and that its performance is not impaired by the class imbalance problem or features correlation. Moreover, comparing it against other algorithms for microRNA target prediction using independent test data sets from TarBase and starBase, we observe a very promising performance, with higher sensitivity in relation to other methods. Finally, tests performed with RFMirTarget show the benefits of feature selection even for a classifier with embedded feature importance analysis, and the consistency between relevant features identified and important biological properties for effective microRNA-target gene alignment.application/pdfengPLoS ONE. San Francisco. Vol. 8, no. 7 (July 2013), e70153, 18 p.BioinformáticaAlgoritmos genéticosMicroRNAsRFMirTarget : predicting human microRNA target genes with a random forest classifierEstrangeiroinfo:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da UFRGSinstname:Universidade Federal do Rio Grande do Sul (UFRGS)instacron:UFRGSTEXT000912859.pdf.txt000912859.pdf.txtExtracted Texttext/plain90116http://www.lume.ufrgs.br/bitstream/10183/225281/2/000912859.pdf.txtde996ad4bc12efd998d8b6260cb7bfc6MD52ORIGINAL000912859.pdfTexto completo (inglês)application/pdf4084376http://www.lume.ufrgs.br/bitstream/10183/225281/1/000912859.pdfe012f1a4911b0127c74965ddba7605b5MD5110183/2252812023-09-23 03:36:09.247345oai:www.lume.ufrgs.br:10183/225281Repositório de PublicaçõesPUBhttps://lume.ufrgs.br/oai/requestopendoar:2023-09-23T06:36:09Repositório Institucional da UFRGS - Universidade Federal do Rio Grande do Sul (UFRGS)false
dc.title.pt_BR.fl_str_mv RFMirTarget : predicting human microRNA target genes with a random forest classifier
title RFMirTarget : predicting human microRNA target genes with a random forest classifier
spellingShingle RFMirTarget : predicting human microRNA target genes with a random forest classifier
Recamonde-Mendoza, Mariana
Bioinformática
Algoritmos genéticos
MicroRNAs
title_short RFMirTarget : predicting human microRNA target genes with a random forest classifier
title_full RFMirTarget : predicting human microRNA target genes with a random forest classifier
title_fullStr RFMirTarget : predicting human microRNA target genes with a random forest classifier
title_full_unstemmed RFMirTarget : predicting human microRNA target genes with a random forest classifier
title_sort RFMirTarget : predicting human microRNA target genes with a random forest classifier
author Recamonde-Mendoza, Mariana
author_facet Recamonde-Mendoza, Mariana
Fonseca, Guilherme Cordenonsi da
Morais, Guilherme Loss de
Alves, Ronnie Cley de Oliveira
Margis, Rogerio
Bazzan, Ana Lucia Cetertich
author_role author
author2 Fonseca, Guilherme Cordenonsi da
Morais, Guilherme Loss de
Alves, Ronnie Cley de Oliveira
Margis, Rogerio
Bazzan, Ana Lucia Cetertich
author2_role author
author
author
author
author
dc.contributor.author.fl_str_mv Recamonde-Mendoza, Mariana
Fonseca, Guilherme Cordenonsi da
Morais, Guilherme Loss de
Alves, Ronnie Cley de Oliveira
Margis, Rogerio
Bazzan, Ana Lucia Cetertich
dc.subject.por.fl_str_mv Bioinformática
Algoritmos genéticos
MicroRNAs
topic Bioinformática
Algoritmos genéticos
MicroRNAs
description MicroRNAs are key regulators of eukaryotic gene expression whose fundamental role has already been identified in many cell pathways. The correct identification of miRNAs targets is still a major challenge in bioinformatics and has motivated the development of several computational methods to overcome inherent limitations of experimental analysis. Indeed, the best results reported so far in terms of specificity and sensitivity are associated to machine learning-based methods for microRNA-target prediction. Following this trend, in the current paper we discuss and explore a microRNA-target prediction method based on a random forest classifier, namely RFMirTarget. Despite its well-known robustness regarding general classifying tasks, to the best of our knowledge, random forest have not been deeply explored for the specific context of predicting microRNAs targets. Our framework first analyzes alignments between candidate microRNA-target pairs and extracts a set of structural, thermodynamics, alignment, seed and position-based features, upon which classification is performed. Experiments have shown that RFMirTarget outperforms several well-known classifiers with statistical significance, and that its performance is not impaired by the class imbalance problem or features correlation. Moreover, comparing it against other algorithms for microRNA target prediction using independent test data sets from TarBase and starBase, we observe a very promising performance, with higher sensitivity in relation to other methods. Finally, tests performed with RFMirTarget show the benefits of feature selection even for a classifier with embedded feature importance analysis, and the consistency between relevant features identified and important biological properties for effective microRNA-target gene alignment.
publishDate 2013
dc.date.issued.fl_str_mv 2013
dc.date.accessioned.fl_str_mv 2021-08-06T04:41:45Z
dc.type.driver.fl_str_mv Estrangeiro
info:eu-repo/semantics/article
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10183/225281
dc.identifier.issn.pt_BR.fl_str_mv 1932-6203
dc.identifier.nrb.pt_BR.fl_str_mv 000912859
identifier_str_mv 1932-6203
000912859
url http://hdl.handle.net/10183/225281
dc.language.iso.fl_str_mv eng
language eng
dc.relation.ispartof.pt_BR.fl_str_mv PLoS ONE. San Francisco. Vol. 8, no. 7 (July 2013), e70153, 18 p.
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositório Institucional da UFRGS
instname:Universidade Federal do Rio Grande do Sul (UFRGS)
instacron:UFRGS
instname_str Universidade Federal do Rio Grande do Sul (UFRGS)
instacron_str UFRGS
institution UFRGS
reponame_str Repositório Institucional da UFRGS
collection Repositório Institucional da UFRGS
bitstream.url.fl_str_mv http://www.lume.ufrgs.br/bitstream/10183/225281/2/000912859.pdf.txt
http://www.lume.ufrgs.br/bitstream/10183/225281/1/000912859.pdf
bitstream.checksum.fl_str_mv de996ad4bc12efd998d8b6260cb7bfc6
e012f1a4911b0127c74965ddba7605b5
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
repository.name.fl_str_mv Repositório Institucional da UFRGS - Universidade Federal do Rio Grande do Sul (UFRGS)
repository.mail.fl_str_mv
_version_ 1815447761535893504