RFMirTarget : predicting human microRNA target genes with a random forest classifier
Autor(a) principal: | |
---|---|
Data de Publicação: | 2013 |
Outros Autores: | , , , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Institucional da UFRGS |
Texto Completo: | http://hdl.handle.net/10183/225281 |
Resumo: | MicroRNAs are key regulators of eukaryotic gene expression whose fundamental role has already been identified in many cell pathways. The correct identification of miRNAs targets is still a major challenge in bioinformatics and has motivated the development of several computational methods to overcome inherent limitations of experimental analysis. Indeed, the best results reported so far in terms of specificity and sensitivity are associated to machine learning-based methods for microRNA-target prediction. Following this trend, in the current paper we discuss and explore a microRNA-target prediction method based on a random forest classifier, namely RFMirTarget. Despite its well-known robustness regarding general classifying tasks, to the best of our knowledge, random forest have not been deeply explored for the specific context of predicting microRNAs targets. Our framework first analyzes alignments between candidate microRNA-target pairs and extracts a set of structural, thermodynamics, alignment, seed and position-based features, upon which classification is performed. Experiments have shown that RFMirTarget outperforms several well-known classifiers with statistical significance, and that its performance is not impaired by the class imbalance problem or features correlation. Moreover, comparing it against other algorithms for microRNA target prediction using independent test data sets from TarBase and starBase, we observe a very promising performance, with higher sensitivity in relation to other methods. Finally, tests performed with RFMirTarget show the benefits of feature selection even for a classifier with embedded feature importance analysis, and the consistency between relevant features identified and important biological properties for effective microRNA-target gene alignment. |
id |
UFRGS-2_1701433c5a7963859f47b8f00d48b3f7 |
---|---|
oai_identifier_str |
oai:www.lume.ufrgs.br:10183/225281 |
network_acronym_str |
UFRGS-2 |
network_name_str |
Repositório Institucional da UFRGS |
repository_id_str |
|
spelling |
Recamonde-Mendoza, MarianaFonseca, Guilherme Cordenonsi daMorais, Guilherme Loss deAlves, Ronnie Cley de OliveiraMargis, RogerioBazzan, Ana Lucia Cetertich2021-08-06T04:41:45Z20131932-6203http://hdl.handle.net/10183/225281000912859MicroRNAs are key regulators of eukaryotic gene expression whose fundamental role has already been identified in many cell pathways. The correct identification of miRNAs targets is still a major challenge in bioinformatics and has motivated the development of several computational methods to overcome inherent limitations of experimental analysis. Indeed, the best results reported so far in terms of specificity and sensitivity are associated to machine learning-based methods for microRNA-target prediction. Following this trend, in the current paper we discuss and explore a microRNA-target prediction method based on a random forest classifier, namely RFMirTarget. Despite its well-known robustness regarding general classifying tasks, to the best of our knowledge, random forest have not been deeply explored for the specific context of predicting microRNAs targets. Our framework first analyzes alignments between candidate microRNA-target pairs and extracts a set of structural, thermodynamics, alignment, seed and position-based features, upon which classification is performed. Experiments have shown that RFMirTarget outperforms several well-known classifiers with statistical significance, and that its performance is not impaired by the class imbalance problem or features correlation. Moreover, comparing it against other algorithms for microRNA target prediction using independent test data sets from TarBase and starBase, we observe a very promising performance, with higher sensitivity in relation to other methods. Finally, tests performed with RFMirTarget show the benefits of feature selection even for a classifier with embedded feature importance analysis, and the consistency between relevant features identified and important biological properties for effective microRNA-target gene alignment.application/pdfengPLoS ONE. San Francisco. Vol. 8, no. 7 (July 2013), e70153, 18 p.BioinformáticaAlgoritmos genéticosMicroRNAsRFMirTarget : predicting human microRNA target genes with a random forest classifierEstrangeiroinfo:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da UFRGSinstname:Universidade Federal do Rio Grande do Sul (UFRGS)instacron:UFRGSTEXT000912859.pdf.txt000912859.pdf.txtExtracted Texttext/plain90116http://www.lume.ufrgs.br/bitstream/10183/225281/2/000912859.pdf.txtde996ad4bc12efd998d8b6260cb7bfc6MD52ORIGINAL000912859.pdfTexto completo (inglês)application/pdf4084376http://www.lume.ufrgs.br/bitstream/10183/225281/1/000912859.pdfe012f1a4911b0127c74965ddba7605b5MD5110183/2252812023-09-23 03:36:09.247345oai:www.lume.ufrgs.br:10183/225281Repositório de PublicaçõesPUBhttps://lume.ufrgs.br/oai/requestopendoar:2023-09-23T06:36:09Repositório Institucional da UFRGS - Universidade Federal do Rio Grande do Sul (UFRGS)false |
dc.title.pt_BR.fl_str_mv |
RFMirTarget : predicting human microRNA target genes with a random forest classifier |
title |
RFMirTarget : predicting human microRNA target genes with a random forest classifier |
spellingShingle |
RFMirTarget : predicting human microRNA target genes with a random forest classifier Recamonde-Mendoza, Mariana Bioinformática Algoritmos genéticos MicroRNAs |
title_short |
RFMirTarget : predicting human microRNA target genes with a random forest classifier |
title_full |
RFMirTarget : predicting human microRNA target genes with a random forest classifier |
title_fullStr |
RFMirTarget : predicting human microRNA target genes with a random forest classifier |
title_full_unstemmed |
RFMirTarget : predicting human microRNA target genes with a random forest classifier |
title_sort |
RFMirTarget : predicting human microRNA target genes with a random forest classifier |
author |
Recamonde-Mendoza, Mariana |
author_facet |
Recamonde-Mendoza, Mariana Fonseca, Guilherme Cordenonsi da Morais, Guilherme Loss de Alves, Ronnie Cley de Oliveira Margis, Rogerio Bazzan, Ana Lucia Cetertich |
author_role |
author |
author2 |
Fonseca, Guilherme Cordenonsi da Morais, Guilherme Loss de Alves, Ronnie Cley de Oliveira Margis, Rogerio Bazzan, Ana Lucia Cetertich |
author2_role |
author author author author author |
dc.contributor.author.fl_str_mv |
Recamonde-Mendoza, Mariana Fonseca, Guilherme Cordenonsi da Morais, Guilherme Loss de Alves, Ronnie Cley de Oliveira Margis, Rogerio Bazzan, Ana Lucia Cetertich |
dc.subject.por.fl_str_mv |
Bioinformática Algoritmos genéticos MicroRNAs |
topic |
Bioinformática Algoritmos genéticos MicroRNAs |
description |
MicroRNAs are key regulators of eukaryotic gene expression whose fundamental role has already been identified in many cell pathways. The correct identification of miRNAs targets is still a major challenge in bioinformatics and has motivated the development of several computational methods to overcome inherent limitations of experimental analysis. Indeed, the best results reported so far in terms of specificity and sensitivity are associated to machine learning-based methods for microRNA-target prediction. Following this trend, in the current paper we discuss and explore a microRNA-target prediction method based on a random forest classifier, namely RFMirTarget. Despite its well-known robustness regarding general classifying tasks, to the best of our knowledge, random forest have not been deeply explored for the specific context of predicting microRNAs targets. Our framework first analyzes alignments between candidate microRNA-target pairs and extracts a set of structural, thermodynamics, alignment, seed and position-based features, upon which classification is performed. Experiments have shown that RFMirTarget outperforms several well-known classifiers with statistical significance, and that its performance is not impaired by the class imbalance problem or features correlation. Moreover, comparing it against other algorithms for microRNA target prediction using independent test data sets from TarBase and starBase, we observe a very promising performance, with higher sensitivity in relation to other methods. Finally, tests performed with RFMirTarget show the benefits of feature selection even for a classifier with embedded feature importance analysis, and the consistency between relevant features identified and important biological properties for effective microRNA-target gene alignment. |
publishDate |
2013 |
dc.date.issued.fl_str_mv |
2013 |
dc.date.accessioned.fl_str_mv |
2021-08-06T04:41:45Z |
dc.type.driver.fl_str_mv |
Estrangeiro info:eu-repo/semantics/article |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10183/225281 |
dc.identifier.issn.pt_BR.fl_str_mv |
1932-6203 |
dc.identifier.nrb.pt_BR.fl_str_mv |
000912859 |
identifier_str_mv |
1932-6203 000912859 |
url |
http://hdl.handle.net/10183/225281 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.ispartof.pt_BR.fl_str_mv |
PLoS ONE. San Francisco. Vol. 8, no. 7 (July 2013), e70153, 18 p. |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Institucional da UFRGS instname:Universidade Federal do Rio Grande do Sul (UFRGS) instacron:UFRGS |
instname_str |
Universidade Federal do Rio Grande do Sul (UFRGS) |
instacron_str |
UFRGS |
institution |
UFRGS |
reponame_str |
Repositório Institucional da UFRGS |
collection |
Repositório Institucional da UFRGS |
bitstream.url.fl_str_mv |
http://www.lume.ufrgs.br/bitstream/10183/225281/2/000912859.pdf.txt http://www.lume.ufrgs.br/bitstream/10183/225281/1/000912859.pdf |
bitstream.checksum.fl_str_mv |
de996ad4bc12efd998d8b6260cb7bfc6 e012f1a4911b0127c74965ddba7605b5 |
bitstream.checksumAlgorithm.fl_str_mv |
MD5 MD5 |
repository.name.fl_str_mv |
Repositório Institucional da UFRGS - Universidade Federal do Rio Grande do Sul (UFRGS) |
repository.mail.fl_str_mv |
|
_version_ |
1815447761535893504 |