Identification of tissue‐specific tumor biomarker using different optimization algorithms

Detalhes bibliográficos
Autor(a) principal: Bhowmick, Shib Sankar
Data de Publicação: 2018
Outros Autores: Bhattacharjee, Debotosh, Rato, Luis
Tipo de documento: Artigo
Idioma: por
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10174/25325
https://doi.org/10.1007/s13258-018-0773-2
Resumo: Background Identification of differentially expressed genes, i.e., genes whose transcript abundance level differs across different biological or physiological conditions, was indeed a challenging task. However, the inception of transcriptome sequencing (RNA-seq) technology revolutionized the simultaneous measurement of the transcript abundance levels for thousands of genes. Objective In this paper, such next-generation sequencing (NGS) data is used to identify biomarker signatures for several of the most common cancer types (bladder, colon, kidney, brain, liver, lung, prostate, skin, and thyroid) Methods Here, the problem is mapped into the comparison of optimization algorithms for selecting a set of genes that lead to the highest classification accuracy of a two-class classification task between healthy and tumor samples. As the opti- mization algorithms Artificial Bee Colony (ABC), Ant Colony Optimization, Differential Evolution, and Particle Swarm Optimization are chosen for this experiment. A standard statistical method called DESeq2 is used to select differentially expressed genes before being feed to the optimization algorithms. Classification of healthy and tumor samples is done by support vector machine Results Cancer-specific validation yields remarkably good results in terms of accuracy. Highest classification accuracy is achieved by the ABC algorithm for Brain lower grade glioma data is 99.10%. This validation is well supported by a statisti- cal test, gene ontology enrichment analysis, and KEGG pathway enrichment analysis for each cancer biomarker signature Conclusion The current study identified robust genes as biomarker signatures and these identified biomarkers might be helpful to accurately identify tumors of unknown origin
id RCAP_a9fa794192140ea9a362100946841441
oai_identifier_str oai:dspace.uevora.pt:10174/25325
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Identification of tissue‐specific tumor biomarker using different optimization algorithmsbiomarkermachine learningmessenger RNAoptimizationpathway analysisBackground Identification of differentially expressed genes, i.e., genes whose transcript abundance level differs across different biological or physiological conditions, was indeed a challenging task. However, the inception of transcriptome sequencing (RNA-seq) technology revolutionized the simultaneous measurement of the transcript abundance levels for thousands of genes. Objective In this paper, such next-generation sequencing (NGS) data is used to identify biomarker signatures for several of the most common cancer types (bladder, colon, kidney, brain, liver, lung, prostate, skin, and thyroid) Methods Here, the problem is mapped into the comparison of optimization algorithms for selecting a set of genes that lead to the highest classification accuracy of a two-class classification task between healthy and tumor samples. As the opti- mization algorithms Artificial Bee Colony (ABC), Ant Colony Optimization, Differential Evolution, and Particle Swarm Optimization are chosen for this experiment. A standard statistical method called DESeq2 is used to select differentially expressed genes before being feed to the optimization algorithms. Classification of healthy and tumor samples is done by support vector machine Results Cancer-specific validation yields remarkably good results in terms of accuracy. Highest classification accuracy is achieved by the ABC algorithm for Brain lower grade glioma data is 99.10%. This validation is well supported by a statisti- cal test, gene ontology enrichment analysis, and KEGG pathway enrichment analysis for each cancer biomarker signature Conclusion The current study identified robust genes as biomarker signatures and these identified biomarkers might be helpful to accurately identify tumors of unknown originSpringer2019-03-01T10:59:06Z2019-03-012018-12-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://hdl.handle.net/10174/25325https://doi.org/10.1007/s13258-018-0773-2http://hdl.handle.net/10174/25325https://doi.org/10.1007/s13258-018-0773-2porBhowmick, S.S., Bhattacharjee, D. & Rato, L., Identification of tissue‐specific tumor biomarker using different optimization algorithms, Genes and Genomics, The Genetics Society of Korea, Springer, 2018.ndndlmr@uevora.pt498Bhowmick, Shib SankarBhattacharjee, DebotoshRato, Luisinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-01-03T19:19:04Zoai:dspace.uevora.pt:10174/25325Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T01:15:48.676791Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Identification of tissue‐specific tumor biomarker using different optimization algorithms
title Identification of tissue‐specific tumor biomarker using different optimization algorithms
spellingShingle Identification of tissue‐specific tumor biomarker using different optimization algorithms
Bhowmick, Shib Sankar
biomarker
machine learning
messenger RNA
optimization
pathway analysis
title_short Identification of tissue‐specific tumor biomarker using different optimization algorithms
title_full Identification of tissue‐specific tumor biomarker using different optimization algorithms
title_fullStr Identification of tissue‐specific tumor biomarker using different optimization algorithms
title_full_unstemmed Identification of tissue‐specific tumor biomarker using different optimization algorithms
title_sort Identification of tissue‐specific tumor biomarker using different optimization algorithms
author Bhowmick, Shib Sankar
author_facet Bhowmick, Shib Sankar
Bhattacharjee, Debotosh
Rato, Luis
author_role author
author2 Bhattacharjee, Debotosh
Rato, Luis
author2_role author
author
dc.contributor.author.fl_str_mv Bhowmick, Shib Sankar
Bhattacharjee, Debotosh
Rato, Luis
dc.subject.por.fl_str_mv biomarker
machine learning
messenger RNA
optimization
pathway analysis
topic biomarker
machine learning
messenger RNA
optimization
pathway analysis
description Background Identification of differentially expressed genes, i.e., genes whose transcript abundance level differs across different biological or physiological conditions, was indeed a challenging task. However, the inception of transcriptome sequencing (RNA-seq) technology revolutionized the simultaneous measurement of the transcript abundance levels for thousands of genes. Objective In this paper, such next-generation sequencing (NGS) data is used to identify biomarker signatures for several of the most common cancer types (bladder, colon, kidney, brain, liver, lung, prostate, skin, and thyroid) Methods Here, the problem is mapped into the comparison of optimization algorithms for selecting a set of genes that lead to the highest classification accuracy of a two-class classification task between healthy and tumor samples. As the opti- mization algorithms Artificial Bee Colony (ABC), Ant Colony Optimization, Differential Evolution, and Particle Swarm Optimization are chosen for this experiment. A standard statistical method called DESeq2 is used to select differentially expressed genes before being feed to the optimization algorithms. Classification of healthy and tumor samples is done by support vector machine Results Cancer-specific validation yields remarkably good results in terms of accuracy. Highest classification accuracy is achieved by the ABC algorithm for Brain lower grade glioma data is 99.10%. This validation is well supported by a statisti- cal test, gene ontology enrichment analysis, and KEGG pathway enrichment analysis for each cancer biomarker signature Conclusion The current study identified robust genes as biomarker signatures and these identified biomarkers might be helpful to accurately identify tumors of unknown origin
publishDate 2018
dc.date.none.fl_str_mv 2018-12-01T00:00:00Z
2019-03-01T10:59:06Z
2019-03-01
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10174/25325
https://doi.org/10.1007/s13258-018-0773-2
http://hdl.handle.net/10174/25325
https://doi.org/10.1007/s13258-018-0773-2
url http://hdl.handle.net/10174/25325
https://doi.org/10.1007/s13258-018-0773-2
dc.language.iso.fl_str_mv por
language por
dc.relation.none.fl_str_mv Bhowmick, S.S., Bhattacharjee, D. & Rato, L., Identification of tissue‐specific tumor biomarker using different optimization algorithms, Genes and Genomics, The Genetics Society of Korea, Springer, 2018.
nd
nd
lmr@uevora.pt
498
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.publisher.none.fl_str_mv Springer
publisher.none.fl_str_mv Springer
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799136640002686976