Comparative Study on Data Mining Techniques Applied to Breast Cancer Gene Expression Profiles

Detalhes bibliográficos
Autor(a) principal: Mosquim Junior, Sergio [UNESP]
Data de Publicação: 2017
Outros Autores: Oliveira, Juliana de [UNESP], Ali, H., Fred, A., Gamboa, H., Vaz, M.
Tipo de documento: Artigo de conferência
Idioma: eng
Título da fonte: Repositório Institucional da UNESP
Texto Completo: http://dx.doi.org/10.5220/0006170201680175
http://hdl.handle.net/11449/165837
Resumo: Breast cancer has the second highest incidence among all cancer types and is the fifth cause of cancer related death among women. In Brazil, breast cancer mortality rates have been rising. Cancer classification is intricate, mainly when differentiating subtypes. In this context, data mining becomes a fundamental tool to analyze genotypic data, improving diagnostics, treatment and patient care. As the data dimensionality is problematic, methods to reduce it must be applied. Hence, the present study aims at the analysis of two data mining methods (i.e., decision trees and artificial neural networks). Weka (R) and MATLAB (R) were used to implement these two methodologies. Decision trees appointed important genes for the classification. Optimal artificial neural network architecture consists of two layers, one with 99 neurons and the other with 5. Both data mining techniques were able to classify data with high accuracy.
id UNSP_9bb6b3658ff7f1b30c6036083f844275
oai_identifier_str oai:repositorio.unesp.br:11449/165837
network_acronym_str UNSP
network_name_str Repositório Institucional da UNESP
repository_id_str 2946
spelling Comparative Study on Data Mining Techniques Applied to Breast Cancer Gene Expression ProfilesData MiningBreast CancerDecision TreesArtificial Neural NetworksBreast cancer has the second highest incidence among all cancer types and is the fifth cause of cancer related death among women. In Brazil, breast cancer mortality rates have been rising. Cancer classification is intricate, mainly when differentiating subtypes. In this context, data mining becomes a fundamental tool to analyze genotypic data, improving diagnostics, treatment and patient care. As the data dimensionality is problematic, methods to reduce it must be applied. Hence, the present study aims at the analysis of two data mining methods (i.e., decision trees and artificial neural networks). Weka (R) and MATLAB (R) were used to implement these two methodologies. Decision trees appointed important genes for the classification. Optimal artificial neural network architecture consists of two layers, one with 99 neurons and the other with 5. Both data mining techniques were able to classify data with high accuracy.Sao Paulo State Univ, Sch Sci Humanities & Languages, Av Dom Antonio 2100, Assis, SP, BrazilUppsala Univ, Uppsala, SwedenSao Paulo State Univ, Sch Sci Humanities & Languages, Av Dom Antonio 2100, Assis, SP, BrazilScitepressUniversidade Estadual Paulista (Unesp)Uppsala UnivMosquim Junior, Sergio [UNESP]Oliveira, Juliana de [UNESP]Ali, H.Fred, A.Gamboa, H.Vaz, M.2018-11-28T23:59:40Z2018-11-28T23:59:40Z2017-01-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/conferenceObject168-175http://dx.doi.org/10.5220/0006170201680175Proceedings Of The 10th International Joint Conference On Biomedical Engineering Systems And Technologies, Vol 3: Bioinformatics. Setubal: Scitepress, p. 168-175, 2017.http://hdl.handle.net/11449/16583710.5220/0006170201680175WOS:000413258500018Web of Sciencereponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengProceedings Of The 10th International Joint Conference On Biomedical Engineering Systems And Technologies, Vol 3: Bioinformaticsinfo:eu-repo/semantics/openAccess2024-06-13T17:39:07Zoai:repositorio.unesp.br:11449/165837Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462024-08-05T20:47:36.463322Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false
dc.title.none.fl_str_mv Comparative Study on Data Mining Techniques Applied to Breast Cancer Gene Expression Profiles
title Comparative Study on Data Mining Techniques Applied to Breast Cancer Gene Expression Profiles
spellingShingle Comparative Study on Data Mining Techniques Applied to Breast Cancer Gene Expression Profiles
Mosquim Junior, Sergio [UNESP]
Data Mining
Breast Cancer
Decision Trees
Artificial Neural Networks
title_short Comparative Study on Data Mining Techniques Applied to Breast Cancer Gene Expression Profiles
title_full Comparative Study on Data Mining Techniques Applied to Breast Cancer Gene Expression Profiles
title_fullStr Comparative Study on Data Mining Techniques Applied to Breast Cancer Gene Expression Profiles
title_full_unstemmed Comparative Study on Data Mining Techniques Applied to Breast Cancer Gene Expression Profiles
title_sort Comparative Study on Data Mining Techniques Applied to Breast Cancer Gene Expression Profiles
author Mosquim Junior, Sergio [UNESP]
author_facet Mosquim Junior, Sergio [UNESP]
Oliveira, Juliana de [UNESP]
Ali, H.
Fred, A.
Gamboa, H.
Vaz, M.
author_role author
author2 Oliveira, Juliana de [UNESP]
Ali, H.
Fred, A.
Gamboa, H.
Vaz, M.
author2_role author
author
author
author
author
dc.contributor.none.fl_str_mv Universidade Estadual Paulista (Unesp)
Uppsala Univ
dc.contributor.author.fl_str_mv Mosquim Junior, Sergio [UNESP]
Oliveira, Juliana de [UNESP]
Ali, H.
Fred, A.
Gamboa, H.
Vaz, M.
dc.subject.por.fl_str_mv Data Mining
Breast Cancer
Decision Trees
Artificial Neural Networks
topic Data Mining
Breast Cancer
Decision Trees
Artificial Neural Networks
description Breast cancer has the second highest incidence among all cancer types and is the fifth cause of cancer related death among women. In Brazil, breast cancer mortality rates have been rising. Cancer classification is intricate, mainly when differentiating subtypes. In this context, data mining becomes a fundamental tool to analyze genotypic data, improving diagnostics, treatment and patient care. As the data dimensionality is problematic, methods to reduce it must be applied. Hence, the present study aims at the analysis of two data mining methods (i.e., decision trees and artificial neural networks). Weka (R) and MATLAB (R) were used to implement these two methodologies. Decision trees appointed important genes for the classification. Optimal artificial neural network architecture consists of two layers, one with 99 neurons and the other with 5. Both data mining techniques were able to classify data with high accuracy.
publishDate 2017
dc.date.none.fl_str_mv 2017-01-01
2018-11-28T23:59:40Z
2018-11-28T23:59:40Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/conferenceObject
format conferenceObject
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://dx.doi.org/10.5220/0006170201680175
Proceedings Of The 10th International Joint Conference On Biomedical Engineering Systems And Technologies, Vol 3: Bioinformatics. Setubal: Scitepress, p. 168-175, 2017.
http://hdl.handle.net/11449/165837
10.5220/0006170201680175
WOS:000413258500018
url http://dx.doi.org/10.5220/0006170201680175
http://hdl.handle.net/11449/165837
identifier_str_mv Proceedings Of The 10th International Joint Conference On Biomedical Engineering Systems And Technologies, Vol 3: Bioinformatics. Setubal: Scitepress, p. 168-175, 2017.
10.5220/0006170201680175
WOS:000413258500018
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Proceedings Of The 10th International Joint Conference On Biomedical Engineering Systems And Technologies, Vol 3: Bioinformatics
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv 168-175
dc.publisher.none.fl_str_mv Scitepress
publisher.none.fl_str_mv Scitepress
dc.source.none.fl_str_mv Web of Science
reponame:Repositório Institucional da UNESP
instname:Universidade Estadual Paulista (UNESP)
instacron:UNESP
instname_str Universidade Estadual Paulista (UNESP)
instacron_str UNESP
institution UNESP
reponame_str Repositório Institucional da UNESP
collection Repositório Institucional da UNESP
repository.name.fl_str_mv Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)
repository.mail.fl_str_mv
_version_ 1808129249152860160