Comparative Study on Data Mining Techniques Applied to Breast Cancer Gene Expression Profiles
Autor(a) principal: | |
---|---|
Data de Publicação: | 2017 |
Outros Autores: | , , , , |
Tipo de documento: | Artigo de conferência |
Idioma: | eng |
Título da fonte: | Repositório Institucional da UNESP |
Texto Completo: | http://dx.doi.org/10.5220/0006170201680175 http://hdl.handle.net/11449/165837 |
Resumo: | Breast cancer has the second highest incidence among all cancer types and is the fifth cause of cancer related death among women. In Brazil, breast cancer mortality rates have been rising. Cancer classification is intricate, mainly when differentiating subtypes. In this context, data mining becomes a fundamental tool to analyze genotypic data, improving diagnostics, treatment and patient care. As the data dimensionality is problematic, methods to reduce it must be applied. Hence, the present study aims at the analysis of two data mining methods (i.e., decision trees and artificial neural networks). Weka (R) and MATLAB (R) were used to implement these two methodologies. Decision trees appointed important genes for the classification. Optimal artificial neural network architecture consists of two layers, one with 99 neurons and the other with 5. Both data mining techniques were able to classify data with high accuracy. |
id |
UNSP_9bb6b3658ff7f1b30c6036083f844275 |
---|---|
oai_identifier_str |
oai:repositorio.unesp.br:11449/165837 |
network_acronym_str |
UNSP |
network_name_str |
Repositório Institucional da UNESP |
repository_id_str |
2946 |
spelling |
Comparative Study on Data Mining Techniques Applied to Breast Cancer Gene Expression ProfilesData MiningBreast CancerDecision TreesArtificial Neural NetworksBreast cancer has the second highest incidence among all cancer types and is the fifth cause of cancer related death among women. In Brazil, breast cancer mortality rates have been rising. Cancer classification is intricate, mainly when differentiating subtypes. In this context, data mining becomes a fundamental tool to analyze genotypic data, improving diagnostics, treatment and patient care. As the data dimensionality is problematic, methods to reduce it must be applied. Hence, the present study aims at the analysis of two data mining methods (i.e., decision trees and artificial neural networks). Weka (R) and MATLAB (R) were used to implement these two methodologies. Decision trees appointed important genes for the classification. Optimal artificial neural network architecture consists of two layers, one with 99 neurons and the other with 5. Both data mining techniques were able to classify data with high accuracy.Sao Paulo State Univ, Sch Sci Humanities & Languages, Av Dom Antonio 2100, Assis, SP, BrazilUppsala Univ, Uppsala, SwedenSao Paulo State Univ, Sch Sci Humanities & Languages, Av Dom Antonio 2100, Assis, SP, BrazilScitepressUniversidade Estadual Paulista (Unesp)Uppsala UnivMosquim Junior, Sergio [UNESP]Oliveira, Juliana de [UNESP]Ali, H.Fred, A.Gamboa, H.Vaz, M.2018-11-28T23:59:40Z2018-11-28T23:59:40Z2017-01-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/conferenceObject168-175http://dx.doi.org/10.5220/0006170201680175Proceedings Of The 10th International Joint Conference On Biomedical Engineering Systems And Technologies, Vol 3: Bioinformatics. Setubal: Scitepress, p. 168-175, 2017.http://hdl.handle.net/11449/16583710.5220/0006170201680175WOS:000413258500018Web of Sciencereponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengProceedings Of The 10th International Joint Conference On Biomedical Engineering Systems And Technologies, Vol 3: Bioinformaticsinfo:eu-repo/semantics/openAccess2024-06-13T17:39:07Zoai:repositorio.unesp.br:11449/165837Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462024-08-05T20:47:36.463322Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false |
dc.title.none.fl_str_mv |
Comparative Study on Data Mining Techniques Applied to Breast Cancer Gene Expression Profiles |
title |
Comparative Study on Data Mining Techniques Applied to Breast Cancer Gene Expression Profiles |
spellingShingle |
Comparative Study on Data Mining Techniques Applied to Breast Cancer Gene Expression Profiles Mosquim Junior, Sergio [UNESP] Data Mining Breast Cancer Decision Trees Artificial Neural Networks |
title_short |
Comparative Study on Data Mining Techniques Applied to Breast Cancer Gene Expression Profiles |
title_full |
Comparative Study on Data Mining Techniques Applied to Breast Cancer Gene Expression Profiles |
title_fullStr |
Comparative Study on Data Mining Techniques Applied to Breast Cancer Gene Expression Profiles |
title_full_unstemmed |
Comparative Study on Data Mining Techniques Applied to Breast Cancer Gene Expression Profiles |
title_sort |
Comparative Study on Data Mining Techniques Applied to Breast Cancer Gene Expression Profiles |
author |
Mosquim Junior, Sergio [UNESP] |
author_facet |
Mosquim Junior, Sergio [UNESP] Oliveira, Juliana de [UNESP] Ali, H. Fred, A. Gamboa, H. Vaz, M. |
author_role |
author |
author2 |
Oliveira, Juliana de [UNESP] Ali, H. Fred, A. Gamboa, H. Vaz, M. |
author2_role |
author author author author author |
dc.contributor.none.fl_str_mv |
Universidade Estadual Paulista (Unesp) Uppsala Univ |
dc.contributor.author.fl_str_mv |
Mosquim Junior, Sergio [UNESP] Oliveira, Juliana de [UNESP] Ali, H. Fred, A. Gamboa, H. Vaz, M. |
dc.subject.por.fl_str_mv |
Data Mining Breast Cancer Decision Trees Artificial Neural Networks |
topic |
Data Mining Breast Cancer Decision Trees Artificial Neural Networks |
description |
Breast cancer has the second highest incidence among all cancer types and is the fifth cause of cancer related death among women. In Brazil, breast cancer mortality rates have been rising. Cancer classification is intricate, mainly when differentiating subtypes. In this context, data mining becomes a fundamental tool to analyze genotypic data, improving diagnostics, treatment and patient care. As the data dimensionality is problematic, methods to reduce it must be applied. Hence, the present study aims at the analysis of two data mining methods (i.e., decision trees and artificial neural networks). Weka (R) and MATLAB (R) were used to implement these two methodologies. Decision trees appointed important genes for the classification. Optimal artificial neural network architecture consists of two layers, one with 99 neurons and the other with 5. Both data mining techniques were able to classify data with high accuracy. |
publishDate |
2017 |
dc.date.none.fl_str_mv |
2017-01-01 2018-11-28T23:59:40Z 2018-11-28T23:59:40Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/conferenceObject |
format |
conferenceObject |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://dx.doi.org/10.5220/0006170201680175 Proceedings Of The 10th International Joint Conference On Biomedical Engineering Systems And Technologies, Vol 3: Bioinformatics. Setubal: Scitepress, p. 168-175, 2017. http://hdl.handle.net/11449/165837 10.5220/0006170201680175 WOS:000413258500018 |
url |
http://dx.doi.org/10.5220/0006170201680175 http://hdl.handle.net/11449/165837 |
identifier_str_mv |
Proceedings Of The 10th International Joint Conference On Biomedical Engineering Systems And Technologies, Vol 3: Bioinformatics. Setubal: Scitepress, p. 168-175, 2017. 10.5220/0006170201680175 WOS:000413258500018 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
Proceedings Of The 10th International Joint Conference On Biomedical Engineering Systems And Technologies, Vol 3: Bioinformatics |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
168-175 |
dc.publisher.none.fl_str_mv |
Scitepress |
publisher.none.fl_str_mv |
Scitepress |
dc.source.none.fl_str_mv |
Web of Science reponame:Repositório Institucional da UNESP instname:Universidade Estadual Paulista (UNESP) instacron:UNESP |
instname_str |
Universidade Estadual Paulista (UNESP) |
instacron_str |
UNESP |
institution |
UNESP |
reponame_str |
Repositório Institucional da UNESP |
collection |
Repositório Institucional da UNESP |
repository.name.fl_str_mv |
Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP) |
repository.mail.fl_str_mv |
|
_version_ |
1808129249152860160 |