Breast Cancer Prediction Using Dominance-based Feature Filtering Approach: A Comparative Investigation in Machine Learning Archetype

Detalhes bibliográficos
Autor(a) principal: Atrey,Kushangi
Data de Publicação: 2019
Outros Autores: Sharma,Yogesh, Bodhey,Narendra K., Singh,Bikesh Kumar
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Brazilian Archives of Biology and Technology
Texto Completo: http://old.scielo.br/scielo.php?script=sci_arttext&pid=S1516-89132019000100611
Resumo: Abstract Breast cancer is the most commonly witnessed cancer amongst women around the world. Computer aided diagnosis (CAD) have been playing a significant role in early detection of breast tumors hence to curb the overall mortality rate. This work presents an enhanced empirical study of impact of dominance-based filtering approach on performances of various state-of-the-art classifiers. The feature dominance level is proportional to the difference in means of benign and malignant tumors. The experiments were done on original Wisconsin Breast Cancer Dataset (WBCD) with total nine features. It is found that the classifiers’ performances for top 4 and top 5 dominant-based features are almost equivalent to performances for all nine features. Artificial neural network (ANN) is come forth as the best performing classifier among all with accuracies of 98.9% and 99.6% for top 4 and top 5 dominant features respectively. The error rate of ANN between all nine and top 4 &5 dominant features is less than 2% for four performance evaluation parameters namely sensitivity, specificity, accuracy and AUC. Thus, it can be stated that the dominance-based filtering approach is appropriate for selecting a sound set of features from the feature pool, consequently, helps to reduce computation time with no deterioration in classifier’s performance.
id TECPAR-1_0ed7f5ca9329c147f21b56006ef3b0c9
oai_identifier_str oai:scielo:S1516-89132019000100611
network_acronym_str TECPAR-1
network_name_str Brazilian Archives of Biology and Technology
repository_id_str
spelling Breast Cancer Prediction Using Dominance-based Feature Filtering Approach: A Comparative Investigation in Machine Learning ArchetypeBreast cancerComputer aided diagnosisDominance-based filteringMachine learning Abstract Breast cancer is the most commonly witnessed cancer amongst women around the world. Computer aided diagnosis (CAD) have been playing a significant role in early detection of breast tumors hence to curb the overall mortality rate. This work presents an enhanced empirical study of impact of dominance-based filtering approach on performances of various state-of-the-art classifiers. The feature dominance level is proportional to the difference in means of benign and malignant tumors. The experiments were done on original Wisconsin Breast Cancer Dataset (WBCD) with total nine features. It is found that the classifiers’ performances for top 4 and top 5 dominant-based features are almost equivalent to performances for all nine features. Artificial neural network (ANN) is come forth as the best performing classifier among all with accuracies of 98.9% and 99.6% for top 4 and top 5 dominant features respectively. The error rate of ANN between all nine and top 4 &5 dominant features is less than 2% for four performance evaluation parameters namely sensitivity, specificity, accuracy and AUC. Thus, it can be stated that the dominance-based filtering approach is appropriate for selecting a sound set of features from the feature pool, consequently, helps to reduce computation time with no deterioration in classifier’s performance.Instituto de Tecnologia do Paraná - Tecpar2019-01-01info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersiontext/htmlhttp://old.scielo.br/scielo.php?script=sci_arttext&pid=S1516-89132019000100611Brazilian Archives of Biology and Technology v.62 2019reponame:Brazilian Archives of Biology and Technologyinstname:Instituto de Tecnologia do Paraná (Tecpar)instacron:TECPAR10.1590/1678-4324-2019180486info:eu-repo/semantics/openAccessAtrey,KushangiSharma,YogeshBodhey,Narendra K.Singh,Bikesh Kumareng2020-01-31T00:00:00Zoai:scielo:S1516-89132019000100611Revistahttps://www.scielo.br/j/babt/https://old.scielo.br/oai/scielo-oai.phpbabt@tecpar.br||babt@tecpar.br1678-43241516-8913opendoar:2020-01-31T00:00Brazilian Archives of Biology and Technology - Instituto de Tecnologia do Paraná (Tecpar)false
dc.title.none.fl_str_mv Breast Cancer Prediction Using Dominance-based Feature Filtering Approach: A Comparative Investigation in Machine Learning Archetype
title Breast Cancer Prediction Using Dominance-based Feature Filtering Approach: A Comparative Investigation in Machine Learning Archetype
spellingShingle Breast Cancer Prediction Using Dominance-based Feature Filtering Approach: A Comparative Investigation in Machine Learning Archetype
Atrey,Kushangi
Breast cancer
Computer aided diagnosis
Dominance-based filtering
Machine learning
title_short Breast Cancer Prediction Using Dominance-based Feature Filtering Approach: A Comparative Investigation in Machine Learning Archetype
title_full Breast Cancer Prediction Using Dominance-based Feature Filtering Approach: A Comparative Investigation in Machine Learning Archetype
title_fullStr Breast Cancer Prediction Using Dominance-based Feature Filtering Approach: A Comparative Investigation in Machine Learning Archetype
title_full_unstemmed Breast Cancer Prediction Using Dominance-based Feature Filtering Approach: A Comparative Investigation in Machine Learning Archetype
title_sort Breast Cancer Prediction Using Dominance-based Feature Filtering Approach: A Comparative Investigation in Machine Learning Archetype
author Atrey,Kushangi
author_facet Atrey,Kushangi
Sharma,Yogesh
Bodhey,Narendra K.
Singh,Bikesh Kumar
author_role author
author2 Sharma,Yogesh
Bodhey,Narendra K.
Singh,Bikesh Kumar
author2_role author
author
author
dc.contributor.author.fl_str_mv Atrey,Kushangi
Sharma,Yogesh
Bodhey,Narendra K.
Singh,Bikesh Kumar
dc.subject.por.fl_str_mv Breast cancer
Computer aided diagnosis
Dominance-based filtering
Machine learning
topic Breast cancer
Computer aided diagnosis
Dominance-based filtering
Machine learning
description Abstract Breast cancer is the most commonly witnessed cancer amongst women around the world. Computer aided diagnosis (CAD) have been playing a significant role in early detection of breast tumors hence to curb the overall mortality rate. This work presents an enhanced empirical study of impact of dominance-based filtering approach on performances of various state-of-the-art classifiers. The feature dominance level is proportional to the difference in means of benign and malignant tumors. The experiments were done on original Wisconsin Breast Cancer Dataset (WBCD) with total nine features. It is found that the classifiers’ performances for top 4 and top 5 dominant-based features are almost equivalent to performances for all nine features. Artificial neural network (ANN) is come forth as the best performing classifier among all with accuracies of 98.9% and 99.6% for top 4 and top 5 dominant features respectively. The error rate of ANN between all nine and top 4 &5 dominant features is less than 2% for four performance evaluation parameters namely sensitivity, specificity, accuracy and AUC. Thus, it can be stated that the dominance-based filtering approach is appropriate for selecting a sound set of features from the feature pool, consequently, helps to reduce computation time with no deterioration in classifier’s performance.
publishDate 2019
dc.date.none.fl_str_mv 2019-01-01
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://old.scielo.br/scielo.php?script=sci_arttext&pid=S1516-89132019000100611
url http://old.scielo.br/scielo.php?script=sci_arttext&pid=S1516-89132019000100611
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 10.1590/1678-4324-2019180486
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv text/html
dc.publisher.none.fl_str_mv Instituto de Tecnologia do Paraná - Tecpar
publisher.none.fl_str_mv Instituto de Tecnologia do Paraná - Tecpar
dc.source.none.fl_str_mv Brazilian Archives of Biology and Technology v.62 2019
reponame:Brazilian Archives of Biology and Technology
instname:Instituto de Tecnologia do Paraná (Tecpar)
instacron:TECPAR
instname_str Instituto de Tecnologia do Paraná (Tecpar)
instacron_str TECPAR
institution TECPAR
reponame_str Brazilian Archives of Biology and Technology
collection Brazilian Archives of Biology and Technology
repository.name.fl_str_mv Brazilian Archives of Biology and Technology - Instituto de Tecnologia do Paraná (Tecpar)
repository.mail.fl_str_mv babt@tecpar.br||babt@tecpar.br
_version_ 1750318279569702912