Breast Cancer Prediction Using Dominance-based Feature Filtering Approach: A Comparative Investigation in Machine Learning Archetype
Autor(a) principal: | |
---|---|
Data de Publicação: | 2019 |
Outros Autores: | , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Brazilian Archives of Biology and Technology |
Texto Completo: | http://old.scielo.br/scielo.php?script=sci_arttext&pid=S1516-89132019000100611 |
Resumo: | Abstract Breast cancer is the most commonly witnessed cancer amongst women around the world. Computer aided diagnosis (CAD) have been playing a significant role in early detection of breast tumors hence to curb the overall mortality rate. This work presents an enhanced empirical study of impact of dominance-based filtering approach on performances of various state-of-the-art classifiers. The feature dominance level is proportional to the difference in means of benign and malignant tumors. The experiments were done on original Wisconsin Breast Cancer Dataset (WBCD) with total nine features. It is found that the classifiers’ performances for top 4 and top 5 dominant-based features are almost equivalent to performances for all nine features. Artificial neural network (ANN) is come forth as the best performing classifier among all with accuracies of 98.9% and 99.6% for top 4 and top 5 dominant features respectively. The error rate of ANN between all nine and top 4 &5 dominant features is less than 2% for four performance evaluation parameters namely sensitivity, specificity, accuracy and AUC. Thus, it can be stated that the dominance-based filtering approach is appropriate for selecting a sound set of features from the feature pool, consequently, helps to reduce computation time with no deterioration in classifier’s performance. |
id |
TECPAR-1_0ed7f5ca9329c147f21b56006ef3b0c9 |
---|---|
oai_identifier_str |
oai:scielo:S1516-89132019000100611 |
network_acronym_str |
TECPAR-1 |
network_name_str |
Brazilian Archives of Biology and Technology |
repository_id_str |
|
spelling |
Breast Cancer Prediction Using Dominance-based Feature Filtering Approach: A Comparative Investigation in Machine Learning ArchetypeBreast cancerComputer aided diagnosisDominance-based filteringMachine learning Abstract Breast cancer is the most commonly witnessed cancer amongst women around the world. Computer aided diagnosis (CAD) have been playing a significant role in early detection of breast tumors hence to curb the overall mortality rate. This work presents an enhanced empirical study of impact of dominance-based filtering approach on performances of various state-of-the-art classifiers. The feature dominance level is proportional to the difference in means of benign and malignant tumors. The experiments were done on original Wisconsin Breast Cancer Dataset (WBCD) with total nine features. It is found that the classifiers’ performances for top 4 and top 5 dominant-based features are almost equivalent to performances for all nine features. Artificial neural network (ANN) is come forth as the best performing classifier among all with accuracies of 98.9% and 99.6% for top 4 and top 5 dominant features respectively. The error rate of ANN between all nine and top 4 &5 dominant features is less than 2% for four performance evaluation parameters namely sensitivity, specificity, accuracy and AUC. Thus, it can be stated that the dominance-based filtering approach is appropriate for selecting a sound set of features from the feature pool, consequently, helps to reduce computation time with no deterioration in classifier’s performance.Instituto de Tecnologia do Paraná - Tecpar2019-01-01info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersiontext/htmlhttp://old.scielo.br/scielo.php?script=sci_arttext&pid=S1516-89132019000100611Brazilian Archives of Biology and Technology v.62 2019reponame:Brazilian Archives of Biology and Technologyinstname:Instituto de Tecnologia do Paraná (Tecpar)instacron:TECPAR10.1590/1678-4324-2019180486info:eu-repo/semantics/openAccessAtrey,KushangiSharma,YogeshBodhey,Narendra K.Singh,Bikesh Kumareng2020-01-31T00:00:00Zoai:scielo:S1516-89132019000100611Revistahttps://www.scielo.br/j/babt/https://old.scielo.br/oai/scielo-oai.phpbabt@tecpar.br||babt@tecpar.br1678-43241516-8913opendoar:2020-01-31T00:00Brazilian Archives of Biology and Technology - Instituto de Tecnologia do Paraná (Tecpar)false |
dc.title.none.fl_str_mv |
Breast Cancer Prediction Using Dominance-based Feature Filtering Approach: A Comparative Investigation in Machine Learning Archetype |
title |
Breast Cancer Prediction Using Dominance-based Feature Filtering Approach: A Comparative Investigation in Machine Learning Archetype |
spellingShingle |
Breast Cancer Prediction Using Dominance-based Feature Filtering Approach: A Comparative Investigation in Machine Learning Archetype Atrey,Kushangi Breast cancer Computer aided diagnosis Dominance-based filtering Machine learning |
title_short |
Breast Cancer Prediction Using Dominance-based Feature Filtering Approach: A Comparative Investigation in Machine Learning Archetype |
title_full |
Breast Cancer Prediction Using Dominance-based Feature Filtering Approach: A Comparative Investigation in Machine Learning Archetype |
title_fullStr |
Breast Cancer Prediction Using Dominance-based Feature Filtering Approach: A Comparative Investigation in Machine Learning Archetype |
title_full_unstemmed |
Breast Cancer Prediction Using Dominance-based Feature Filtering Approach: A Comparative Investigation in Machine Learning Archetype |
title_sort |
Breast Cancer Prediction Using Dominance-based Feature Filtering Approach: A Comparative Investigation in Machine Learning Archetype |
author |
Atrey,Kushangi |
author_facet |
Atrey,Kushangi Sharma,Yogesh Bodhey,Narendra K. Singh,Bikesh Kumar |
author_role |
author |
author2 |
Sharma,Yogesh Bodhey,Narendra K. Singh,Bikesh Kumar |
author2_role |
author author author |
dc.contributor.author.fl_str_mv |
Atrey,Kushangi Sharma,Yogesh Bodhey,Narendra K. Singh,Bikesh Kumar |
dc.subject.por.fl_str_mv |
Breast cancer Computer aided diagnosis Dominance-based filtering Machine learning |
topic |
Breast cancer Computer aided diagnosis Dominance-based filtering Machine learning |
description |
Abstract Breast cancer is the most commonly witnessed cancer amongst women around the world. Computer aided diagnosis (CAD) have been playing a significant role in early detection of breast tumors hence to curb the overall mortality rate. This work presents an enhanced empirical study of impact of dominance-based filtering approach on performances of various state-of-the-art classifiers. The feature dominance level is proportional to the difference in means of benign and malignant tumors. The experiments were done on original Wisconsin Breast Cancer Dataset (WBCD) with total nine features. It is found that the classifiers’ performances for top 4 and top 5 dominant-based features are almost equivalent to performances for all nine features. Artificial neural network (ANN) is come forth as the best performing classifier among all with accuracies of 98.9% and 99.6% for top 4 and top 5 dominant features respectively. The error rate of ANN between all nine and top 4 &5 dominant features is less than 2% for four performance evaluation parameters namely sensitivity, specificity, accuracy and AUC. Thus, it can be stated that the dominance-based filtering approach is appropriate for selecting a sound set of features from the feature pool, consequently, helps to reduce computation time with no deterioration in classifier’s performance. |
publishDate |
2019 |
dc.date.none.fl_str_mv |
2019-01-01 |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://old.scielo.br/scielo.php?script=sci_arttext&pid=S1516-89132019000100611 |
url |
http://old.scielo.br/scielo.php?script=sci_arttext&pid=S1516-89132019000100611 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
10.1590/1678-4324-2019180486 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
text/html |
dc.publisher.none.fl_str_mv |
Instituto de Tecnologia do Paraná - Tecpar |
publisher.none.fl_str_mv |
Instituto de Tecnologia do Paraná - Tecpar |
dc.source.none.fl_str_mv |
Brazilian Archives of Biology and Technology v.62 2019 reponame:Brazilian Archives of Biology and Technology instname:Instituto de Tecnologia do Paraná (Tecpar) instacron:TECPAR |
instname_str |
Instituto de Tecnologia do Paraná (Tecpar) |
instacron_str |
TECPAR |
institution |
TECPAR |
reponame_str |
Brazilian Archives of Biology and Technology |
collection |
Brazilian Archives of Biology and Technology |
repository.name.fl_str_mv |
Brazilian Archives of Biology and Technology - Instituto de Tecnologia do Paraná (Tecpar) |
repository.mail.fl_str_mv |
babt@tecpar.br||babt@tecpar.br |
_version_ |
1750318279569702912 |