Data mining for cardiovascular disease prediction

Detalhes bibliográficos
Autor(a) principal: Martins, Barbara
Data de Publicação: 2021
Outros Autores: Ferreira, Diana, Neto, Cristiana, Abelha, António, Machado, José Manuel
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: https://hdl.handle.net/1822/78014
Resumo: Cardiovascular diseases (CVDs) aredisorders of the heart and blood vessels and are a major cause of disability and premature death worldwide. Individuals at higher risk of developing CVD must be noticed at an early stage to prevent premature deaths. Advances in the field of computational intelligence, together with the vast amount of data produced daily in clinical settings, have made it possible to create recognition systems capable of identifying hidden patterns and useful information. This paper focuses on the application of Data Mining Techniques (DMTs) to clinical data collected during the medical examination in an attempt to predict whether or not an individual has a CVD. To this end, the CRossIndustry Standard Process for Data Mining (CRISP-DM) methodology was followed, in which five classifiers were applied, namely DT, Optimized DT, RI, RF, and DL. The models were mainly developed using the RapidMiner software with the assist of the WEKA tool and were analyzed based on accuracy, precision, sensitivity, and specificity. The results obtained were considered promising on the basis of the research for effective means of diagnosing CVD, with the best model being Optimized DT, which achieved the highest values for all the evaluation metrics, 73.54%, 75.82%, 68.89%, 78.16% and 0.788 for accuracy, precision, sensitivity, specificity, and AUC, respectively.
id RCAP_69724d3e404b601fa4f62d034e049ed3
oai_identifier_str oai:repositorium.sdum.uminho.pt:1822/78014
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Data mining for cardiovascular disease predictionCardiovascular diseaseHealth information systemsDecision support systemsData miningCRISP-DMClassificationScience & TechnologyCardiovascular diseases (CVDs) aredisorders of the heart and blood vessels and are a major cause of disability and premature death worldwide. Individuals at higher risk of developing CVD must be noticed at an early stage to prevent premature deaths. Advances in the field of computational intelligence, together with the vast amount of data produced daily in clinical settings, have made it possible to create recognition systems capable of identifying hidden patterns and useful information. This paper focuses on the application of Data Mining Techniques (DMTs) to clinical data collected during the medical examination in an attempt to predict whether or not an individual has a CVD. To this end, the CRossIndustry Standard Process for Data Mining (CRISP-DM) methodology was followed, in which five classifiers were applied, namely DT, Optimized DT, RI, RF, and DL. The models were mainly developed using the RapidMiner software with the assist of the WEKA tool and were analyzed based on accuracy, precision, sensitivity, and specificity. The results obtained were considered promising on the basis of the research for effective means of diagnosing CVD, with the best model being Optimized DT, which achieved the highest values for all the evaluation metrics, 73.54%, 75.82%, 68.89%, 78.16% and 0.788 for accuracy, precision, sensitivity, specificity, and AUC, respectively.This work has been supported by FCT - Fundacao para a Ciencia e Tecnologia within the R&D Units Project Scope: UIDB/00319/2020.SpringerUniversidade do MinhoMartins, BarbaraFerreira, DianaNeto, CristianaAbelha, AntónioMachado, José Manuel2021-01-052021-01-05T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttps://hdl.handle.net/1822/78014engMartins, B., Ferreira, D., Neto, C. et al. Data Mining for Cardiovascular Disease Prediction. J Med Syst 45, 6 (2021). https://doi.org/10.1007/s10916-020-01682-80148-559810.1007/s10916-020-01682-833404894https://link.springer.com/article/10.1007/s10916-020-01682-8info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-07-21T12:04:39Zoai:repositorium.sdum.uminho.pt:1822/78014Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T18:54:57.567796Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Data mining for cardiovascular disease prediction
title Data mining for cardiovascular disease prediction
spellingShingle Data mining for cardiovascular disease prediction
Martins, Barbara
Cardiovascular disease
Health information systems
Decision support systems
Data mining
CRISP-DM
Classification
Science & Technology
title_short Data mining for cardiovascular disease prediction
title_full Data mining for cardiovascular disease prediction
title_fullStr Data mining for cardiovascular disease prediction
title_full_unstemmed Data mining for cardiovascular disease prediction
title_sort Data mining for cardiovascular disease prediction
author Martins, Barbara
author_facet Martins, Barbara
Ferreira, Diana
Neto, Cristiana
Abelha, António
Machado, José Manuel
author_role author
author2 Ferreira, Diana
Neto, Cristiana
Abelha, António
Machado, José Manuel
author2_role author
author
author
author
dc.contributor.none.fl_str_mv Universidade do Minho
dc.contributor.author.fl_str_mv Martins, Barbara
Ferreira, Diana
Neto, Cristiana
Abelha, António
Machado, José Manuel
dc.subject.por.fl_str_mv Cardiovascular disease
Health information systems
Decision support systems
Data mining
CRISP-DM
Classification
Science & Technology
topic Cardiovascular disease
Health information systems
Decision support systems
Data mining
CRISP-DM
Classification
Science & Technology
description Cardiovascular diseases (CVDs) aredisorders of the heart and blood vessels and are a major cause of disability and premature death worldwide. Individuals at higher risk of developing CVD must be noticed at an early stage to prevent premature deaths. Advances in the field of computational intelligence, together with the vast amount of data produced daily in clinical settings, have made it possible to create recognition systems capable of identifying hidden patterns and useful information. This paper focuses on the application of Data Mining Techniques (DMTs) to clinical data collected during the medical examination in an attempt to predict whether or not an individual has a CVD. To this end, the CRossIndustry Standard Process for Data Mining (CRISP-DM) methodology was followed, in which five classifiers were applied, namely DT, Optimized DT, RI, RF, and DL. The models were mainly developed using the RapidMiner software with the assist of the WEKA tool and were analyzed based on accuracy, precision, sensitivity, and specificity. The results obtained were considered promising on the basis of the research for effective means of diagnosing CVD, with the best model being Optimized DT, which achieved the highest values for all the evaluation metrics, 73.54%, 75.82%, 68.89%, 78.16% and 0.788 for accuracy, precision, sensitivity, specificity, and AUC, respectively.
publishDate 2021
dc.date.none.fl_str_mv 2021-01-05
2021-01-05T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://hdl.handle.net/1822/78014
url https://hdl.handle.net/1822/78014
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Martins, B., Ferreira, D., Neto, C. et al. Data Mining for Cardiovascular Disease Prediction. J Med Syst 45, 6 (2021). https://doi.org/10.1007/s10916-020-01682-8
0148-5598
10.1007/s10916-020-01682-8
33404894
https://link.springer.com/article/10.1007/s10916-020-01682-8
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Springer
publisher.none.fl_str_mv Springer
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799132333192773632