Early prediction of acute respiratory distress syndrome complicated by acute pancreatitis based on four machine learning models

Detalhes bibliográficos
Autor(a) principal: Zhang, Mengran
Data de Publicação: 2023
Outros Autores: Pang, Mingge
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Clinics
Texto Completo: https://www.revistas.usp.br/clinics/article/view/213996
Resumo: Background: Acute Respiratory Distress syndrome (ARDS) is a common complication of Acute Pancreatitis (AP) and is associated with high mortality. This study used Machine Learning (ML) to predict ARDS in patients with AP at admission. Methods: The authors retrospectively analyzed the data from patients with AP from January 2017 to August 2022. Clinical and laboratory parameters with significant differences between patients with and without ARDS were screened by univariate analysis. Then, Support Vector Machine (SVM), Ensembles of Decision Trees (EDTs), Bayesian Classifier (BC), and nomogram models were constructed and optimized after feature screening based on these parameters. Five-fold cross-validation was used to train each model. A test set was used to evaluate the predictive performance of the four models. Results: A total of 83 (18.04%) of 460 patients with AP developed ARDS. Thirty-one features with significant differences between the groups with and without ARDS in the training set were used for modeling. The Partial Pressure of Oxygen (PaO2), C-reactive protein, procalcitonin, lactic acid, Ca2+, the neutrophil:lymphocyte ratio, white blood cell count, and amylase were identified as the optimal subset of features. The BC algorithm had the best predictive performance with the highest AUC value (0.891) than SVM (0.870), EDTs (0.813), and the nomogram (0.874) in the test set. The EDT algorithm achieved the highest accuracy (0.891), precision (0.800), and F1 score (0.615), but the lowest FDR (0.200) and the second-highest NPV (0.902). Conclusions: A predictive model of ARDS complicated by AP was successfully developed based on ML. Predictive performance was evaluated by a test set, for which BC showed superior predictive performance and EDTs could be a more promising prediction tool for larger samples.
id USP-19_ef848914e9d03485c42e57c12b01c4d4
oai_identifier_str oai:revistas.usp.br:article/213996
network_acronym_str USP-19
network_name_str Clinics
repository_id_str
spelling Early prediction of acute respiratory distress syndrome complicated by acute pancreatitis based on four machine learning modelsAcute respiratory distress syndromeAcute pancreatitisMachine learningPrediction modelBackground: Acute Respiratory Distress syndrome (ARDS) is a common complication of Acute Pancreatitis (AP) and is associated with high mortality. This study used Machine Learning (ML) to predict ARDS in patients with AP at admission. Methods: The authors retrospectively analyzed the data from patients with AP from January 2017 to August 2022. Clinical and laboratory parameters with significant differences between patients with and without ARDS were screened by univariate analysis. Then, Support Vector Machine (SVM), Ensembles of Decision Trees (EDTs), Bayesian Classifier (BC), and nomogram models were constructed and optimized after feature screening based on these parameters. Five-fold cross-validation was used to train each model. A test set was used to evaluate the predictive performance of the four models. Results: A total of 83 (18.04%) of 460 patients with AP developed ARDS. Thirty-one features with significant differences between the groups with and without ARDS in the training set were used for modeling. The Partial Pressure of Oxygen (PaO2), C-reactive protein, procalcitonin, lactic acid, Ca2+, the neutrophil:lymphocyte ratio, white blood cell count, and amylase were identified as the optimal subset of features. The BC algorithm had the best predictive performance with the highest AUC value (0.891) than SVM (0.870), EDTs (0.813), and the nomogram (0.874) in the test set. The EDT algorithm achieved the highest accuracy (0.891), precision (0.800), and F1 score (0.615), but the lowest FDR (0.200) and the second-highest NPV (0.902). Conclusions: A predictive model of ARDS complicated by AP was successfully developed based on ML. Predictive performance was evaluated by a test set, for which BC showed superior predictive performance and EDTs could be a more promising prediction tool for larger samples.Hospital das Clínicas, Faculdade de Medicina, Universidade de São Paulo2023-05-03info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionapplication/pdfhttps://www.revistas.usp.br/clinics/article/view/21399610.1016/j.clinsp.2023.100215Clinics; Vol. 78 (2023); 100215Clinics; v. 78 (2023); 100215Clinics; Vol. 78 (2023); 1002151980-53221807-5932reponame:Clinicsinstname:Universidade de São Paulo (USP)instacron:USPenghttps://www.revistas.usp.br/clinics/article/view/213996/196209Copyright (c) 2023 Clinicsinfo:eu-repo/semantics/openAccessZhang, MengranPang, Mingge2023-07-06T13:05:39Zoai:revistas.usp.br:article/213996Revistahttps://www.revistas.usp.br/clinicsPUBhttps://www.revistas.usp.br/clinics/oai||clinics@hc.fm.usp.br1980-53221807-5932opendoar:2023-07-06T13:05:39Clinics - Universidade de São Paulo (USP)false
dc.title.none.fl_str_mv Early prediction of acute respiratory distress syndrome complicated by acute pancreatitis based on four machine learning models
title Early prediction of acute respiratory distress syndrome complicated by acute pancreatitis based on four machine learning models
spellingShingle Early prediction of acute respiratory distress syndrome complicated by acute pancreatitis based on four machine learning models
Zhang, Mengran
Acute respiratory distress syndrome
Acute pancreatitis
Machine learning
Prediction model
title_short Early prediction of acute respiratory distress syndrome complicated by acute pancreatitis based on four machine learning models
title_full Early prediction of acute respiratory distress syndrome complicated by acute pancreatitis based on four machine learning models
title_fullStr Early prediction of acute respiratory distress syndrome complicated by acute pancreatitis based on four machine learning models
title_full_unstemmed Early prediction of acute respiratory distress syndrome complicated by acute pancreatitis based on four machine learning models
title_sort Early prediction of acute respiratory distress syndrome complicated by acute pancreatitis based on four machine learning models
author Zhang, Mengran
author_facet Zhang, Mengran
Pang, Mingge
author_role author
author2 Pang, Mingge
author2_role author
dc.contributor.author.fl_str_mv Zhang, Mengran
Pang, Mingge
dc.subject.por.fl_str_mv Acute respiratory distress syndrome
Acute pancreatitis
Machine learning
Prediction model
topic Acute respiratory distress syndrome
Acute pancreatitis
Machine learning
Prediction model
description Background: Acute Respiratory Distress syndrome (ARDS) is a common complication of Acute Pancreatitis (AP) and is associated with high mortality. This study used Machine Learning (ML) to predict ARDS in patients with AP at admission. Methods: The authors retrospectively analyzed the data from patients with AP from January 2017 to August 2022. Clinical and laboratory parameters with significant differences between patients with and without ARDS were screened by univariate analysis. Then, Support Vector Machine (SVM), Ensembles of Decision Trees (EDTs), Bayesian Classifier (BC), and nomogram models were constructed and optimized after feature screening based on these parameters. Five-fold cross-validation was used to train each model. A test set was used to evaluate the predictive performance of the four models. Results: A total of 83 (18.04%) of 460 patients with AP developed ARDS. Thirty-one features with significant differences between the groups with and without ARDS in the training set were used for modeling. The Partial Pressure of Oxygen (PaO2), C-reactive protein, procalcitonin, lactic acid, Ca2+, the neutrophil:lymphocyte ratio, white blood cell count, and amylase were identified as the optimal subset of features. The BC algorithm had the best predictive performance with the highest AUC value (0.891) than SVM (0.870), EDTs (0.813), and the nomogram (0.874) in the test set. The EDT algorithm achieved the highest accuracy (0.891), precision (0.800), and F1 score (0.615), but the lowest FDR (0.200) and the second-highest NPV (0.902). Conclusions: A predictive model of ARDS complicated by AP was successfully developed based on ML. Predictive performance was evaluated by a test set, for which BC showed superior predictive performance and EDTs could be a more promising prediction tool for larger samples.
publishDate 2023
dc.date.none.fl_str_mv 2023-05-03
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://www.revistas.usp.br/clinics/article/view/213996
10.1016/j.clinsp.2023.100215
url https://www.revistas.usp.br/clinics/article/view/213996
identifier_str_mv 10.1016/j.clinsp.2023.100215
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv https://www.revistas.usp.br/clinics/article/view/213996/196209
dc.rights.driver.fl_str_mv Copyright (c) 2023 Clinics
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Copyright (c) 2023 Clinics
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Hospital das Clínicas, Faculdade de Medicina, Universidade de São Paulo
publisher.none.fl_str_mv Hospital das Clínicas, Faculdade de Medicina, Universidade de São Paulo
dc.source.none.fl_str_mv Clinics; Vol. 78 (2023); 100215
Clinics; v. 78 (2023); 100215
Clinics; Vol. 78 (2023); 100215
1980-5322
1807-5932
reponame:Clinics
instname:Universidade de São Paulo (USP)
instacron:USP
instname_str Universidade de São Paulo (USP)
instacron_str USP
institution USP
reponame_str Clinics
collection Clinics
repository.name.fl_str_mv Clinics - Universidade de São Paulo (USP)
repository.mail.fl_str_mv ||clinics@hc.fm.usp.br
_version_ 1800222767382003712