Early prediction of acute respiratory distress syndrome complicated by acute pancreatitis based on four machine learning models
Autor(a) principal: | |
---|---|
Data de Publicação: | 2023 |
Outros Autores: | |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Clinics |
Texto Completo: | https://www.revistas.usp.br/clinics/article/view/213996 |
Resumo: | Background: Acute Respiratory Distress syndrome (ARDS) is a common complication of Acute Pancreatitis (AP) and is associated with high mortality. This study used Machine Learning (ML) to predict ARDS in patients with AP at admission. Methods: The authors retrospectively analyzed the data from patients with AP from January 2017 to August 2022. Clinical and laboratory parameters with significant differences between patients with and without ARDS were screened by univariate analysis. Then, Support Vector Machine (SVM), Ensembles of Decision Trees (EDTs), Bayesian Classifier (BC), and nomogram models were constructed and optimized after feature screening based on these parameters. Five-fold cross-validation was used to train each model. A test set was used to evaluate the predictive performance of the four models. Results: A total of 83 (18.04%) of 460 patients with AP developed ARDS. Thirty-one features with significant differences between the groups with and without ARDS in the training set were used for modeling. The Partial Pressure of Oxygen (PaO2), C-reactive protein, procalcitonin, lactic acid, Ca2+, the neutrophil:lymphocyte ratio, white blood cell count, and amylase were identified as the optimal subset of features. The BC algorithm had the best predictive performance with the highest AUC value (0.891) than SVM (0.870), EDTs (0.813), and the nomogram (0.874) in the test set. The EDT algorithm achieved the highest accuracy (0.891), precision (0.800), and F1 score (0.615), but the lowest FDR (0.200) and the second-highest NPV (0.902). Conclusions: A predictive model of ARDS complicated by AP was successfully developed based on ML. Predictive performance was evaluated by a test set, for which BC showed superior predictive performance and EDTs could be a more promising prediction tool for larger samples. |
id |
USP-19_ef848914e9d03485c42e57c12b01c4d4 |
---|---|
oai_identifier_str |
oai:revistas.usp.br:article/213996 |
network_acronym_str |
USP-19 |
network_name_str |
Clinics |
repository_id_str |
|
spelling |
Early prediction of acute respiratory distress syndrome complicated by acute pancreatitis based on four machine learning modelsAcute respiratory distress syndromeAcute pancreatitisMachine learningPrediction modelBackground: Acute Respiratory Distress syndrome (ARDS) is a common complication of Acute Pancreatitis (AP) and is associated with high mortality. This study used Machine Learning (ML) to predict ARDS in patients with AP at admission. Methods: The authors retrospectively analyzed the data from patients with AP from January 2017 to August 2022. Clinical and laboratory parameters with significant differences between patients with and without ARDS were screened by univariate analysis. Then, Support Vector Machine (SVM), Ensembles of Decision Trees (EDTs), Bayesian Classifier (BC), and nomogram models were constructed and optimized after feature screening based on these parameters. Five-fold cross-validation was used to train each model. A test set was used to evaluate the predictive performance of the four models. Results: A total of 83 (18.04%) of 460 patients with AP developed ARDS. Thirty-one features with significant differences between the groups with and without ARDS in the training set were used for modeling. The Partial Pressure of Oxygen (PaO2), C-reactive protein, procalcitonin, lactic acid, Ca2+, the neutrophil:lymphocyte ratio, white blood cell count, and amylase were identified as the optimal subset of features. The BC algorithm had the best predictive performance with the highest AUC value (0.891) than SVM (0.870), EDTs (0.813), and the nomogram (0.874) in the test set. The EDT algorithm achieved the highest accuracy (0.891), precision (0.800), and F1 score (0.615), but the lowest FDR (0.200) and the second-highest NPV (0.902). Conclusions: A predictive model of ARDS complicated by AP was successfully developed based on ML. Predictive performance was evaluated by a test set, for which BC showed superior predictive performance and EDTs could be a more promising prediction tool for larger samples.Hospital das Clínicas, Faculdade de Medicina, Universidade de São Paulo2023-05-03info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionapplication/pdfhttps://www.revistas.usp.br/clinics/article/view/21399610.1016/j.clinsp.2023.100215Clinics; Vol. 78 (2023); 100215Clinics; v. 78 (2023); 100215Clinics; Vol. 78 (2023); 1002151980-53221807-5932reponame:Clinicsinstname:Universidade de São Paulo (USP)instacron:USPenghttps://www.revistas.usp.br/clinics/article/view/213996/196209Copyright (c) 2023 Clinicsinfo:eu-repo/semantics/openAccessZhang, MengranPang, Mingge2023-07-06T13:05:39Zoai:revistas.usp.br:article/213996Revistahttps://www.revistas.usp.br/clinicsPUBhttps://www.revistas.usp.br/clinics/oai||clinics@hc.fm.usp.br1980-53221807-5932opendoar:2023-07-06T13:05:39Clinics - Universidade de São Paulo (USP)false |
dc.title.none.fl_str_mv |
Early prediction of acute respiratory distress syndrome complicated by acute pancreatitis based on four machine learning models |
title |
Early prediction of acute respiratory distress syndrome complicated by acute pancreatitis based on four machine learning models |
spellingShingle |
Early prediction of acute respiratory distress syndrome complicated by acute pancreatitis based on four machine learning models Zhang, Mengran Acute respiratory distress syndrome Acute pancreatitis Machine learning Prediction model |
title_short |
Early prediction of acute respiratory distress syndrome complicated by acute pancreatitis based on four machine learning models |
title_full |
Early prediction of acute respiratory distress syndrome complicated by acute pancreatitis based on four machine learning models |
title_fullStr |
Early prediction of acute respiratory distress syndrome complicated by acute pancreatitis based on four machine learning models |
title_full_unstemmed |
Early prediction of acute respiratory distress syndrome complicated by acute pancreatitis based on four machine learning models |
title_sort |
Early prediction of acute respiratory distress syndrome complicated by acute pancreatitis based on four machine learning models |
author |
Zhang, Mengran |
author_facet |
Zhang, Mengran Pang, Mingge |
author_role |
author |
author2 |
Pang, Mingge |
author2_role |
author |
dc.contributor.author.fl_str_mv |
Zhang, Mengran Pang, Mingge |
dc.subject.por.fl_str_mv |
Acute respiratory distress syndrome Acute pancreatitis Machine learning Prediction model |
topic |
Acute respiratory distress syndrome Acute pancreatitis Machine learning Prediction model |
description |
Background: Acute Respiratory Distress syndrome (ARDS) is a common complication of Acute Pancreatitis (AP) and is associated with high mortality. This study used Machine Learning (ML) to predict ARDS in patients with AP at admission. Methods: The authors retrospectively analyzed the data from patients with AP from January 2017 to August 2022. Clinical and laboratory parameters with significant differences between patients with and without ARDS were screened by univariate analysis. Then, Support Vector Machine (SVM), Ensembles of Decision Trees (EDTs), Bayesian Classifier (BC), and nomogram models were constructed and optimized after feature screening based on these parameters. Five-fold cross-validation was used to train each model. A test set was used to evaluate the predictive performance of the four models. Results: A total of 83 (18.04%) of 460 patients with AP developed ARDS. Thirty-one features with significant differences between the groups with and without ARDS in the training set were used for modeling. The Partial Pressure of Oxygen (PaO2), C-reactive protein, procalcitonin, lactic acid, Ca2+, the neutrophil:lymphocyte ratio, white blood cell count, and amylase were identified as the optimal subset of features. The BC algorithm had the best predictive performance with the highest AUC value (0.891) than SVM (0.870), EDTs (0.813), and the nomogram (0.874) in the test set. The EDT algorithm achieved the highest accuracy (0.891), precision (0.800), and F1 score (0.615), but the lowest FDR (0.200) and the second-highest NPV (0.902). Conclusions: A predictive model of ARDS complicated by AP was successfully developed based on ML. Predictive performance was evaluated by a test set, for which BC showed superior predictive performance and EDTs could be a more promising prediction tool for larger samples. |
publishDate |
2023 |
dc.date.none.fl_str_mv |
2023-05-03 |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article info:eu-repo/semantics/publishedVersion |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://www.revistas.usp.br/clinics/article/view/213996 10.1016/j.clinsp.2023.100215 |
url |
https://www.revistas.usp.br/clinics/article/view/213996 |
identifier_str_mv |
10.1016/j.clinsp.2023.100215 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
https://www.revistas.usp.br/clinics/article/view/213996/196209 |
dc.rights.driver.fl_str_mv |
Copyright (c) 2023 Clinics info:eu-repo/semantics/openAccess |
rights_invalid_str_mv |
Copyright (c) 2023 Clinics |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Hospital das Clínicas, Faculdade de Medicina, Universidade de São Paulo |
publisher.none.fl_str_mv |
Hospital das Clínicas, Faculdade de Medicina, Universidade de São Paulo |
dc.source.none.fl_str_mv |
Clinics; Vol. 78 (2023); 100215 Clinics; v. 78 (2023); 100215 Clinics; Vol. 78 (2023); 100215 1980-5322 1807-5932 reponame:Clinics instname:Universidade de São Paulo (USP) instacron:USP |
instname_str |
Universidade de São Paulo (USP) |
instacron_str |
USP |
institution |
USP |
reponame_str |
Clinics |
collection |
Clinics |
repository.name.fl_str_mv |
Clinics - Universidade de São Paulo (USP) |
repository.mail.fl_str_mv |
||clinics@hc.fm.usp.br |
_version_ |
1800222767382003712 |