Improving Failure Prediction by Ensembling the Decisions of Machine Learning Models: A Case Study

Detalhes bibliográficos
Autor(a) principal: Campos, João
Data de Publicação: 2019
Outros Autores: Costa, Ernesto, Vieira, Marco
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
DOI: 10.1109/ACCESS.2019.2958480
Texto Completo: http://hdl.handle.net/10316/101614
https://doi.org/10.1109/ACCESS.2019.2958480
Resumo: The complexity of software has grown considerably in recent years, making it nearly impossible to detect all faults before pushing to production. Such faults can ultimately lead to failures at runtime. Recent works have shown that using Machine Learning (ML) algorithms it is possible to create models that can accurately predict such failures. At the same time, methods that combine several independent learners (i.e., ensembles) have proved to outperform individual models in various problems. While some well-known ensemble algorithms (e.g Bagging) use the same base learners (i.e., homogeneous), using different algorithms (i.e., heterogeneous) may exploit the different biases of each algorithm. However, this is not a trivial task, as it requires nding and choosing the most adequate base learners and methods to combine their outputs. This paper presents a case study on using several ML techniques to create heterogeneous ensembles for Online Failure Prediction (OFP). More precisely, it attempts to assess the viability of combining different learners to improve performance and to understand how different combination techniques in uence the results. The paper also explores whether the interactions between learners can be studied and leveraged. The results suggest that the combination of certain learners and techniques, not necessarily individually the best, can improve the overall ability to predict failures. Additionally, studying the synergies in the best ensembles provides interesting insights into why some are able to perform better.
id RCAP_e07440b6d25ea0996cf363996054d276
oai_identifier_str oai:estudogeral.uc.pt:10316/101614
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Improving Failure Prediction by Ensembling the Decisions of Machine Learning Models: A Case StudyEnsemblesfailure predictionmachine learningreliabilityThe complexity of software has grown considerably in recent years, making it nearly impossible to detect all faults before pushing to production. Such faults can ultimately lead to failures at runtime. Recent works have shown that using Machine Learning (ML) algorithms it is possible to create models that can accurately predict such failures. At the same time, methods that combine several independent learners (i.e., ensembles) have proved to outperform individual models in various problems. While some well-known ensemble algorithms (e.g Bagging) use the same base learners (i.e., homogeneous), using different algorithms (i.e., heterogeneous) may exploit the different biases of each algorithm. However, this is not a trivial task, as it requires nding and choosing the most adequate base learners and methods to combine their outputs. This paper presents a case study on using several ML techniques to create heterogeneous ensembles for Online Failure Prediction (OFP). More precisely, it attempts to assess the viability of combining different learners to improve performance and to understand how different combination techniques in uence the results. The paper also explores whether the interactions between learners can be studied and leveraged. The results suggest that the combination of certain learners and techniques, not necessarily individually the best, can improve the overall ability to predict failures. Additionally, studying the synergies in the best ensembles provides interesting insights into why some are able to perform better.2019info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://hdl.handle.net/10316/101614http://hdl.handle.net/10316/101614https://doi.org/10.1109/ACCESS.2019.2958480eng2169-3536Campos, JoãoCosta, ErnestoVieira, Marcoinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2022-09-02T20:47:42Zoai:estudogeral.uc.pt:10316/101614Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T21:18:46.217111Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Improving Failure Prediction by Ensembling the Decisions of Machine Learning Models: A Case Study
title Improving Failure Prediction by Ensembling the Decisions of Machine Learning Models: A Case Study
spellingShingle Improving Failure Prediction by Ensembling the Decisions of Machine Learning Models: A Case Study
Improving Failure Prediction by Ensembling the Decisions of Machine Learning Models: A Case Study
Campos, João
Ensembles
failure prediction
machine learning
reliability
Campos, João
Ensembles
failure prediction
machine learning
reliability
title_short Improving Failure Prediction by Ensembling the Decisions of Machine Learning Models: A Case Study
title_full Improving Failure Prediction by Ensembling the Decisions of Machine Learning Models: A Case Study
title_fullStr Improving Failure Prediction by Ensembling the Decisions of Machine Learning Models: A Case Study
Improving Failure Prediction by Ensembling the Decisions of Machine Learning Models: A Case Study
title_full_unstemmed Improving Failure Prediction by Ensembling the Decisions of Machine Learning Models: A Case Study
Improving Failure Prediction by Ensembling the Decisions of Machine Learning Models: A Case Study
title_sort Improving Failure Prediction by Ensembling the Decisions of Machine Learning Models: A Case Study
author Campos, João
author_facet Campos, João
Campos, João
Costa, Ernesto
Vieira, Marco
Costa, Ernesto
Vieira, Marco
author_role author
author2 Costa, Ernesto
Vieira, Marco
author2_role author
author
dc.contributor.author.fl_str_mv Campos, João
Costa, Ernesto
Vieira, Marco
dc.subject.por.fl_str_mv Ensembles
failure prediction
machine learning
reliability
topic Ensembles
failure prediction
machine learning
reliability
description The complexity of software has grown considerably in recent years, making it nearly impossible to detect all faults before pushing to production. Such faults can ultimately lead to failures at runtime. Recent works have shown that using Machine Learning (ML) algorithms it is possible to create models that can accurately predict such failures. At the same time, methods that combine several independent learners (i.e., ensembles) have proved to outperform individual models in various problems. While some well-known ensemble algorithms (e.g Bagging) use the same base learners (i.e., homogeneous), using different algorithms (i.e., heterogeneous) may exploit the different biases of each algorithm. However, this is not a trivial task, as it requires nding and choosing the most adequate base learners and methods to combine their outputs. This paper presents a case study on using several ML techniques to create heterogeneous ensembles for Online Failure Prediction (OFP). More precisely, it attempts to assess the viability of combining different learners to improve performance and to understand how different combination techniques in uence the results. The paper also explores whether the interactions between learners can be studied and leveraged. The results suggest that the combination of certain learners and techniques, not necessarily individually the best, can improve the overall ability to predict failures. Additionally, studying the synergies in the best ensembles provides interesting insights into why some are able to perform better.
publishDate 2019
dc.date.none.fl_str_mv 2019
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10316/101614
http://hdl.handle.net/10316/101614
https://doi.org/10.1109/ACCESS.2019.2958480
url http://hdl.handle.net/10316/101614
https://doi.org/10.1109/ACCESS.2019.2958480
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 2169-3536
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1822183453025632256
dc.identifier.doi.none.fl_str_mv 10.1109/ACCESS.2019.2958480