Improving Failure Prediction by Ensembling the Decisions of Machine Learning Models: A Case Study
Autor(a) principal: | |
---|---|
Data de Publicação: | 2019 |
Outros Autores: | , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
DOI: | 10.1109/ACCESS.2019.2958480 |
Texto Completo: | http://hdl.handle.net/10316/101614 https://doi.org/10.1109/ACCESS.2019.2958480 |
Resumo: | The complexity of software has grown considerably in recent years, making it nearly impossible to detect all faults before pushing to production. Such faults can ultimately lead to failures at runtime. Recent works have shown that using Machine Learning (ML) algorithms it is possible to create models that can accurately predict such failures. At the same time, methods that combine several independent learners (i.e., ensembles) have proved to outperform individual models in various problems. While some well-known ensemble algorithms (e.g Bagging) use the same base learners (i.e., homogeneous), using different algorithms (i.e., heterogeneous) may exploit the different biases of each algorithm. However, this is not a trivial task, as it requires nding and choosing the most adequate base learners and methods to combine their outputs. This paper presents a case study on using several ML techniques to create heterogeneous ensembles for Online Failure Prediction (OFP). More precisely, it attempts to assess the viability of combining different learners to improve performance and to understand how different combination techniques in uence the results. The paper also explores whether the interactions between learners can be studied and leveraged. The results suggest that the combination of certain learners and techniques, not necessarily individually the best, can improve the overall ability to predict failures. Additionally, studying the synergies in the best ensembles provides interesting insights into why some are able to perform better. |
id |
RCAP_e07440b6d25ea0996cf363996054d276 |
---|---|
oai_identifier_str |
oai:estudogeral.uc.pt:10316/101614 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Improving Failure Prediction by Ensembling the Decisions of Machine Learning Models: A Case StudyEnsemblesfailure predictionmachine learningreliabilityThe complexity of software has grown considerably in recent years, making it nearly impossible to detect all faults before pushing to production. Such faults can ultimately lead to failures at runtime. Recent works have shown that using Machine Learning (ML) algorithms it is possible to create models that can accurately predict such failures. At the same time, methods that combine several independent learners (i.e., ensembles) have proved to outperform individual models in various problems. While some well-known ensemble algorithms (e.g Bagging) use the same base learners (i.e., homogeneous), using different algorithms (i.e., heterogeneous) may exploit the different biases of each algorithm. However, this is not a trivial task, as it requires nding and choosing the most adequate base learners and methods to combine their outputs. This paper presents a case study on using several ML techniques to create heterogeneous ensembles for Online Failure Prediction (OFP). More precisely, it attempts to assess the viability of combining different learners to improve performance and to understand how different combination techniques in uence the results. The paper also explores whether the interactions between learners can be studied and leveraged. The results suggest that the combination of certain learners and techniques, not necessarily individually the best, can improve the overall ability to predict failures. Additionally, studying the synergies in the best ensembles provides interesting insights into why some are able to perform better.2019info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://hdl.handle.net/10316/101614http://hdl.handle.net/10316/101614https://doi.org/10.1109/ACCESS.2019.2958480eng2169-3536Campos, JoãoCosta, ErnestoVieira, Marcoinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2022-09-02T20:47:42Zoai:estudogeral.uc.pt:10316/101614Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T21:18:46.217111Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Improving Failure Prediction by Ensembling the Decisions of Machine Learning Models: A Case Study |
title |
Improving Failure Prediction by Ensembling the Decisions of Machine Learning Models: A Case Study |
spellingShingle |
Improving Failure Prediction by Ensembling the Decisions of Machine Learning Models: A Case Study Improving Failure Prediction by Ensembling the Decisions of Machine Learning Models: A Case Study Campos, João Ensembles failure prediction machine learning reliability Campos, João Ensembles failure prediction machine learning reliability |
title_short |
Improving Failure Prediction by Ensembling the Decisions of Machine Learning Models: A Case Study |
title_full |
Improving Failure Prediction by Ensembling the Decisions of Machine Learning Models: A Case Study |
title_fullStr |
Improving Failure Prediction by Ensembling the Decisions of Machine Learning Models: A Case Study Improving Failure Prediction by Ensembling the Decisions of Machine Learning Models: A Case Study |
title_full_unstemmed |
Improving Failure Prediction by Ensembling the Decisions of Machine Learning Models: A Case Study Improving Failure Prediction by Ensembling the Decisions of Machine Learning Models: A Case Study |
title_sort |
Improving Failure Prediction by Ensembling the Decisions of Machine Learning Models: A Case Study |
author |
Campos, João |
author_facet |
Campos, João Campos, João Costa, Ernesto Vieira, Marco Costa, Ernesto Vieira, Marco |
author_role |
author |
author2 |
Costa, Ernesto Vieira, Marco |
author2_role |
author author |
dc.contributor.author.fl_str_mv |
Campos, João Costa, Ernesto Vieira, Marco |
dc.subject.por.fl_str_mv |
Ensembles failure prediction machine learning reliability |
topic |
Ensembles failure prediction machine learning reliability |
description |
The complexity of software has grown considerably in recent years, making it nearly impossible to detect all faults before pushing to production. Such faults can ultimately lead to failures at runtime. Recent works have shown that using Machine Learning (ML) algorithms it is possible to create models that can accurately predict such failures. At the same time, methods that combine several independent learners (i.e., ensembles) have proved to outperform individual models in various problems. While some well-known ensemble algorithms (e.g Bagging) use the same base learners (i.e., homogeneous), using different algorithms (i.e., heterogeneous) may exploit the different biases of each algorithm. However, this is not a trivial task, as it requires nding and choosing the most adequate base learners and methods to combine their outputs. This paper presents a case study on using several ML techniques to create heterogeneous ensembles for Online Failure Prediction (OFP). More precisely, it attempts to assess the viability of combining different learners to improve performance and to understand how different combination techniques in uence the results. The paper also explores whether the interactions between learners can be studied and leveraged. The results suggest that the combination of certain learners and techniques, not necessarily individually the best, can improve the overall ability to predict failures. Additionally, studying the synergies in the best ensembles provides interesting insights into why some are able to perform better. |
publishDate |
2019 |
dc.date.none.fl_str_mv |
2019 |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10316/101614 http://hdl.handle.net/10316/101614 https://doi.org/10.1109/ACCESS.2019.2958480 |
url |
http://hdl.handle.net/10316/101614 https://doi.org/10.1109/ACCESS.2019.2958480 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
2169-3536 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1822183453025632256 |
dc.identifier.doi.none.fl_str_mv |
10.1109/ACCESS.2019.2958480 |