Time series forecast and anomaly detection at scale applied to business metrics in an ERP environment

Detalhes bibliográficos
Autor(a) principal: Martins, André Gil Cardeira
Data de Publicação: 2019
Tipo de documento: Dissertação
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10071/20281
Resumo: In the business world, dashboards are a widely used analytical mechanism that helps in the decision-making process by displaying insights, key performance indicators, and business metrics. The information provided by this type of mechanism is strongly aggregated, to obtain a high level of summarization and consequently make reading easier. However, the necessary summarization causes “blind spots” to appear by hiding important information such as a sharp drop in revenue from a specific customer, seller, or product/ service. These “blind spots” make it difficult to detect potential business problems and opportunities, which depend on lengthy and thorough additional exploration. Also, the digital transformation process has resulted in a substantial increase in the number of metrics for all systems supporting the business that need to be tracked. Thus, it will be possible to anticipate actions based on the prediction of future behavior, as well as to detect any isolated or successive deviation from the expected behavior. With this dissertation, we intend to promote the acquisition of knowledge from business data through the application of Machine Learning techniques. Based on the Data-Driven Decision-Making process, we intend to propose integration into an ERP application of a mechanism to predict time-series behavior, as well as detecting and measuring possible anomalies. For dealing with a wide diversity of time series, we propose a meta-learning forecasting method that uses a classifier to identify the best forecasting method for each time series. We also propose a new intelligent metric that allows us to sort time series by the accumulated anomaly. The knowledge generated will complement the information provided by the analytical mechanisms typically present in an ERP application (including dashboards). In this way, we intend to contribute to the maximization of profits and reduction of the possibility of error or fraud, as well as waste and consequently mitigate uncertainty and reduce operational risk. Our solution should promote the need to use Machine Learning in Small and Medium Enterprises, and consequently, future implementation of AI-Driven Decision Making. AI-Driven Decision-Making purposes an assertive and automated reaction to problems or opportunities encountered, but whose study is outside the scope of this dissertation.
id RCAP_6b5edb7cb29364119070650d83453176
oai_identifier_str oai:repositorio.iscte-iul.pt:10071/20281
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Time series forecast and anomaly detection at scale applied to business metrics in an ERP environmentTime seriesForecastingAnomaly detectionBusiness forecastingUncertain modelingSéries temporaisPrevisãoDeteção de anomaliasPrevisão em negóciosModelação de incertezaIn the business world, dashboards are a widely used analytical mechanism that helps in the decision-making process by displaying insights, key performance indicators, and business metrics. The information provided by this type of mechanism is strongly aggregated, to obtain a high level of summarization and consequently make reading easier. However, the necessary summarization causes “blind spots” to appear by hiding important information such as a sharp drop in revenue from a specific customer, seller, or product/ service. These “blind spots” make it difficult to detect potential business problems and opportunities, which depend on lengthy and thorough additional exploration. Also, the digital transformation process has resulted in a substantial increase in the number of metrics for all systems supporting the business that need to be tracked. Thus, it will be possible to anticipate actions based on the prediction of future behavior, as well as to detect any isolated or successive deviation from the expected behavior. With this dissertation, we intend to promote the acquisition of knowledge from business data through the application of Machine Learning techniques. Based on the Data-Driven Decision-Making process, we intend to propose integration into an ERP application of a mechanism to predict time-series behavior, as well as detecting and measuring possible anomalies. For dealing with a wide diversity of time series, we propose a meta-learning forecasting method that uses a classifier to identify the best forecasting method for each time series. We also propose a new intelligent metric that allows us to sort time series by the accumulated anomaly. The knowledge generated will complement the information provided by the analytical mechanisms typically present in an ERP application (including dashboards). In this way, we intend to contribute to the maximization of profits and reduction of the possibility of error or fraud, as well as waste and consequently mitigate uncertainty and reduce operational risk. Our solution should promote the need to use Machine Learning in Small and Medium Enterprises, and consequently, future implementation of AI-Driven Decision Making. AI-Driven Decision-Making purposes an assertive and automated reaction to problems or opportunities encountered, but whose study is outside the scope of this dissertation.No meio empresarial, “dashboards” são mecanismos analíticos amplamente utilizados que ajudam no processo de tomada de decisão ao exibirem insights, indicadores de desempenho (KPIs) e métricas de negócio. A informação disponibilizada por este tipo de mecanismo é fortemente agregada, de forma a obter-se um elevado nível de sumarização e consequentemente facilitar a sua consulta. No entanto, a necessária sumarização provoca o surgimento de “blind spots”, ao ocultar informação importante como, por exemplo, uma quebra acentuada de receita de um cliente, ou de um vendedor, ou de um produto/serviço específico. Estes “blind spots” dificultam a deteção de eventuais problemas e oportunidades de negócio, que ficam dependentes de uma exploração adicional demorada e minuciosa. Adicionalmente, o processo de transformação digital tem como consequência um aumento substancial do número de métricas referentes a todos os sistemas que suportam o negócio, que importa acompanhar. Desta forma, será possível antecipar ações baseadas na previsão de um comportamento futuro, bem como detetar um eventual desvio isolado ou sucessivo face ao seu comportamento espectável. Como objetivo desta dissertação pretendemos promover a obtenção de conhecimento a partir de dados de negócio, através da aplicação de técnicas de Aprendizagem Automática (“Machine Learning”). Tendo por base o processo de tomada de decisão a partir de dados (“Data-Driven Decision-Making”) pretende-se propor a integração numa aplicação ERP de um mecanismo que permita prever o comportamento futuro de séries temporais que contêm dados de negócio, bem como detetar e medir possíveis anomalias de forma a poderem ser gerados alertas. Para lidar com uma ampla diversidade de séries temporais, propomos um método de previsão de meta-aprendizagem que utiliza um classificador para identificar o melhor método de previsão para cada série temporal, e uma nova métrica inteligente que permite ordenar séries temporais pela anomalia acumulada. O conhecimento gerado irá complementar a informação disponibilizada pelos mecanismos analíticos tipicamente existente numa aplicação ERP (incluindo “dashboards”). Desta forma pretendemos contribuir para uma maximização dos proveitos e redução da possibilidade de erro ou fraude, bem como do desperdício e consequentemente mitigar a incerteza e diminuir o risco operacional. Pretende-se igualmente que a solução promova a utilização de Aprendizagem Automática em Pequenas e Médias Empresas, e consequentemente uma futura implementação de tomada de decisões a partir de Inteligência Artificial (“AI-Driven Decision Making”), onde uma reação assertiva e automatizada é despoletada, face a problemas ou oportunidades encontradas, mas cujo estudo fica fora do âmbito do presente trabalho.2020-03-31T10:15:44Z2019-12-06T00:00:00Z2019-12-062019-09info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10071/20281TID:202461904engMartins, André Gil Cardeirainfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-09T17:34:49Zoai:repositorio.iscte-iul.pt:10071/20281Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T22:15:44.208063Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Time series forecast and anomaly detection at scale applied to business metrics in an ERP environment
title Time series forecast and anomaly detection at scale applied to business metrics in an ERP environment
spellingShingle Time series forecast and anomaly detection at scale applied to business metrics in an ERP environment
Martins, André Gil Cardeira
Time series
Forecasting
Anomaly detection
Business forecasting
Uncertain modeling
Séries temporais
Previsão
Deteção de anomalias
Previsão em negócios
Modelação de incerteza
title_short Time series forecast and anomaly detection at scale applied to business metrics in an ERP environment
title_full Time series forecast and anomaly detection at scale applied to business metrics in an ERP environment
title_fullStr Time series forecast and anomaly detection at scale applied to business metrics in an ERP environment
title_full_unstemmed Time series forecast and anomaly detection at scale applied to business metrics in an ERP environment
title_sort Time series forecast and anomaly detection at scale applied to business metrics in an ERP environment
author Martins, André Gil Cardeira
author_facet Martins, André Gil Cardeira
author_role author
dc.contributor.author.fl_str_mv Martins, André Gil Cardeira
dc.subject.por.fl_str_mv Time series
Forecasting
Anomaly detection
Business forecasting
Uncertain modeling
Séries temporais
Previsão
Deteção de anomalias
Previsão em negócios
Modelação de incerteza
topic Time series
Forecasting
Anomaly detection
Business forecasting
Uncertain modeling
Séries temporais
Previsão
Deteção de anomalias
Previsão em negócios
Modelação de incerteza
description In the business world, dashboards are a widely used analytical mechanism that helps in the decision-making process by displaying insights, key performance indicators, and business metrics. The information provided by this type of mechanism is strongly aggregated, to obtain a high level of summarization and consequently make reading easier. However, the necessary summarization causes “blind spots” to appear by hiding important information such as a sharp drop in revenue from a specific customer, seller, or product/ service. These “blind spots” make it difficult to detect potential business problems and opportunities, which depend on lengthy and thorough additional exploration. Also, the digital transformation process has resulted in a substantial increase in the number of metrics for all systems supporting the business that need to be tracked. Thus, it will be possible to anticipate actions based on the prediction of future behavior, as well as to detect any isolated or successive deviation from the expected behavior. With this dissertation, we intend to promote the acquisition of knowledge from business data through the application of Machine Learning techniques. Based on the Data-Driven Decision-Making process, we intend to propose integration into an ERP application of a mechanism to predict time-series behavior, as well as detecting and measuring possible anomalies. For dealing with a wide diversity of time series, we propose a meta-learning forecasting method that uses a classifier to identify the best forecasting method for each time series. We also propose a new intelligent metric that allows us to sort time series by the accumulated anomaly. The knowledge generated will complement the information provided by the analytical mechanisms typically present in an ERP application (including dashboards). In this way, we intend to contribute to the maximization of profits and reduction of the possibility of error or fraud, as well as waste and consequently mitigate uncertainty and reduce operational risk. Our solution should promote the need to use Machine Learning in Small and Medium Enterprises, and consequently, future implementation of AI-Driven Decision Making. AI-Driven Decision-Making purposes an assertive and automated reaction to problems or opportunities encountered, but whose study is outside the scope of this dissertation.
publishDate 2019
dc.date.none.fl_str_mv 2019-12-06T00:00:00Z
2019-12-06
2019-09
2020-03-31T10:15:44Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10071/20281
TID:202461904
url http://hdl.handle.net/10071/20281
identifier_str_mv TID:202461904
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799134715367653376