Retail monitoring and alerting system for observability

Detalhes bibliográficos
Autor(a) principal: Silva, Diogo Manuel Pais
Data de Publicação: 2023
Tipo de documento: Dissertação
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10773/41046
Resumo: Oracle Retail implementations are supported by a somewhat complex infrastructure composed of various services. Retail Consult offers consultancy to clients with these systems, providing 24/7 support. Problems can arise in any of these infrastructure components with varying levels of priority, affecting the system’s availability. Manual monitoring of various clients’ systems is not possible due to the number of clients, systems, information dispersion and human resources available. Thus, automating the collection of key metrics from the various systems is essential. By constantly monitoring key metrics of the most important infrastructure components (the OS, WebLogic, and Database), alerts can be set for critical issues, improving the reaction time and efficacy of the solution. These metrics can also provide insight into the systems’ current and previous states, which can help debugging problems that arise. In this dissertation we propose a monitoring solution supported on a ETL (Extract, Transform, Load) pipeline based on the Unified Data Infrastructure 2.0, which pulls the desired metrics from the monitored system, sends them to a data broker (Kafka) and then loads them on a time series database (InfluxDB). Grafana was used to support metric visualization. It also allows customized analytics. With our system the metrics can be provided with low latency for visualization by an end user. The system also provides alerts and daily reports sent via Microsoft Teams and email, respectively. The system pilot was deployed in a production environment of a client, which had a very positive impact in the target infrastructure, helping spot problems in terms of Database concurrency, bugs, and resource usage, improving the quality of service provided by Retail Consult.
id RCAP_558d546e99f8f919d6cabcbe11c0af31
oai_identifier_str oai:ria.ua.pt:10773/41046
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Retail monitoring and alerting system for observabilityObservabilityAlertingTime Series DatabaseStreamingETLKafkaInfluxDBOracle RetailOracle Retail implementations are supported by a somewhat complex infrastructure composed of various services. Retail Consult offers consultancy to clients with these systems, providing 24/7 support. Problems can arise in any of these infrastructure components with varying levels of priority, affecting the system’s availability. Manual monitoring of various clients’ systems is not possible due to the number of clients, systems, information dispersion and human resources available. Thus, automating the collection of key metrics from the various systems is essential. By constantly monitoring key metrics of the most important infrastructure components (the OS, WebLogic, and Database), alerts can be set for critical issues, improving the reaction time and efficacy of the solution. These metrics can also provide insight into the systems’ current and previous states, which can help debugging problems that arise. In this dissertation we propose a monitoring solution supported on a ETL (Extract, Transform, Load) pipeline based on the Unified Data Infrastructure 2.0, which pulls the desired metrics from the monitored system, sends them to a data broker (Kafka) and then loads them on a time series database (InfluxDB). Grafana was used to support metric visualization. It also allows customized analytics. With our system the metrics can be provided with low latency for visualization by an end user. The system also provides alerts and daily reports sent via Microsoft Teams and email, respectively. The system pilot was deployed in a production environment of a client, which had a very positive impact in the target infrastructure, helping spot problems in terms of Database concurrency, bugs, and resource usage, improving the quality of service provided by Retail Consult.Implementações de Oracle Retail são suportadas por uma infraestrutura algo complexa, composta por vários serviços. A Retail Consult fornece consultoria e suporte 24/7 a empresas com sistemas Oracle Retail. Problemas com vários níveis de prioridade surgem nesses sistemas afetando a disponibilidade dos serviços. Realizar a monitorização de forma manual não é possível devido ao número de clientes, sistemas, dispersão de informação e recursos humanos disponíveis, pelo que é essencial automatizar o processo de recolha de métricas chave dos vários sistemas. Ao monitorizar métricas dos sistemas mais importantes da infraestrutura de uma implementação Oracle Retail (sistema operativo, WebLogic e base de dados) é possível implementar alertas para incidentes críticos, melhorando o tempo de reação e eficácia na resolução dos mesmos. Estas mesmas métricas também fornecem informação sobre o estado atual e passado dos sistemas, algo que é útil na depuração de problemas. Nesta dissertação propomos uma pipeline ETL (“Extract, Transform, Load”) baseada na arquitetura Unified Data Infrastructure 2.0, recolhendo as métricas pretendidas dos sistemas monitorizados, publicando num broker (Kafka) e armazenando-as numa base de dados temporal (InfluxDB). O Grafana é usado para permitir a visualização destas métricas. Com este sistema é possível o utilizador final observar as métricas recolhidas com baixa latência. O sistema tem a possibilidade de enviar alertas e relatórios diários via Microsoft Teams e/ou email, respetivamente. O piloto do sistema foi implementando em vários sistemas de clientes, um dos quais um ambiente produtivo, sendo que teve um impacto bastante positivo, tendo facilitado a monitorização da infraestrutura e detetado problemas que estavam a ocorrer em termos de concorrência na base de dados, bugs e uso de recursos, melhorando a qualidade de serviço prestada pela Retail Consult.2025-07-11T00:00:00Z2023-07-04T00:00:00Z2023-07-04info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10773/41046engSilva, Diogo Manuel Paisinfo:eu-repo/semantics/embargoedAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-03-18T01:48:19Zoai:ria.ua.pt:10773/41046Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T04:02:09.203560Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Retail monitoring and alerting system for observability
title Retail monitoring and alerting system for observability
spellingShingle Retail monitoring and alerting system for observability
Silva, Diogo Manuel Pais
Observability
Alerting
Time Series Database
Streaming
ETL
Kafka
InfluxDB
Oracle Retail
title_short Retail monitoring and alerting system for observability
title_full Retail monitoring and alerting system for observability
title_fullStr Retail monitoring and alerting system for observability
title_full_unstemmed Retail monitoring and alerting system for observability
title_sort Retail monitoring and alerting system for observability
author Silva, Diogo Manuel Pais
author_facet Silva, Diogo Manuel Pais
author_role author
dc.contributor.author.fl_str_mv Silva, Diogo Manuel Pais
dc.subject.por.fl_str_mv Observability
Alerting
Time Series Database
Streaming
ETL
Kafka
InfluxDB
Oracle Retail
topic Observability
Alerting
Time Series Database
Streaming
ETL
Kafka
InfluxDB
Oracle Retail
description Oracle Retail implementations are supported by a somewhat complex infrastructure composed of various services. Retail Consult offers consultancy to clients with these systems, providing 24/7 support. Problems can arise in any of these infrastructure components with varying levels of priority, affecting the system’s availability. Manual monitoring of various clients’ systems is not possible due to the number of clients, systems, information dispersion and human resources available. Thus, automating the collection of key metrics from the various systems is essential. By constantly monitoring key metrics of the most important infrastructure components (the OS, WebLogic, and Database), alerts can be set for critical issues, improving the reaction time and efficacy of the solution. These metrics can also provide insight into the systems’ current and previous states, which can help debugging problems that arise. In this dissertation we propose a monitoring solution supported on a ETL (Extract, Transform, Load) pipeline based on the Unified Data Infrastructure 2.0, which pulls the desired metrics from the monitored system, sends them to a data broker (Kafka) and then loads them on a time series database (InfluxDB). Grafana was used to support metric visualization. It also allows customized analytics. With our system the metrics can be provided with low latency for visualization by an end user. The system also provides alerts and daily reports sent via Microsoft Teams and email, respectively. The system pilot was deployed in a production environment of a client, which had a very positive impact in the target infrastructure, helping spot problems in terms of Database concurrency, bugs, and resource usage, improving the quality of service provided by Retail Consult.
publishDate 2023
dc.date.none.fl_str_mv 2023-07-04T00:00:00Z
2023-07-04
2025-07-11T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10773/41046
url http://hdl.handle.net/10773/41046
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/embargoedAccess
eu_rights_str_mv embargoedAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799138193908432896