2016_N76 - Tecnologias e modelos de suporte a analytics sobre séries temporais

Detalhes bibliográficos
Autor(a) principal: Paulo Manuel da Silva Faria
Data de Publicação: 2017
Tipo de documento: Dissertação
Idioma: por
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: https://hdl.handle.net/10216/106807
Resumo: In a society where the data is transmitted and consumed in a large scale at great speed, to do so its needed to find the best way to process it and maximize the outcome. The aim of this work is to find tecnologies that are able to gather, model and process the data. Its in this context that comes the necessity of time series analysis in order to identity "where", "when" and "how" the clients of the telecomunication company interact with the services, and from that gaining the ability to control the moments of contact, the outcome from this monitorization is alot of data asking to be processed. The aim is to find tecnologies and way to model that are better suitable, using time series techniques, to factors like scalability, performance, cost of tools, physical support infrastructure and system operational costs. The queries performed to agregattion of information are, due to high amount and type of data, expensive in both computational resources and time from that comes problems because this data needs to be used for KPI's for Business Intelligence and those need to be available on time from the decision process. In the proponent case, the telecommunication companies, they diferentiate from one another for the quality of the service that they offer to the customers, that being so, its important to define methodologies and techniques that can compare several alternatives versus what is current implemmented and that way improve quality of services and get efficiency/economic benefits to the telecommunication company in their data management. After searching for alternatives, 3 were selected according to the user requirements for this project, those being Postgres, Citus and Timescaledb. To which data was model and insert to match project's proponente real use case and then each were put under tests to see the query performance and monitoring the machines in which the technologies were, using tools used by the company like jmeter and zabbix to get the results. After analysing the results, the conclusion is that the best choice is Citus, both to current use case evaluated and thinking on the scalability of the solution.
id RCAP_3a965ac6064059d228a3176620c59801
oai_identifier_str oai:repositorio-aberto.up.pt:10216/106807
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling 2016_N76 - Tecnologias e modelos de suporte a analytics sobre séries temporaisEngenharia electrotécnica, electrónica e informáticaElectrical engineering, Electronic engineering, Information engineeringIn a society where the data is transmitted and consumed in a large scale at great speed, to do so its needed to find the best way to process it and maximize the outcome. The aim of this work is to find tecnologies that are able to gather, model and process the data. Its in this context that comes the necessity of time series analysis in order to identity "where", "when" and "how" the clients of the telecomunication company interact with the services, and from that gaining the ability to control the moments of contact, the outcome from this monitorization is alot of data asking to be processed. The aim is to find tecnologies and way to model that are better suitable, using time series techniques, to factors like scalability, performance, cost of tools, physical support infrastructure and system operational costs. The queries performed to agregattion of information are, due to high amount and type of data, expensive in both computational resources and time from that comes problems because this data needs to be used for KPI's for Business Intelligence and those need to be available on time from the decision process. In the proponent case, the telecommunication companies, they diferentiate from one another for the quality of the service that they offer to the customers, that being so, its important to define methodologies and techniques that can compare several alternatives versus what is current implemmented and that way improve quality of services and get efficiency/economic benefits to the telecommunication company in their data management. After searching for alternatives, 3 were selected according to the user requirements for this project, those being Postgres, Citus and Timescaledb. To which data was model and insert to match project's proponente real use case and then each were put under tests to see the query performance and monitoring the machines in which the technologies were, using tools used by the company like jmeter and zabbix to get the results. After analysing the results, the conclusion is that the best choice is Citus, both to current use case evaluated and thinking on the scalability of the solution.2017-07-182017-07-18T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttps://hdl.handle.net/10216/106807TID:201802074porPaulo Manuel da Silva Fariainfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-29T15:49:22Zoai:repositorio-aberto.up.pt:10216/106807Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T00:33:04.899319Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv 2016_N76 - Tecnologias e modelos de suporte a analytics sobre séries temporais
title 2016_N76 - Tecnologias e modelos de suporte a analytics sobre séries temporais
spellingShingle 2016_N76 - Tecnologias e modelos de suporte a analytics sobre séries temporais
Paulo Manuel da Silva Faria
Engenharia electrotécnica, electrónica e informática
Electrical engineering, Electronic engineering, Information engineering
title_short 2016_N76 - Tecnologias e modelos de suporte a analytics sobre séries temporais
title_full 2016_N76 - Tecnologias e modelos de suporte a analytics sobre séries temporais
title_fullStr 2016_N76 - Tecnologias e modelos de suporte a analytics sobre séries temporais
title_full_unstemmed 2016_N76 - Tecnologias e modelos de suporte a analytics sobre séries temporais
title_sort 2016_N76 - Tecnologias e modelos de suporte a analytics sobre séries temporais
author Paulo Manuel da Silva Faria
author_facet Paulo Manuel da Silva Faria
author_role author
dc.contributor.author.fl_str_mv Paulo Manuel da Silva Faria
dc.subject.por.fl_str_mv Engenharia electrotécnica, electrónica e informática
Electrical engineering, Electronic engineering, Information engineering
topic Engenharia electrotécnica, electrónica e informática
Electrical engineering, Electronic engineering, Information engineering
description In a society where the data is transmitted and consumed in a large scale at great speed, to do so its needed to find the best way to process it and maximize the outcome. The aim of this work is to find tecnologies that are able to gather, model and process the data. Its in this context that comes the necessity of time series analysis in order to identity "where", "when" and "how" the clients of the telecomunication company interact with the services, and from that gaining the ability to control the moments of contact, the outcome from this monitorization is alot of data asking to be processed. The aim is to find tecnologies and way to model that are better suitable, using time series techniques, to factors like scalability, performance, cost of tools, physical support infrastructure and system operational costs. The queries performed to agregattion of information are, due to high amount and type of data, expensive in both computational resources and time from that comes problems because this data needs to be used for KPI's for Business Intelligence and those need to be available on time from the decision process. In the proponent case, the telecommunication companies, they diferentiate from one another for the quality of the service that they offer to the customers, that being so, its important to define methodologies and techniques that can compare several alternatives versus what is current implemmented and that way improve quality of services and get efficiency/economic benefits to the telecommunication company in their data management. After searching for alternatives, 3 were selected according to the user requirements for this project, those being Postgres, Citus and Timescaledb. To which data was model and insert to match project's proponente real use case and then each were put under tests to see the query performance and monitoring the machines in which the technologies were, using tools used by the company like jmeter and zabbix to get the results. After analysing the results, the conclusion is that the best choice is Citus, both to current use case evaluated and thinking on the scalability of the solution.
publishDate 2017
dc.date.none.fl_str_mv 2017-07-18
2017-07-18T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://hdl.handle.net/10216/106807
TID:201802074
url https://hdl.handle.net/10216/106807
identifier_str_mv TID:201802074
dc.language.iso.fl_str_mv por
language por
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799136239246376960