2016_N76 - Tecnologias e modelos de suporte a analytics sobre séries temporais
Autor(a) principal: | |
---|---|
Data de Publicação: | 2017 |
Tipo de documento: | Dissertação |
Idioma: | por |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | https://hdl.handle.net/10216/106807 |
Resumo: | In a society where the data is transmitted and consumed in a large scale at great speed, to do so its needed to find the best way to process it and maximize the outcome. The aim of this work is to find tecnologies that are able to gather, model and process the data. Its in this context that comes the necessity of time series analysis in order to identity "where", "when" and "how" the clients of the telecomunication company interact with the services, and from that gaining the ability to control the moments of contact, the outcome from this monitorization is alot of data asking to be processed. The aim is to find tecnologies and way to model that are better suitable, using time series techniques, to factors like scalability, performance, cost of tools, physical support infrastructure and system operational costs. The queries performed to agregattion of information are, due to high amount and type of data, expensive in both computational resources and time from that comes problems because this data needs to be used for KPI's for Business Intelligence and those need to be available on time from the decision process. In the proponent case, the telecommunication companies, they diferentiate from one another for the quality of the service that they offer to the customers, that being so, its important to define methodologies and techniques that can compare several alternatives versus what is current implemmented and that way improve quality of services and get efficiency/economic benefits to the telecommunication company in their data management. After searching for alternatives, 3 were selected according to the user requirements for this project, those being Postgres, Citus and Timescaledb. To which data was model and insert to match project's proponente real use case and then each were put under tests to see the query performance and monitoring the machines in which the technologies were, using tools used by the company like jmeter and zabbix to get the results. After analysing the results, the conclusion is that the best choice is Citus, both to current use case evaluated and thinking on the scalability of the solution. |
id |
RCAP_3a965ac6064059d228a3176620c59801 |
---|---|
oai_identifier_str |
oai:repositorio-aberto.up.pt:10216/106807 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
2016_N76 - Tecnologias e modelos de suporte a analytics sobre séries temporaisEngenharia electrotécnica, electrónica e informáticaElectrical engineering, Electronic engineering, Information engineeringIn a society where the data is transmitted and consumed in a large scale at great speed, to do so its needed to find the best way to process it and maximize the outcome. The aim of this work is to find tecnologies that are able to gather, model and process the data. Its in this context that comes the necessity of time series analysis in order to identity "where", "when" and "how" the clients of the telecomunication company interact with the services, and from that gaining the ability to control the moments of contact, the outcome from this monitorization is alot of data asking to be processed. The aim is to find tecnologies and way to model that are better suitable, using time series techniques, to factors like scalability, performance, cost of tools, physical support infrastructure and system operational costs. The queries performed to agregattion of information are, due to high amount and type of data, expensive in both computational resources and time from that comes problems because this data needs to be used for KPI's for Business Intelligence and those need to be available on time from the decision process. In the proponent case, the telecommunication companies, they diferentiate from one another for the quality of the service that they offer to the customers, that being so, its important to define methodologies and techniques that can compare several alternatives versus what is current implemmented and that way improve quality of services and get efficiency/economic benefits to the telecommunication company in their data management. After searching for alternatives, 3 were selected according to the user requirements for this project, those being Postgres, Citus and Timescaledb. To which data was model and insert to match project's proponente real use case and then each were put under tests to see the query performance and monitoring the machines in which the technologies were, using tools used by the company like jmeter and zabbix to get the results. After analysing the results, the conclusion is that the best choice is Citus, both to current use case evaluated and thinking on the scalability of the solution.2017-07-182017-07-18T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttps://hdl.handle.net/10216/106807TID:201802074porPaulo Manuel da Silva Fariainfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-29T15:49:22Zoai:repositorio-aberto.up.pt:10216/106807Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T00:33:04.899319Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
2016_N76 - Tecnologias e modelos de suporte a analytics sobre séries temporais |
title |
2016_N76 - Tecnologias e modelos de suporte a analytics sobre séries temporais |
spellingShingle |
2016_N76 - Tecnologias e modelos de suporte a analytics sobre séries temporais Paulo Manuel da Silva Faria Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
title_short |
2016_N76 - Tecnologias e modelos de suporte a analytics sobre séries temporais |
title_full |
2016_N76 - Tecnologias e modelos de suporte a analytics sobre séries temporais |
title_fullStr |
2016_N76 - Tecnologias e modelos de suporte a analytics sobre séries temporais |
title_full_unstemmed |
2016_N76 - Tecnologias e modelos de suporte a analytics sobre séries temporais |
title_sort |
2016_N76 - Tecnologias e modelos de suporte a analytics sobre séries temporais |
author |
Paulo Manuel da Silva Faria |
author_facet |
Paulo Manuel da Silva Faria |
author_role |
author |
dc.contributor.author.fl_str_mv |
Paulo Manuel da Silva Faria |
dc.subject.por.fl_str_mv |
Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
topic |
Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
description |
In a society where the data is transmitted and consumed in a large scale at great speed, to do so its needed to find the best way to process it and maximize the outcome. The aim of this work is to find tecnologies that are able to gather, model and process the data. Its in this context that comes the necessity of time series analysis in order to identity "where", "when" and "how" the clients of the telecomunication company interact with the services, and from that gaining the ability to control the moments of contact, the outcome from this monitorization is alot of data asking to be processed. The aim is to find tecnologies and way to model that are better suitable, using time series techniques, to factors like scalability, performance, cost of tools, physical support infrastructure and system operational costs. The queries performed to agregattion of information are, due to high amount and type of data, expensive in both computational resources and time from that comes problems because this data needs to be used for KPI's for Business Intelligence and those need to be available on time from the decision process. In the proponent case, the telecommunication companies, they diferentiate from one another for the quality of the service that they offer to the customers, that being so, its important to define methodologies and techniques that can compare several alternatives versus what is current implemmented and that way improve quality of services and get efficiency/economic benefits to the telecommunication company in their data management. After searching for alternatives, 3 were selected according to the user requirements for this project, those being Postgres, Citus and Timescaledb. To which data was model and insert to match project's proponente real use case and then each were put under tests to see the query performance and monitoring the machines in which the technologies were, using tools used by the company like jmeter and zabbix to get the results. After analysing the results, the conclusion is that the best choice is Citus, both to current use case evaluated and thinking on the scalability of the solution. |
publishDate |
2017 |
dc.date.none.fl_str_mv |
2017-07-18 2017-07-18T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://hdl.handle.net/10216/106807 TID:201802074 |
url |
https://hdl.handle.net/10216/106807 |
identifier_str_mv |
TID:201802074 |
dc.language.iso.fl_str_mv |
por |
language |
por |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799136239246376960 |