Data Analysis in the BIG DATA scope in Basketball
Autor(a) principal: | |
---|---|
Data de Publicação: | 2022 |
Tipo de documento: | Dissertação |
Idioma: | por |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10400.22/21001 |
Resumo: | Nowadays we are witnessing a great growth in the generation, storage and treatment of large amounts of data. These data are generated by different sources, such as the Web, IoT devices, computers software, smartphone apps, sports and so on. Big Data is a term used for large, varied and complex sets of data, with difficulties in storage, analysis and visualization for later processes or results. The process of searching large amounts of data to reveal hidden patterns and correlations is called Big Data data analysis. This information is useful for companies or organizations and with the help of numerical and computational methods, results can be obtained in a short space of time. For this reason, data implementations in Big Data need to be analysed and executed as accurately as possible. With the amount of data generated, data storage is being crucial. The huge increase in data does not stop and data analysis and visualization are adding to the Big Data era with the amount of data generated by computers, social networks, mobile devices, data collection in sports, etc. This research presents an overview of the content, scope, samples, methods, advantages, challenges and concerns of data analysis in Big Data. Basketball is one of the examples, where we will work with the data, applying types of analysis and data visualization, to understand them and in the end, show their results. Using Clustering algorithms and, with criteria defined in the being of the problems, we will have the same information spread over two or more clusters. Important steps, such as the analysis of each of the indicators and, the objective, we determine rule settings for the expected result. In the results demonstration, we verified that the applied clustering algorithm, K-Means, obtained good results comparing with other data. With the completion of this work, we can better understand the scope of Big Data and apply mathematical clustering methods to extract useful information from large amounts of data. |
id |
RCAP_071f664e7995204ccbc9337623f230b5 |
---|---|
oai_identifier_str |
oai:recipp.ipp.pt:10400.22/21001 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Data Analysis in the BIG DATA scope in BasketballBig DataData analysisK-MeansComputational and mathematical methodsUnstructured dataBig DataAnálise de dadosMétodos computacionais e matemáticosDados não estruturadosNowadays we are witnessing a great growth in the generation, storage and treatment of large amounts of data. These data are generated by different sources, such as the Web, IoT devices, computers software, smartphone apps, sports and so on. Big Data is a term used for large, varied and complex sets of data, with difficulties in storage, analysis and visualization for later processes or results. The process of searching large amounts of data to reveal hidden patterns and correlations is called Big Data data analysis. This information is useful for companies or organizations and with the help of numerical and computational methods, results can be obtained in a short space of time. For this reason, data implementations in Big Data need to be analysed and executed as accurately as possible. With the amount of data generated, data storage is being crucial. The huge increase in data does not stop and data analysis and visualization are adding to the Big Data era with the amount of data generated by computers, social networks, mobile devices, data collection in sports, etc. This research presents an overview of the content, scope, samples, methods, advantages, challenges and concerns of data analysis in Big Data. Basketball is one of the examples, where we will work with the data, applying types of analysis and data visualization, to understand them and in the end, show their results. Using Clustering algorithms and, with criteria defined in the being of the problems, we will have the same information spread over two or more clusters. Important steps, such as the analysis of each of the indicators and, the objective, we determine rule settings for the expected result. In the results demonstration, we verified that the applied clustering algorithm, K-Means, obtained good results comparing with other data. With the completion of this work, we can better understand the scope of Big Data and apply mathematical clustering methods to extract useful information from large amounts of data.Atualmente assistimos a um grande crescimento na geração, armazenamento e tratamento de grandes quantidades de dados. Esses dados, gerados por diferentes fontes, como a Web, dispositivos IoT, aplicações computacionais, aplicativos de smartphones, desporto, entre outros. Big Data é um termo usado para um grande conjunto de dados, variados e complexos, com dificuldades de armazenamento, análise e visualização para processos ou resultados posteriores. O processo de pesquisa em grandes quantidades de dados para revelar padrões ocultos e correlações é chamado de análise de dados em Big Data. Essas informações são úteis para empresas ou organizações e com a ajuda de métodos numéricos e computacionais, pode-se obter resultados num espaço curto de tempo. Por esse motivo, as implementações de dados em Big Data precisam ser analisadas e executadas com a maior precisão possível. Com a quantidade de dados gerados, o armazenamento de dados está sendo crucial. O aumento enorme dos dados não para e a análise e visualização de dados estão agregando a era do Big Data com a quantidade de dados gerados por computadores, redes sociais, dispositivos móveis, coleta de dados em desporto, etc. Esta pesquisa apresenta uma visão geral do conteúdo, âmbito, amostras, métodos, vantagens, desafios e preocupações da análise de dados em Big Data. O Basketball é um dos exemplos, onde trabalharemos os dados, aplicando tipos de análise e visualização de dados, para entendê-los e no final, mostrar os seus resultados. Utilizando os algoritmos de Clustering e, com critérios definidos no ser dos problemas, teremos a mesma informações espalhadas por dois ou mais clusters. Etapas importantes, como a análise de cada um dos indicadores e, o objetivo, determinamos configurações de regras para o resultado esperado. Na demonstração de resultados, verificamos que o algoritmo de clustering aplicado, K-Means, obteve bom resultados comparando com outros dados. métodos matemáticos de clustering para extrair informações úteis de grandes quantidades de dados.Campos, Carlos José RibeiroRepositório Científico do Instituto Politécnico do PortoAlves, Diogo Filipe Pinto2023-07-25T00:32:08Z20222022-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10400.22/21001TID:203086570porinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-07-26T01:49:09Zoai:recipp.ipp.pt:10400.22/21001Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T17:41:05.369821Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Data Analysis in the BIG DATA scope in Basketball |
title |
Data Analysis in the BIG DATA scope in Basketball |
spellingShingle |
Data Analysis in the BIG DATA scope in Basketball Alves, Diogo Filipe Pinto Big Data Data analysis K-Means Computational and mathematical methods Unstructured data Big Data Análise de dados Métodos computacionais e matemáticos Dados não estruturados |
title_short |
Data Analysis in the BIG DATA scope in Basketball |
title_full |
Data Analysis in the BIG DATA scope in Basketball |
title_fullStr |
Data Analysis in the BIG DATA scope in Basketball |
title_full_unstemmed |
Data Analysis in the BIG DATA scope in Basketball |
title_sort |
Data Analysis in the BIG DATA scope in Basketball |
author |
Alves, Diogo Filipe Pinto |
author_facet |
Alves, Diogo Filipe Pinto |
author_role |
author |
dc.contributor.none.fl_str_mv |
Campos, Carlos José Ribeiro Repositório Científico do Instituto Politécnico do Porto |
dc.contributor.author.fl_str_mv |
Alves, Diogo Filipe Pinto |
dc.subject.por.fl_str_mv |
Big Data Data analysis K-Means Computational and mathematical methods Unstructured data Big Data Análise de dados Métodos computacionais e matemáticos Dados não estruturados |
topic |
Big Data Data analysis K-Means Computational and mathematical methods Unstructured data Big Data Análise de dados Métodos computacionais e matemáticos Dados não estruturados |
description |
Nowadays we are witnessing a great growth in the generation, storage and treatment of large amounts of data. These data are generated by different sources, such as the Web, IoT devices, computers software, smartphone apps, sports and so on. Big Data is a term used for large, varied and complex sets of data, with difficulties in storage, analysis and visualization for later processes or results. The process of searching large amounts of data to reveal hidden patterns and correlations is called Big Data data analysis. This information is useful for companies or organizations and with the help of numerical and computational methods, results can be obtained in a short space of time. For this reason, data implementations in Big Data need to be analysed and executed as accurately as possible. With the amount of data generated, data storage is being crucial. The huge increase in data does not stop and data analysis and visualization are adding to the Big Data era with the amount of data generated by computers, social networks, mobile devices, data collection in sports, etc. This research presents an overview of the content, scope, samples, methods, advantages, challenges and concerns of data analysis in Big Data. Basketball is one of the examples, where we will work with the data, applying types of analysis and data visualization, to understand them and in the end, show their results. Using Clustering algorithms and, with criteria defined in the being of the problems, we will have the same information spread over two or more clusters. Important steps, such as the analysis of each of the indicators and, the objective, we determine rule settings for the expected result. In the results demonstration, we verified that the applied clustering algorithm, K-Means, obtained good results comparing with other data. With the completion of this work, we can better understand the scope of Big Data and apply mathematical clustering methods to extract useful information from large amounts of data. |
publishDate |
2022 |
dc.date.none.fl_str_mv |
2022 2022-01-01T00:00:00Z 2023-07-25T00:32:08Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10400.22/21001 TID:203086570 |
url |
http://hdl.handle.net/10400.22/21001 |
identifier_str_mv |
TID:203086570 |
dc.language.iso.fl_str_mv |
por |
language |
por |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799131499343118336 |