Data Analysis in the BIG DATA scope in Basketball

Detalhes bibliográficos
Autor(a) principal: Alves, Diogo Filipe Pinto
Data de Publicação: 2022
Tipo de documento: Dissertação
Idioma: por
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10400.22/21001
Resumo: Nowadays we are witnessing a great growth in the generation, storage and treatment of large amounts of data. These data are generated by different sources, such as the Web, IoT devices, computers software, smartphone apps, sports and so on. Big Data is a term used for large, varied and complex sets of data, with difficulties in storage, analysis and visualization for later processes or results. The process of searching large amounts of data to reveal hidden patterns and correlations is called Big Data data analysis. This information is useful for companies or organizations and with the help of numerical and computational methods, results can be obtained in a short space of time. For this reason, data implementations in Big Data need to be analysed and executed as accurately as possible. With the amount of data generated, data storage is being crucial. The huge increase in data does not stop and data analysis and visualization are adding to the Big Data era with the amount of data generated by computers, social networks, mobile devices, data collection in sports, etc. This research presents an overview of the content, scope, samples, methods, advantages, challenges and concerns of data analysis in Big Data. Basketball is one of the examples, where we will work with the data, applying types of analysis and data visualization, to understand them and in the end, show their results. Using Clustering algorithms and, with criteria defined in the being of the problems, we will have the same information spread over two or more clusters. Important steps, such as the analysis of each of the indicators and, the objective, we determine rule settings for the expected result. In the results demonstration, we verified that the applied clustering algorithm, K-Means, obtained good results comparing with other data. With the completion of this work, we can better understand the scope of Big Data and apply mathematical clustering methods to extract useful information from large amounts of data.
id RCAP_071f664e7995204ccbc9337623f230b5
oai_identifier_str oai:recipp.ipp.pt:10400.22/21001
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Data Analysis in the BIG DATA scope in BasketballBig DataData analysisK-MeansComputational and mathematical methodsUnstructured dataBig DataAnálise de dadosMétodos computacionais e matemáticosDados não estruturadosNowadays we are witnessing a great growth in the generation, storage and treatment of large amounts of data. These data are generated by different sources, such as the Web, IoT devices, computers software, smartphone apps, sports and so on. Big Data is a term used for large, varied and complex sets of data, with difficulties in storage, analysis and visualization for later processes or results. The process of searching large amounts of data to reveal hidden patterns and correlations is called Big Data data analysis. This information is useful for companies or organizations and with the help of numerical and computational methods, results can be obtained in a short space of time. For this reason, data implementations in Big Data need to be analysed and executed as accurately as possible. With the amount of data generated, data storage is being crucial. The huge increase in data does not stop and data analysis and visualization are adding to the Big Data era with the amount of data generated by computers, social networks, mobile devices, data collection in sports, etc. This research presents an overview of the content, scope, samples, methods, advantages, challenges and concerns of data analysis in Big Data. Basketball is one of the examples, where we will work with the data, applying types of analysis and data visualization, to understand them and in the end, show their results. Using Clustering algorithms and, with criteria defined in the being of the problems, we will have the same information spread over two or more clusters. Important steps, such as the analysis of each of the indicators and, the objective, we determine rule settings for the expected result. In the results demonstration, we verified that the applied clustering algorithm, K-Means, obtained good results comparing with other data. With the completion of this work, we can better understand the scope of Big Data and apply mathematical clustering methods to extract useful information from large amounts of data.Atualmente assistimos a um grande crescimento na geração, armazenamento e tratamento de grandes quantidades de dados. Esses dados, gerados por diferentes fontes, como a Web, dispositivos IoT, aplicações computacionais, aplicativos de smartphones, desporto, entre outros. Big Data é um termo usado para um grande conjunto de dados, variados e complexos, com dificuldades de armazenamento, análise e visualização para processos ou resultados posteriores. O processo de pesquisa em grandes quantidades de dados para revelar padrões ocultos e correlações é chamado de análise de dados em Big Data. Essas informações são úteis para empresas ou organizações e com a ajuda de métodos numéricos e computacionais, pode-se obter resultados num espaço curto de tempo. Por esse motivo, as implementações de dados em Big Data precisam ser analisadas e executadas com a maior precisão possível. Com a quantidade de dados gerados, o armazenamento de dados está sendo crucial. O aumento enorme dos dados não para e a análise e visualização de dados estão agregando a era do Big Data com a quantidade de dados gerados por computadores, redes sociais, dispositivos móveis, coleta de dados em desporto, etc. Esta pesquisa apresenta uma visão geral do conteúdo, âmbito, amostras, métodos, vantagens, desafios e preocupações da análise de dados em Big Data. O Basketball é um dos exemplos, onde trabalharemos os dados, aplicando tipos de análise e visualização de dados, para entendê-los e no final, mostrar os seus resultados. Utilizando os algoritmos de Clustering e, com critérios definidos no ser dos problemas, teremos a mesma informações espalhadas por dois ou mais clusters. Etapas importantes, como a análise de cada um dos indicadores e, o objetivo, determinamos configurações de regras para o resultado esperado. Na demonstração de resultados, verificamos que o algoritmo de clustering aplicado, K-Means, obteve bom resultados comparando com outros dados. métodos matemáticos de clustering para extrair informações úteis de grandes quantidades de dados.Campos, Carlos José RibeiroRepositório Científico do Instituto Politécnico do PortoAlves, Diogo Filipe Pinto2023-07-25T00:32:08Z20222022-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10400.22/21001TID:203086570porinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-07-26T01:49:09Zoai:recipp.ipp.pt:10400.22/21001Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T17:41:05.369821Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Data Analysis in the BIG DATA scope in Basketball
title Data Analysis in the BIG DATA scope in Basketball
spellingShingle Data Analysis in the BIG DATA scope in Basketball
Alves, Diogo Filipe Pinto
Big Data
Data analysis
K-Means
Computational and mathematical methods
Unstructured data
Big Data
Análise de dados
Métodos computacionais e matemáticos
Dados não estruturados
title_short Data Analysis in the BIG DATA scope in Basketball
title_full Data Analysis in the BIG DATA scope in Basketball
title_fullStr Data Analysis in the BIG DATA scope in Basketball
title_full_unstemmed Data Analysis in the BIG DATA scope in Basketball
title_sort Data Analysis in the BIG DATA scope in Basketball
author Alves, Diogo Filipe Pinto
author_facet Alves, Diogo Filipe Pinto
author_role author
dc.contributor.none.fl_str_mv Campos, Carlos José Ribeiro
Repositório Científico do Instituto Politécnico do Porto
dc.contributor.author.fl_str_mv Alves, Diogo Filipe Pinto
dc.subject.por.fl_str_mv Big Data
Data analysis
K-Means
Computational and mathematical methods
Unstructured data
Big Data
Análise de dados
Métodos computacionais e matemáticos
Dados não estruturados
topic Big Data
Data analysis
K-Means
Computational and mathematical methods
Unstructured data
Big Data
Análise de dados
Métodos computacionais e matemáticos
Dados não estruturados
description Nowadays we are witnessing a great growth in the generation, storage and treatment of large amounts of data. These data are generated by different sources, such as the Web, IoT devices, computers software, smartphone apps, sports and so on. Big Data is a term used for large, varied and complex sets of data, with difficulties in storage, analysis and visualization for later processes or results. The process of searching large amounts of data to reveal hidden patterns and correlations is called Big Data data analysis. This information is useful for companies or organizations and with the help of numerical and computational methods, results can be obtained in a short space of time. For this reason, data implementations in Big Data need to be analysed and executed as accurately as possible. With the amount of data generated, data storage is being crucial. The huge increase in data does not stop and data analysis and visualization are adding to the Big Data era with the amount of data generated by computers, social networks, mobile devices, data collection in sports, etc. This research presents an overview of the content, scope, samples, methods, advantages, challenges and concerns of data analysis in Big Data. Basketball is one of the examples, where we will work with the data, applying types of analysis and data visualization, to understand them and in the end, show their results. Using Clustering algorithms and, with criteria defined in the being of the problems, we will have the same information spread over two or more clusters. Important steps, such as the analysis of each of the indicators and, the objective, we determine rule settings for the expected result. In the results demonstration, we verified that the applied clustering algorithm, K-Means, obtained good results comparing with other data. With the completion of this work, we can better understand the scope of Big Data and apply mathematical clustering methods to extract useful information from large amounts of data.
publishDate 2022
dc.date.none.fl_str_mv 2022
2022-01-01T00:00:00Z
2023-07-25T00:32:08Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10400.22/21001
TID:203086570
url http://hdl.handle.net/10400.22/21001
identifier_str_mv TID:203086570
dc.language.iso.fl_str_mv por
language por
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799131499343118336