Pentágono da privacidade no big data privacy analytics: proposta de modelo multifacetado de garantia da privacidade e do valor analítico

Brenner Lopes

Pentágono da privacidade no big data privacy analytics: proposta de modelo multifacetado de garantia da privacidade e do valor analítico

Detalhes bibliográficos
Autor(a) principal:	Brenner Lopes
Data de Publicação:	2024
Tipo de documento:	Tese
Idioma:	por
Título da fonte:	Repositório Institucional da UFMG
Texto Completo:	http://hdl.handle.net/1843/74112
Resumo:	The continuous increase in the amount of data generated and available has created an environment characterized by a gigantic mass of data, which grows not only in terms of volume and quantity, but also in terms of variety, being created and moving at high speed. If before, the focus and availability of data had as a priority interface the so-called structured data, today they are in much smaller quantity and importance, and adjustments and improvements in technologies and analyzes were partly carried out to adapt to this new reality which is conventionally called big data. The value created by big data analysis has brought many positive factors to different fields, but, on the other hand, it has also brought some problems. One of the issues of great relevance in this context that has raised great concern in society are the threats to privacy, brought about by advanced analyzes of large volumes of data. The question raised as a result of several researches is that the procedures, techniques and technologies currently available cannot fully guarantee privacy. As well as legislation that regulates data protection and privacy, such as in the Brazilian case the General Data Protection Law (LGPD). On the other hand, it is possible to obtain a very high level of privacy, but at the cost of nullifying the possibilities of extracting value from big data. Given this complex scenario, the focus of this research is to propose a multifaceted model (because it needs to be composed of multiple approaches and constructs) within the scope of big data analytics, which guarantees privacy, at the same time, in which it does not make it impossible to extract value. The methodology proposed for this work will be based on a quantitative approach, using text mining and unsupervised machine learning with a focus on creating clusters. After the quantitative phase, the privacy policies of 28 companies were analyzed using a qualitative approach, with two main objectives: finding the best number of clusters; and obtain an understanding of their distinctive characteristics, in order to improve the analysis carried out. The privacy policies of the thousand largest companies in Brazil were selected, classified as such by the Valor1000 ranking (2021). The results pointed to the consistency of the Privacy Pentagon model in the proposed Big Data Analytics; as well as what was called “opaque privacy” in this work.

Metadados do item

id	UFMG_e0c641d953cf84ad44b899727814fbaf
oai_identifier_str	oai:repositorio.ufmg.br:1843/74112
network_acronym_str	UFMG
network_name_str	Repositório Institucional da UFMG
repository_id_str
spelling	Pentágono da privacidade no big data privacy analytics: proposta de modelo multifacetado de garantia da privacidade e do valor analíticoPrivacidadeBig Data AnalyticsBig Data PrivacyValor do big dataClusteringAprendizagem de máquina não supervisionadoMineração de textoCiência da informaçãoBig dataDireito à privacidadeProteção de dadosLei Geral de Proteção de Dados Pessoais (LGPD)The continuous increase in the amount of data generated and available has created an environment characterized by a gigantic mass of data, which grows not only in terms of volume and quantity, but also in terms of variety, being created and moving at high speed. If before, the focus and availability of data had as a priority interface the so-called structured data, today they are in much smaller quantity and importance, and adjustments and improvements in technologies and analyzes were partly carried out to adapt to this new reality which is conventionally called big data. The value created by big data analysis has brought many positive factors to different fields, but, on the other hand, it has also brought some problems. One of the issues of great relevance in this context that has raised great concern in society are the threats to privacy, brought about by advanced analyzes of large volumes of data. The question raised as a result of several researches is that the procedures, techniques and technologies currently available cannot fully guarantee privacy. As well as legislation that regulates data protection and privacy, such as in the Brazilian case the General Data Protection Law (LGPD). On the other hand, it is possible to obtain a very high level of privacy, but at the cost of nullifying the possibilities of extracting value from big data. Given this complex scenario, the focus of this research is to propose a multifaceted model (because it needs to be composed of multiple approaches and constructs) within the scope of big data analytics, which guarantees privacy, at the same time, in which it does not make it impossible to extract value. The methodology proposed for this work will be based on a quantitative approach, using text mining and unsupervised machine learning with a focus on creating clusters. After the quantitative phase, the privacy policies of 28 companies were analyzed using a qualitative approach, with two main objectives: finding the best number of clusters; and obtain an understanding of their distinctive characteristics, in order to improve the analysis carried out. The privacy policies of the thousand largest companies in Brazil were selected, classified as such by the Valor1000 ranking (2021). The results pointed to the consistency of the Privacy Pentagon model in the proposed Big Data Analytics; as well as what was called “opaque privacy” in this work.O contínuo aumento da quantidade de dados gerados e disponíveis, criou um ambiente caracterizado por uma gigantesca massa de dados, que crescem não só quanto ao seu volume e quantidade, mas também em termos de variedade, sendo criado e transitando em alta velocidade. Se antes, o foco e a disponibilidade de dados, tinha como interface prioritária os chamados dados estruturados, hoje eles estão em quantidade e importância bem menores, e os ajustes e aprimoramentos nas tecnologias e análises foram em parte realizados para se adaptarem a essa nova realidade que se convencionou chamar de big data. O valor criado pelas análises de big data, têm trazido muitos fatores positivos para diversos campos, como também, por outro lado, têm trazido alguns problemas. Uma das questões de grande relevância nesse contexto que têm suscitado grande preocupação à sociedade são as ameaças à privacidade, trazidas pelas análises avançadas de grandes volumes de dados. A questão posta como resultado de diversas pesquisas, é que os procedimentos, técnicas e tecnologias atualmente disponíveis não conseguem dar garantia plena à privacidade. Assim como as legislações que disciplinam a proteção e privacidade de dados, como no caso brasileiro a Lei Geral de Proteção de Dados (LGPD). Por outro lado, é possível obter um elevadíssimo nível de privacidade, mas ao custo da nulidade das possibilidades de extração de valor do big data. Diante desse complexo cenário, o foco dessa pesquisa está em propor um modelo multifacetado (porque precisa ser composto por múltiplas abordagens e construtos) no âmbito do big data analytics, que garanta a privacidade, ao mesmo tempo, em que não inviabilize sua extração de valor. A metodologia proposta para esse trabalho, estará baseada na abordagem quantitativa, com o emprego de mineração de texto e aprendizado de máquina não supervisionado com foco na criação de agrupamentos. Após a fase quantitativa foram analisadas a partir de uma abordagem qualitativa as políticas de privacidade de 28 empresas, com dois objetivos principais: encontrar o melhor número de clusters; e obter um entendimento de características distintivas dos mesmos, de forma a aprimorar a análise realizada. Foram selecionadas as políticas de privacidade das mil maiores empresas do Brasil, assim classificadas pelo ranking Valor1000 (2021). Os resultados apontaram para a consistência do modelo do Pentágono da Privacidade no Big Data Analytics proposto; assim como para o que foi denominado neste trabalho de “privacidade opaca”.Universidade Federal de Minas GeraisBrasilECI - ESCOLA DE CIENCIA DA INFORMAÇÃOPrograma de Pós-Graduação em Gestão e Organização do ConhecimentoUFMGRicardo Rodrigues Barbosahttp://lattes.cnpq.br/6523065261260011George Leal JamilCristiana Fernandes de MuylderDarly Fernando AndradeAndre Luiz de Castro LealBrenner Lopes2024-08-16T13:15:25Z2024-08-16T13:15:25Z2024-02-22info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/doctoralThesisapplication/pdfhttp://hdl.handle.net/1843/741120000-0002-5807-0437porinfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da UFMGinstname:Universidade Federal de Minas Gerais (UFMG)instacron:UFMG2024-08-16T13:15:26Zoai:repositorio.ufmg.br:1843/74112Repositório InstitucionalPUBhttps://repositorio.ufmg.br/oairepositorio@ufmg.bropendoar:2024-08-16T13:15:26Repositório Institucional da UFMG - Universidade Federal de Minas Gerais (UFMG)false
dc.title.none.fl_str_mv	Pentágono da privacidade no big data privacy analytics: proposta de modelo multifacetado de garantia da privacidade e do valor analítico
title	Pentágono da privacidade no big data privacy analytics: proposta de modelo multifacetado de garantia da privacidade e do valor analítico
spellingShingle	Pentágono da privacidade no big data privacy analytics: proposta de modelo multifacetado de garantia da privacidade e do valor analítico Brenner Lopes Privacidade Big Data Analytics Big Data Privacy Valor do big data Clustering Aprendizagem de máquina não supervisionado Mineração de texto Ciência da informação Big data Direito à privacidade Proteção de dados Lei Geral de Proteção de Dados Pessoais (LGPD)
title_short	Pentágono da privacidade no big data privacy analytics: proposta de modelo multifacetado de garantia da privacidade e do valor analítico
title_full	Pentágono da privacidade no big data privacy analytics: proposta de modelo multifacetado de garantia da privacidade e do valor analítico
title_fullStr	Pentágono da privacidade no big data privacy analytics: proposta de modelo multifacetado de garantia da privacidade e do valor analítico
title_full_unstemmed	Pentágono da privacidade no big data privacy analytics: proposta de modelo multifacetado de garantia da privacidade e do valor analítico
title_sort	Pentágono da privacidade no big data privacy analytics: proposta de modelo multifacetado de garantia da privacidade e do valor analítico
author	Brenner Lopes
author_facet	Brenner Lopes
author_role	author
dc.contributor.none.fl_str_mv	Ricardo Rodrigues Barbosa http://lattes.cnpq.br/6523065261260011 George Leal Jamil Cristiana Fernandes de Muylder Darly Fernando Andrade Andre Luiz de Castro Leal
dc.contributor.author.fl_str_mv	Brenner Lopes
dc.subject.por.fl_str_mv	Privacidade Big Data Analytics Big Data Privacy Valor do big data Clustering Aprendizagem de máquina não supervisionado Mineração de texto Ciência da informação Big data Direito à privacidade Proteção de dados Lei Geral de Proteção de Dados Pessoais (LGPD)
topic	Privacidade Big Data Analytics Big Data Privacy Valor do big data Clustering Aprendizagem de máquina não supervisionado Mineração de texto Ciência da informação Big data Direito à privacidade Proteção de dados Lei Geral de Proteção de Dados Pessoais (LGPD)
description	The continuous increase in the amount of data generated and available has created an environment characterized by a gigantic mass of data, which grows not only in terms of volume and quantity, but also in terms of variety, being created and moving at high speed. If before, the focus and availability of data had as a priority interface the so-called structured data, today they are in much smaller quantity and importance, and adjustments and improvements in technologies and analyzes were partly carried out to adapt to this new reality which is conventionally called big data. The value created by big data analysis has brought many positive factors to different fields, but, on the other hand, it has also brought some problems. One of the issues of great relevance in this context that has raised great concern in society are the threats to privacy, brought about by advanced analyzes of large volumes of data. The question raised as a result of several researches is that the procedures, techniques and technologies currently available cannot fully guarantee privacy. As well as legislation that regulates data protection and privacy, such as in the Brazilian case the General Data Protection Law (LGPD). On the other hand, it is possible to obtain a very high level of privacy, but at the cost of nullifying the possibilities of extracting value from big data. Given this complex scenario, the focus of this research is to propose a multifaceted model (because it needs to be composed of multiple approaches and constructs) within the scope of big data analytics, which guarantees privacy, at the same time, in which it does not make it impossible to extract value. The methodology proposed for this work will be based on a quantitative approach, using text mining and unsupervised machine learning with a focus on creating clusters. After the quantitative phase, the privacy policies of 28 companies were analyzed using a qualitative approach, with two main objectives: finding the best number of clusters; and obtain an understanding of their distinctive characteristics, in order to improve the analysis carried out. The privacy policies of the thousand largest companies in Brazil were selected, classified as such by the Valor1000 ranking (2021). The results pointed to the consistency of the Privacy Pentagon model in the proposed Big Data Analytics; as well as what was called “opaque privacy” in this work.
publishDate	2024
dc.date.none.fl_str_mv	2024-08-16T13:15:25Z 2024-08-16T13:15:25Z 2024-02-22
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/doctoralThesis
format	doctoralThesis
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	http://hdl.handle.net/1843/74112 0000-0002-5807-0437
url	http://hdl.handle.net/1843/74112
identifier_str_mv	0000-0002-5807-0437
dc.language.iso.fl_str_mv	por
language	por
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	Universidade Federal de Minas Gerais Brasil ECI - ESCOLA DE CIENCIA DA INFORMAÇÃO Programa de Pós-Graduação em Gestão e Organização do Conhecimento UFMG
publisher.none.fl_str_mv	Universidade Federal de Minas Gerais Brasil ECI - ESCOLA DE CIENCIA DA INFORMAÇÃO Programa de Pós-Graduação em Gestão e Organização do Conhecimento UFMG
dc.source.none.fl_str_mv	reponame:Repositório Institucional da UFMG instname:Universidade Federal de Minas Gerais (UFMG) instacron:UFMG
instname_str	Universidade Federal de Minas Gerais (UFMG)
instacron_str	UFMG
institution	UFMG
reponame_str	Repositório Institucional da UFMG
collection	Repositório Institucional da UFMG
repository.name.fl_str_mv	Repositório Institucional da UFMG - Universidade Federal de Minas Gerais (UFMG)
repository.mail.fl_str_mv	repositorio@ufmg.br
_version_	1823248033478868992

Pentágono da privacidade no big data privacy analytics: proposta de modelo multifacetado de garantia da privacidade e do valor analítico

Registros relacionados