Anonimização de informação clínica

Detalhes bibliográficos
Autor(a) principal: José Miguel de Sousa Pessanha Pereira de Melo
Data de Publicação: 2017
Tipo de documento: Dissertação
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: https://hdl.handle.net/10216/106216
Resumo: Over the past years, with the progress of technology, the amount of data being collected by the IT systems has exponentially grown. By using data mining techniques, this data can be analyzed to find trends and statistics, which are really useful for all companies and industries. The analysis and data sharing for studies became a large industry, with a great impact in all sectors. However, with this comes the concern with individual privacy - there is a huge amount of data which is private and should not be public in any circumstances - so, it is highly needed to find a solution to share and analyse data while protecting privacy. Nevertheless, it is truly important to take into account performance issues as the anonymization process should not hinder the normal functioning of the operational system. The focus goes to clinical data, which allows medical researchers to learn trends, statistics and relations between certain clinical attributes, such as correlations between gender and a specific disease. These studies and data analysis are very important as they can bring great benefits and knowledge in healthcare. However, maintaining individual privacy is crucial. In order to solve this problem, a new solution will be proposed and developed to efficiently anonymize data. With this solution, the user can quickly and efficiently anonymize a given dataset, according to the initially provided configurations. The tool receives as input the dataset to anonymize and a minimal configuration to specify the required anonymization parameters, and returns as output the corresponding anonymized dataset. The anonymization is done recurring to well known models and algorithms to protect privacy, associated with specific clinical criteria, restrictions and hierarchies. This solution will be validated using a subset of clinical data that needs to be anonymized. After the anonymization process, an anonymized version of the subset is expected that meets the selected privacy model, balancing enough privacy versus keeping research value. As well as the result, the performance will be evaluated in order to validate the solution's applicability. It is intended with this solution to surpass the issue of individual data privacy when sharing data and to have impact in healthcare studies, allowing them to continue without connecting any data to any specific identity, protecting individual privacy.
id RCAP_184ddca54cd6b666d19c1a79273553dc
oai_identifier_str oai:repositorio-aberto.up.pt:10216/106216
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Anonimização de informação clínicaEngenharia electrotécnica, electrónica e informáticaElectrical engineering, Electronic engineering, Information engineeringOver the past years, with the progress of technology, the amount of data being collected by the IT systems has exponentially grown. By using data mining techniques, this data can be analyzed to find trends and statistics, which are really useful for all companies and industries. The analysis and data sharing for studies became a large industry, with a great impact in all sectors. However, with this comes the concern with individual privacy - there is a huge amount of data which is private and should not be public in any circumstances - so, it is highly needed to find a solution to share and analyse data while protecting privacy. Nevertheless, it is truly important to take into account performance issues as the anonymization process should not hinder the normal functioning of the operational system. The focus goes to clinical data, which allows medical researchers to learn trends, statistics and relations between certain clinical attributes, such as correlations between gender and a specific disease. These studies and data analysis are very important as they can bring great benefits and knowledge in healthcare. However, maintaining individual privacy is crucial. In order to solve this problem, a new solution will be proposed and developed to efficiently anonymize data. With this solution, the user can quickly and efficiently anonymize a given dataset, according to the initially provided configurations. The tool receives as input the dataset to anonymize and a minimal configuration to specify the required anonymization parameters, and returns as output the corresponding anonymized dataset. The anonymization is done recurring to well known models and algorithms to protect privacy, associated with specific clinical criteria, restrictions and hierarchies. This solution will be validated using a subset of clinical data that needs to be anonymized. After the anonymization process, an anonymized version of the subset is expected that meets the selected privacy model, balancing enough privacy versus keeping research value. As well as the result, the performance will be evaluated in order to validate the solution's applicability. It is intended with this solution to surpass the issue of individual data privacy when sharing data and to have impact in healthcare studies, allowing them to continue without connecting any data to any specific identity, protecting individual privacy.2017-07-172017-07-17T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttps://hdl.handle.net/10216/106216TID:201799944engJosé Miguel de Sousa Pessanha Pereira de Meloinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-29T12:56:27Zoai:repositorio-aberto.up.pt:10216/106216Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T23:30:01.616299Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Anonimização de informação clínica
title Anonimização de informação clínica
spellingShingle Anonimização de informação clínica
José Miguel de Sousa Pessanha Pereira de Melo
Engenharia electrotécnica, electrónica e informática
Electrical engineering, Electronic engineering, Information engineering
title_short Anonimização de informação clínica
title_full Anonimização de informação clínica
title_fullStr Anonimização de informação clínica
title_full_unstemmed Anonimização de informação clínica
title_sort Anonimização de informação clínica
author José Miguel de Sousa Pessanha Pereira de Melo
author_facet José Miguel de Sousa Pessanha Pereira de Melo
author_role author
dc.contributor.author.fl_str_mv José Miguel de Sousa Pessanha Pereira de Melo
dc.subject.por.fl_str_mv Engenharia electrotécnica, electrónica e informática
Electrical engineering, Electronic engineering, Information engineering
topic Engenharia electrotécnica, electrónica e informática
Electrical engineering, Electronic engineering, Information engineering
description Over the past years, with the progress of technology, the amount of data being collected by the IT systems has exponentially grown. By using data mining techniques, this data can be analyzed to find trends and statistics, which are really useful for all companies and industries. The analysis and data sharing for studies became a large industry, with a great impact in all sectors. However, with this comes the concern with individual privacy - there is a huge amount of data which is private and should not be public in any circumstances - so, it is highly needed to find a solution to share and analyse data while protecting privacy. Nevertheless, it is truly important to take into account performance issues as the anonymization process should not hinder the normal functioning of the operational system. The focus goes to clinical data, which allows medical researchers to learn trends, statistics and relations between certain clinical attributes, such as correlations between gender and a specific disease. These studies and data analysis are very important as they can bring great benefits and knowledge in healthcare. However, maintaining individual privacy is crucial. In order to solve this problem, a new solution will be proposed and developed to efficiently anonymize data. With this solution, the user can quickly and efficiently anonymize a given dataset, according to the initially provided configurations. The tool receives as input the dataset to anonymize and a minimal configuration to specify the required anonymization parameters, and returns as output the corresponding anonymized dataset. The anonymization is done recurring to well known models and algorithms to protect privacy, associated with specific clinical criteria, restrictions and hierarchies. This solution will be validated using a subset of clinical data that needs to be anonymized. After the anonymization process, an anonymized version of the subset is expected that meets the selected privacy model, balancing enough privacy versus keeping research value. As well as the result, the performance will be evaluated in order to validate the solution's applicability. It is intended with this solution to surpass the issue of individual data privacy when sharing data and to have impact in healthcare studies, allowing them to continue without connecting any data to any specific identity, protecting individual privacy.
publishDate 2017
dc.date.none.fl_str_mv 2017-07-17
2017-07-17T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://hdl.handle.net/10216/106216
TID:201799944
url https://hdl.handle.net/10216/106216
identifier_str_mv TID:201799944
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799135607669129217