Anonimização de informação clínica
Autor(a) principal: | |
---|---|
Data de Publicação: | 2017 |
Tipo de documento: | Dissertação |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | https://hdl.handle.net/10216/106216 |
Resumo: | Over the past years, with the progress of technology, the amount of data being collected by the IT systems has exponentially grown. By using data mining techniques, this data can be analyzed to find trends and statistics, which are really useful for all companies and industries. The analysis and data sharing for studies became a large industry, with a great impact in all sectors. However, with this comes the concern with individual privacy - there is a huge amount of data which is private and should not be public in any circumstances - so, it is highly needed to find a solution to share and analyse data while protecting privacy. Nevertheless, it is truly important to take into account performance issues as the anonymization process should not hinder the normal functioning of the operational system. The focus goes to clinical data, which allows medical researchers to learn trends, statistics and relations between certain clinical attributes, such as correlations between gender and a specific disease. These studies and data analysis are very important as they can bring great benefits and knowledge in healthcare. However, maintaining individual privacy is crucial. In order to solve this problem, a new solution will be proposed and developed to efficiently anonymize data. With this solution, the user can quickly and efficiently anonymize a given dataset, according to the initially provided configurations. The tool receives as input the dataset to anonymize and a minimal configuration to specify the required anonymization parameters, and returns as output the corresponding anonymized dataset. The anonymization is done recurring to well known models and algorithms to protect privacy, associated with specific clinical criteria, restrictions and hierarchies. This solution will be validated using a subset of clinical data that needs to be anonymized. After the anonymization process, an anonymized version of the subset is expected that meets the selected privacy model, balancing enough privacy versus keeping research value. As well as the result, the performance will be evaluated in order to validate the solution's applicability. It is intended with this solution to surpass the issue of individual data privacy when sharing data and to have impact in healthcare studies, allowing them to continue without connecting any data to any specific identity, protecting individual privacy. |
id |
RCAP_184ddca54cd6b666d19c1a79273553dc |
---|---|
oai_identifier_str |
oai:repositorio-aberto.up.pt:10216/106216 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Anonimização de informação clínicaEngenharia electrotécnica, electrónica e informáticaElectrical engineering, Electronic engineering, Information engineeringOver the past years, with the progress of technology, the amount of data being collected by the IT systems has exponentially grown. By using data mining techniques, this data can be analyzed to find trends and statistics, which are really useful for all companies and industries. The analysis and data sharing for studies became a large industry, with a great impact in all sectors. However, with this comes the concern with individual privacy - there is a huge amount of data which is private and should not be public in any circumstances - so, it is highly needed to find a solution to share and analyse data while protecting privacy. Nevertheless, it is truly important to take into account performance issues as the anonymization process should not hinder the normal functioning of the operational system. The focus goes to clinical data, which allows medical researchers to learn trends, statistics and relations between certain clinical attributes, such as correlations between gender and a specific disease. These studies and data analysis are very important as they can bring great benefits and knowledge in healthcare. However, maintaining individual privacy is crucial. In order to solve this problem, a new solution will be proposed and developed to efficiently anonymize data. With this solution, the user can quickly and efficiently anonymize a given dataset, according to the initially provided configurations. The tool receives as input the dataset to anonymize and a minimal configuration to specify the required anonymization parameters, and returns as output the corresponding anonymized dataset. The anonymization is done recurring to well known models and algorithms to protect privacy, associated with specific clinical criteria, restrictions and hierarchies. This solution will be validated using a subset of clinical data that needs to be anonymized. After the anonymization process, an anonymized version of the subset is expected that meets the selected privacy model, balancing enough privacy versus keeping research value. As well as the result, the performance will be evaluated in order to validate the solution's applicability. It is intended with this solution to surpass the issue of individual data privacy when sharing data and to have impact in healthcare studies, allowing them to continue without connecting any data to any specific identity, protecting individual privacy.2017-07-172017-07-17T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttps://hdl.handle.net/10216/106216TID:201799944engJosé Miguel de Sousa Pessanha Pereira de Meloinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-29T12:56:27Zoai:repositorio-aberto.up.pt:10216/106216Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T23:30:01.616299Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Anonimização de informação clínica |
title |
Anonimização de informação clínica |
spellingShingle |
Anonimização de informação clínica José Miguel de Sousa Pessanha Pereira de Melo Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
title_short |
Anonimização de informação clínica |
title_full |
Anonimização de informação clínica |
title_fullStr |
Anonimização de informação clínica |
title_full_unstemmed |
Anonimização de informação clínica |
title_sort |
Anonimização de informação clínica |
author |
José Miguel de Sousa Pessanha Pereira de Melo |
author_facet |
José Miguel de Sousa Pessanha Pereira de Melo |
author_role |
author |
dc.contributor.author.fl_str_mv |
José Miguel de Sousa Pessanha Pereira de Melo |
dc.subject.por.fl_str_mv |
Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
topic |
Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
description |
Over the past years, with the progress of technology, the amount of data being collected by the IT systems has exponentially grown. By using data mining techniques, this data can be analyzed to find trends and statistics, which are really useful for all companies and industries. The analysis and data sharing for studies became a large industry, with a great impact in all sectors. However, with this comes the concern with individual privacy - there is a huge amount of data which is private and should not be public in any circumstances - so, it is highly needed to find a solution to share and analyse data while protecting privacy. Nevertheless, it is truly important to take into account performance issues as the anonymization process should not hinder the normal functioning of the operational system. The focus goes to clinical data, which allows medical researchers to learn trends, statistics and relations between certain clinical attributes, such as correlations between gender and a specific disease. These studies and data analysis are very important as they can bring great benefits and knowledge in healthcare. However, maintaining individual privacy is crucial. In order to solve this problem, a new solution will be proposed and developed to efficiently anonymize data. With this solution, the user can quickly and efficiently anonymize a given dataset, according to the initially provided configurations. The tool receives as input the dataset to anonymize and a minimal configuration to specify the required anonymization parameters, and returns as output the corresponding anonymized dataset. The anonymization is done recurring to well known models and algorithms to protect privacy, associated with specific clinical criteria, restrictions and hierarchies. This solution will be validated using a subset of clinical data that needs to be anonymized. After the anonymization process, an anonymized version of the subset is expected that meets the selected privacy model, balancing enough privacy versus keeping research value. As well as the result, the performance will be evaluated in order to validate the solution's applicability. It is intended with this solution to surpass the issue of individual data privacy when sharing data and to have impact in healthcare studies, allowing them to continue without connecting any data to any specific identity, protecting individual privacy. |
publishDate |
2017 |
dc.date.none.fl_str_mv |
2017-07-17 2017-07-17T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://hdl.handle.net/10216/106216 TID:201799944 |
url |
https://hdl.handle.net/10216/106216 |
identifier_str_mv |
TID:201799944 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799135607669129217 |