Privacy in data publishing for tailored recommendation scenarios

Detalhes bibliográficos
Autor(a) principal: Gonçalves, J. M.
Data de Publicação: 2015
Outros Autores: Gomes, Diogo Nuno, Aguiar, R. L.
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10773/16059
Resumo: Personal information is increasingly gathered and used for providing services tailored to user preferences, but the datasets used to provide such functionality can represent serious privacy threats if not appropriately protected. Work in privacy-preserving data publishing targeted privacy guarantees that protect against record re-identification, by making records indistinguishable, or sensitive attribute value disclosure, by introducing diversity or noise in the sensitive values. However, most approaches fail in the high-dimensional case, and the ones that don’t introduce a utility cost incompatible with tailored recommendation scenarios. This paper aims at a sensible trade-off between privacy and the benefits of tailored recommendations, in the context of privacy-preserving data publishing. We empirically demonstrate that significant privacy improvements can be achieved at a utility cost compatible with tailored recommendation scenarios, using a simple partition-based sanitization method.
id RCAP_5acf3874cd103f22109560675d0ec789
oai_identifier_str oai:ria.ua.pt:10773/16059
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Privacy in data publishing for tailored recommendation scenariosData anonymization and sanitizationHigh-dimensional datasetsPrivacy-preserving data publishingRating predictionRecommender systemsTailored recommendationsEconomic and social effectsHigh-dimensionalPersonal informationRe identificationsSanitizationSensitive attributeTailored recommendationsData privacyPersonal information is increasingly gathered and used for providing services tailored to user preferences, but the datasets used to provide such functionality can represent serious privacy threats if not appropriately protected. Work in privacy-preserving data publishing targeted privacy guarantees that protect against record re-identification, by making records indistinguishable, or sensitive attribute value disclosure, by introducing diversity or noise in the sensitive values. However, most approaches fail in the high-dimensional case, and the ones that don’t introduce a utility cost incompatible with tailored recommendation scenarios. This paper aims at a sensible trade-off between privacy and the benefits of tailored recommendations, in the context of privacy-preserving data publishing. We empirically demonstrate that significant privacy improvements can be achieved at a utility cost compatible with tailored recommendation scenarios, using a simple partition-based sanitization method.IIIA-CSIC2016-09-02T09:31:02Z2015-01-01T00:00:00Z2015info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10773/16059eng1888-5063Gonçalves, J. M.Gomes, Diogo NunoAguiar, R. L.info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-02-22T11:29:43Zoai:ria.ua.pt:10773/16059Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T02:51:14.606606Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Privacy in data publishing for tailored recommendation scenarios
title Privacy in data publishing for tailored recommendation scenarios
spellingShingle Privacy in data publishing for tailored recommendation scenarios
Gonçalves, J. M.
Data anonymization and sanitization
High-dimensional datasets
Privacy-preserving data publishing
Rating prediction
Recommender systems
Tailored recommendations
Economic and social effects
High-dimensional
Personal information
Re identifications
Sanitization
Sensitive attribute
Tailored recommendations
Data privacy
title_short Privacy in data publishing for tailored recommendation scenarios
title_full Privacy in data publishing for tailored recommendation scenarios
title_fullStr Privacy in data publishing for tailored recommendation scenarios
title_full_unstemmed Privacy in data publishing for tailored recommendation scenarios
title_sort Privacy in data publishing for tailored recommendation scenarios
author Gonçalves, J. M.
author_facet Gonçalves, J. M.
Gomes, Diogo Nuno
Aguiar, R. L.
author_role author
author2 Gomes, Diogo Nuno
Aguiar, R. L.
author2_role author
author
dc.contributor.author.fl_str_mv Gonçalves, J. M.
Gomes, Diogo Nuno
Aguiar, R. L.
dc.subject.por.fl_str_mv Data anonymization and sanitization
High-dimensional datasets
Privacy-preserving data publishing
Rating prediction
Recommender systems
Tailored recommendations
Economic and social effects
High-dimensional
Personal information
Re identifications
Sanitization
Sensitive attribute
Tailored recommendations
Data privacy
topic Data anonymization and sanitization
High-dimensional datasets
Privacy-preserving data publishing
Rating prediction
Recommender systems
Tailored recommendations
Economic and social effects
High-dimensional
Personal information
Re identifications
Sanitization
Sensitive attribute
Tailored recommendations
Data privacy
description Personal information is increasingly gathered and used for providing services tailored to user preferences, but the datasets used to provide such functionality can represent serious privacy threats if not appropriately protected. Work in privacy-preserving data publishing targeted privacy guarantees that protect against record re-identification, by making records indistinguishable, or sensitive attribute value disclosure, by introducing diversity or noise in the sensitive values. However, most approaches fail in the high-dimensional case, and the ones that don’t introduce a utility cost incompatible with tailored recommendation scenarios. This paper aims at a sensible trade-off between privacy and the benefits of tailored recommendations, in the context of privacy-preserving data publishing. We empirically demonstrate that significant privacy improvements can be achieved at a utility cost compatible with tailored recommendation scenarios, using a simple partition-based sanitization method.
publishDate 2015
dc.date.none.fl_str_mv 2015-01-01T00:00:00Z
2015
2016-09-02T09:31:02Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10773/16059
url http://hdl.handle.net/10773/16059
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 1888-5063
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv IIIA-CSIC
publisher.none.fl_str_mv IIIA-CSIC
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799137561850937344