Privacy in data publishing for tailored recommendation scenarios
Autor(a) principal: | |
---|---|
Data de Publicação: | 2015 |
Outros Autores: | , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10773/16059 |
Resumo: | Personal information is increasingly gathered and used for providing services tailored to user preferences, but the datasets used to provide such functionality can represent serious privacy threats if not appropriately protected. Work in privacy-preserving data publishing targeted privacy guarantees that protect against record re-identification, by making records indistinguishable, or sensitive attribute value disclosure, by introducing diversity or noise in the sensitive values. However, most approaches fail in the high-dimensional case, and the ones that don’t introduce a utility cost incompatible with tailored recommendation scenarios. This paper aims at a sensible trade-off between privacy and the benefits of tailored recommendations, in the context of privacy-preserving data publishing. We empirically demonstrate that significant privacy improvements can be achieved at a utility cost compatible with tailored recommendation scenarios, using a simple partition-based sanitization method. |
id |
RCAP_5acf3874cd103f22109560675d0ec789 |
---|---|
oai_identifier_str |
oai:ria.ua.pt:10773/16059 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Privacy in data publishing for tailored recommendation scenariosData anonymization and sanitizationHigh-dimensional datasetsPrivacy-preserving data publishingRating predictionRecommender systemsTailored recommendationsEconomic and social effectsHigh-dimensionalPersonal informationRe identificationsSanitizationSensitive attributeTailored recommendationsData privacyPersonal information is increasingly gathered and used for providing services tailored to user preferences, but the datasets used to provide such functionality can represent serious privacy threats if not appropriately protected. Work in privacy-preserving data publishing targeted privacy guarantees that protect against record re-identification, by making records indistinguishable, or sensitive attribute value disclosure, by introducing diversity or noise in the sensitive values. However, most approaches fail in the high-dimensional case, and the ones that don’t introduce a utility cost incompatible with tailored recommendation scenarios. This paper aims at a sensible trade-off between privacy and the benefits of tailored recommendations, in the context of privacy-preserving data publishing. We empirically demonstrate that significant privacy improvements can be achieved at a utility cost compatible with tailored recommendation scenarios, using a simple partition-based sanitization method.IIIA-CSIC2016-09-02T09:31:02Z2015-01-01T00:00:00Z2015info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10773/16059eng1888-5063Gonçalves, J. M.Gomes, Diogo NunoAguiar, R. L.info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-02-22T11:29:43Zoai:ria.ua.pt:10773/16059Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T02:51:14.606606Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Privacy in data publishing for tailored recommendation scenarios |
title |
Privacy in data publishing for tailored recommendation scenarios |
spellingShingle |
Privacy in data publishing for tailored recommendation scenarios Gonçalves, J. M. Data anonymization and sanitization High-dimensional datasets Privacy-preserving data publishing Rating prediction Recommender systems Tailored recommendations Economic and social effects High-dimensional Personal information Re identifications Sanitization Sensitive attribute Tailored recommendations Data privacy |
title_short |
Privacy in data publishing for tailored recommendation scenarios |
title_full |
Privacy in data publishing for tailored recommendation scenarios |
title_fullStr |
Privacy in data publishing for tailored recommendation scenarios |
title_full_unstemmed |
Privacy in data publishing for tailored recommendation scenarios |
title_sort |
Privacy in data publishing for tailored recommendation scenarios |
author |
Gonçalves, J. M. |
author_facet |
Gonçalves, J. M. Gomes, Diogo Nuno Aguiar, R. L. |
author_role |
author |
author2 |
Gomes, Diogo Nuno Aguiar, R. L. |
author2_role |
author author |
dc.contributor.author.fl_str_mv |
Gonçalves, J. M. Gomes, Diogo Nuno Aguiar, R. L. |
dc.subject.por.fl_str_mv |
Data anonymization and sanitization High-dimensional datasets Privacy-preserving data publishing Rating prediction Recommender systems Tailored recommendations Economic and social effects High-dimensional Personal information Re identifications Sanitization Sensitive attribute Tailored recommendations Data privacy |
topic |
Data anonymization and sanitization High-dimensional datasets Privacy-preserving data publishing Rating prediction Recommender systems Tailored recommendations Economic and social effects High-dimensional Personal information Re identifications Sanitization Sensitive attribute Tailored recommendations Data privacy |
description |
Personal information is increasingly gathered and used for providing services tailored to user preferences, but the datasets used to provide such functionality can represent serious privacy threats if not appropriately protected. Work in privacy-preserving data publishing targeted privacy guarantees that protect against record re-identification, by making records indistinguishable, or sensitive attribute value disclosure, by introducing diversity or noise in the sensitive values. However, most approaches fail in the high-dimensional case, and the ones that don’t introduce a utility cost incompatible with tailored recommendation scenarios. This paper aims at a sensible trade-off between privacy and the benefits of tailored recommendations, in the context of privacy-preserving data publishing. We empirically demonstrate that significant privacy improvements can be achieved at a utility cost compatible with tailored recommendation scenarios, using a simple partition-based sanitization method. |
publishDate |
2015 |
dc.date.none.fl_str_mv |
2015-01-01T00:00:00Z 2015 2016-09-02T09:31:02Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10773/16059 |
url |
http://hdl.handle.net/10773/16059 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
1888-5063 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
IIIA-CSIC |
publisher.none.fl_str_mv |
IIIA-CSIC |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799137561850937344 |