A three-phase algorithm for the cell suppression problem in two-dimensional statistical tables
Autor(a) principal: | |
---|---|
Data de Publicação: | 2008 |
Outros Autores: | |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10400.5/29162 |
Resumo: | To obtain full cooperation from respondents, statistical offices must guarantee that confidential data will not be disclosed when their reports are published. For tabular data, cell suppression is one of the preferred techniques to control statistical disclosure. When suppressing only confidential values does not guarantee the desired data protection, it is also necessary to suppress the values in some non-confidential cells. The problem of finding an optimal set of complementary suppressions—the cell suppression problem (CSP)—is NP-hard. We present a three-phase algorithm for the CSP based on a binary relaxation derived from row and column protection conditions. To enforce violated single cell conditions, integer cuts are added to the CSP relaxation. The numerical results obtained in 1410 instances with up to more than 250 000 cells, which were generated to reproduce two classes of real-world data, indicate that the algorithm is quite effective for both classes of instances and that it outperforms state-of-the-art algorithms for one of them. |
id |
RCAP_db4a6b74698807898b7f8ac05d78b0d7 |
---|---|
oai_identifier_str |
oai:www.repository.utl.pt:10400.5/29162 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
A three-phase algorithm for the cell suppression problem in two-dimensional statistical tablesInteger ProgrammingNetworks and GraphsHeuristicsCell Suppression ProblemTo obtain full cooperation from respondents, statistical offices must guarantee that confidential data will not be disclosed when their reports are published. For tabular data, cell suppression is one of the preferred techniques to control statistical disclosure. When suppressing only confidential values does not guarantee the desired data protection, it is also necessary to suppress the values in some non-confidential cells. The problem of finding an optimal set of complementary suppressions—the cell suppression problem (CSP)—is NP-hard. We present a three-phase algorithm for the CSP based on a binary relaxation derived from row and column protection conditions. To enforce violated single cell conditions, integer cuts are added to the CSP relaxation. The numerical results obtained in 1410 instances with up to more than 250 000 cells, which were generated to reproduce two classes of real-world data, indicate that the algorithm is quite effective for both classes of instances and that it outperforms state-of-the-art algorithms for one of them.Taylor & FrancisRepositório da Universidade de LisboaCarvalho, Filipa D.Almeida, Maria Teresa2023-10-30T16:36:56Z20082008-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10400.5/29162engCarvalho, Filipa D. and Maria Teresa Almeida .(2008). “A three-phase algorithm for the cell suppression problem in two-dimensional statistical tables”. Journal of the Operational Research Society, Vol. 59, No. 4: pp. 556-562. .(Search PDF in 2023).1476-9360info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-05T01:31:50Zoai:www.repository.utl.pt:10400.5/29162Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T21:26:47.179698Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
A three-phase algorithm for the cell suppression problem in two-dimensional statistical tables |
title |
A three-phase algorithm for the cell suppression problem in two-dimensional statistical tables |
spellingShingle |
A three-phase algorithm for the cell suppression problem in two-dimensional statistical tables Carvalho, Filipa D. Integer Programming Networks and Graphs Heuristics Cell Suppression Problem |
title_short |
A three-phase algorithm for the cell suppression problem in two-dimensional statistical tables |
title_full |
A three-phase algorithm for the cell suppression problem in two-dimensional statistical tables |
title_fullStr |
A three-phase algorithm for the cell suppression problem in two-dimensional statistical tables |
title_full_unstemmed |
A three-phase algorithm for the cell suppression problem in two-dimensional statistical tables |
title_sort |
A three-phase algorithm for the cell suppression problem in two-dimensional statistical tables |
author |
Carvalho, Filipa D. |
author_facet |
Carvalho, Filipa D. Almeida, Maria Teresa |
author_role |
author |
author2 |
Almeida, Maria Teresa |
author2_role |
author |
dc.contributor.none.fl_str_mv |
Repositório da Universidade de Lisboa |
dc.contributor.author.fl_str_mv |
Carvalho, Filipa D. Almeida, Maria Teresa |
dc.subject.por.fl_str_mv |
Integer Programming Networks and Graphs Heuristics Cell Suppression Problem |
topic |
Integer Programming Networks and Graphs Heuristics Cell Suppression Problem |
description |
To obtain full cooperation from respondents, statistical offices must guarantee that confidential data will not be disclosed when their reports are published. For tabular data, cell suppression is one of the preferred techniques to control statistical disclosure. When suppressing only confidential values does not guarantee the desired data protection, it is also necessary to suppress the values in some non-confidential cells. The problem of finding an optimal set of complementary suppressions—the cell suppression problem (CSP)—is NP-hard. We present a three-phase algorithm for the CSP based on a binary relaxation derived from row and column protection conditions. To enforce violated single cell conditions, integer cuts are added to the CSP relaxation. The numerical results obtained in 1410 instances with up to more than 250 000 cells, which were generated to reproduce two classes of real-world data, indicate that the algorithm is quite effective for both classes of instances and that it outperforms state-of-the-art algorithms for one of them. |
publishDate |
2008 |
dc.date.none.fl_str_mv |
2008 2008-01-01T00:00:00Z 2023-10-30T16:36:56Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10400.5/29162 |
url |
http://hdl.handle.net/10400.5/29162 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
Carvalho, Filipa D. and Maria Teresa Almeida .(2008). “A three-phase algorithm for the cell suppression problem in two-dimensional statistical tables”. Journal of the Operational Research Society, Vol. 59, No. 4: pp. 556-562. .(Search PDF in 2023). 1476-9360 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Taylor & Francis |
publisher.none.fl_str_mv |
Taylor & Francis |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799134149404000256 |