A three-phase algorithm for the cell suppression problem in two-dimensional statistical tables

Detalhes bibliográficos
Autor(a) principal: Carvalho, Filipa D.
Data de Publicação: 2008
Outros Autores: Almeida, Maria Teresa
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10400.5/29162
Resumo: To obtain full cooperation from respondents, statistical offices must guarantee that confidential data will not be disclosed when their reports are published. For tabular data, cell suppression is one of the preferred techniques to control statistical disclosure. When suppressing only confidential values does not guarantee the desired data protection, it is also necessary to suppress the values in some non-confidential cells. The problem of finding an optimal set of complementary suppressions—the cell suppression problem (CSP)—is NP-hard. We present a three-phase algorithm for the CSP based on a binary relaxation derived from row and column protection conditions. To enforce violated single cell conditions, integer cuts are added to the CSP relaxation. The numerical results obtained in 1410 instances with up to more than 250 000 cells, which were generated to reproduce two classes of real-world data, indicate that the algorithm is quite effective for both classes of instances and that it outperforms state-of-the-art algorithms for one of them.
id RCAP_db4a6b74698807898b7f8ac05d78b0d7
oai_identifier_str oai:www.repository.utl.pt:10400.5/29162
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling A three-phase algorithm for the cell suppression problem in two-dimensional statistical tablesInteger ProgrammingNetworks and GraphsHeuristicsCell Suppression ProblemTo obtain full cooperation from respondents, statistical offices must guarantee that confidential data will not be disclosed when their reports are published. For tabular data, cell suppression is one of the preferred techniques to control statistical disclosure. When suppressing only confidential values does not guarantee the desired data protection, it is also necessary to suppress the values in some non-confidential cells. The problem of finding an optimal set of complementary suppressions—the cell suppression problem (CSP)—is NP-hard. We present a three-phase algorithm for the CSP based on a binary relaxation derived from row and column protection conditions. To enforce violated single cell conditions, integer cuts are added to the CSP relaxation. The numerical results obtained in 1410 instances with up to more than 250 000 cells, which were generated to reproduce two classes of real-world data, indicate that the algorithm is quite effective for both classes of instances and that it outperforms state-of-the-art algorithms for one of them.Taylor & FrancisRepositório da Universidade de LisboaCarvalho, Filipa D.Almeida, Maria Teresa2023-10-30T16:36:56Z20082008-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10400.5/29162engCarvalho, Filipa D. and Maria Teresa Almeida .(2008). “A three-phase algorithm for the cell suppression problem in two-dimensional statistical tables”. Journal of the Operational Research Society, Vol. 59, No. 4: pp. 556-562. .(Search PDF in 2023).1476-9360info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-05T01:31:50Zoai:www.repository.utl.pt:10400.5/29162Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T21:26:47.179698Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv A three-phase algorithm for the cell suppression problem in two-dimensional statistical tables
title A three-phase algorithm for the cell suppression problem in two-dimensional statistical tables
spellingShingle A three-phase algorithm for the cell suppression problem in two-dimensional statistical tables
Carvalho, Filipa D.
Integer Programming
Networks and Graphs
Heuristics
Cell Suppression Problem
title_short A three-phase algorithm for the cell suppression problem in two-dimensional statistical tables
title_full A three-phase algorithm for the cell suppression problem in two-dimensional statistical tables
title_fullStr A three-phase algorithm for the cell suppression problem in two-dimensional statistical tables
title_full_unstemmed A three-phase algorithm for the cell suppression problem in two-dimensional statistical tables
title_sort A three-phase algorithm for the cell suppression problem in two-dimensional statistical tables
author Carvalho, Filipa D.
author_facet Carvalho, Filipa D.
Almeida, Maria Teresa
author_role author
author2 Almeida, Maria Teresa
author2_role author
dc.contributor.none.fl_str_mv Repositório da Universidade de Lisboa
dc.contributor.author.fl_str_mv Carvalho, Filipa D.
Almeida, Maria Teresa
dc.subject.por.fl_str_mv Integer Programming
Networks and Graphs
Heuristics
Cell Suppression Problem
topic Integer Programming
Networks and Graphs
Heuristics
Cell Suppression Problem
description To obtain full cooperation from respondents, statistical offices must guarantee that confidential data will not be disclosed when their reports are published. For tabular data, cell suppression is one of the preferred techniques to control statistical disclosure. When suppressing only confidential values does not guarantee the desired data protection, it is also necessary to suppress the values in some non-confidential cells. The problem of finding an optimal set of complementary suppressions—the cell suppression problem (CSP)—is NP-hard. We present a three-phase algorithm for the CSP based on a binary relaxation derived from row and column protection conditions. To enforce violated single cell conditions, integer cuts are added to the CSP relaxation. The numerical results obtained in 1410 instances with up to more than 250 000 cells, which were generated to reproduce two classes of real-world data, indicate that the algorithm is quite effective for both classes of instances and that it outperforms state-of-the-art algorithms for one of them.
publishDate 2008
dc.date.none.fl_str_mv 2008
2008-01-01T00:00:00Z
2023-10-30T16:36:56Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10400.5/29162
url http://hdl.handle.net/10400.5/29162
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Carvalho, Filipa D. and Maria Teresa Almeida .(2008). “A three-phase algorithm for the cell suppression problem in two-dimensional statistical tables”. Journal of the Operational Research Society, Vol. 59, No. 4: pp. 556-562. .(Search PDF in 2023).
1476-9360
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Taylor & Francis
publisher.none.fl_str_mv Taylor & Francis
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799134149404000256