The impact of spatial data redundancy on SOLAP query performance

Detalhes bibliográficos
Autor(a) principal: Siqueira,Thiago Luís Lopes
Data de Publicação: 2009
Outros Autores: Ciferri,Cristina Dutra de Aguiar, Times,Valéria Cesário, Oliveira,Anjolina Grisi de, Ciferri,Ricardo Rodrigues
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Journal of the Brazilian Computer Society
Texto Completo: http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0104-65002009000200003
Resumo: Geographic Data Warehouses (GDW) are one of the main technologies used in decision-making processes and spatial analysis, and the literature proposes several conceptual and logical data models for GDW. However, little effort has been focused on studying how spatial data redundancy affects SOLAP (Spatial On-Line Analytical Processing) query performance over GDW. In this paper, we investigate this issue. Firstly, we compare redundant and non-redundant GDW schemas and conclude that redundancy is related to high performance losses. We also analyze the issue of indexing, aiming at improving SOLAP query performance on a redundant GDW. Comparisons of the SB-index approach, the star-join aided by R-tree and the star-join aided by GiST indicate that the SB-index significantly improves the elapsed time in query processing from 25% up to 99% with regard to SOLAP queries defined over the spatial predicates of intersection, enclosure and containment and applied to roll-up and drill-down operations. We also investigate the impact of the increase in data volume on the performance. The increase did not impair the performance of the SB-index, which highly improved the elapsed time in query processing. Performance tests also show that the SB-index is far more compact than the star-join, requiring only a small fraction of at most 0.20% of the volume. Moreover, we propose a specific enhancement of the SB-index to deal with spatial data redundancy. This enhancement improved performance from 80 to 91% for redundant GDW schemas.
id UFRGS-28_32413d8397d1c817bc60a4b07e66ee6f
oai_identifier_str oai:scielo:S0104-65002009000200003
network_acronym_str UFRGS-28
network_name_str Journal of the Brazilian Computer Society
repository_id_str
spelling The impact of spatial data redundancy on SOLAP query performancegeographic data warehouseindex structureSOLAP query performancespatial data redundancyGeographic Data Warehouses (GDW) are one of the main technologies used in decision-making processes and spatial analysis, and the literature proposes several conceptual and logical data models for GDW. However, little effort has been focused on studying how spatial data redundancy affects SOLAP (Spatial On-Line Analytical Processing) query performance over GDW. In this paper, we investigate this issue. Firstly, we compare redundant and non-redundant GDW schemas and conclude that redundancy is related to high performance losses. We also analyze the issue of indexing, aiming at improving SOLAP query performance on a redundant GDW. Comparisons of the SB-index approach, the star-join aided by R-tree and the star-join aided by GiST indicate that the SB-index significantly improves the elapsed time in query processing from 25% up to 99% with regard to SOLAP queries defined over the spatial predicates of intersection, enclosure and containment and applied to roll-up and drill-down operations. We also investigate the impact of the increase in data volume on the performance. The increase did not impair the performance of the SB-index, which highly improved the elapsed time in query processing. Performance tests also show that the SB-index is far more compact than the star-join, requiring only a small fraction of at most 0.20% of the volume. Moreover, we propose a specific enhancement of the SB-index to deal with spatial data redundancy. This enhancement improved performance from 80 to 91% for redundant GDW schemas.Sociedade Brasileira de Computação2009-06-01info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersiontext/htmlhttp://old.scielo.br/scielo.php?script=sci_arttext&pid=S0104-65002009000200003Journal of the Brazilian Computer Society v.15 n.2 2009reponame:Journal of the Brazilian Computer Societyinstname:Sociedade Brasileira de Computação (SBC)instacron:UFRGS10.1007/BF03194499info:eu-repo/semantics/openAccessSiqueira,Thiago Luís LopesCiferri,Cristina Dutra de AguiarTimes,Valéria CesárioOliveira,Anjolina Grisi deCiferri,Ricardo Rodrigueseng2009-08-31T00:00:00Zoai:scielo:S0104-65002009000200003Revistahttps://journal-bcs.springeropen.com/PUBhttps://old.scielo.br/oai/scielo-oai.phpjbcs@icmc.sc.usp.br1678-48040104-6500opendoar:2009-08-31T00:00Journal of the Brazilian Computer Society - Sociedade Brasileira de Computação (SBC)false
dc.title.none.fl_str_mv The impact of spatial data redundancy on SOLAP query performance
title The impact of spatial data redundancy on SOLAP query performance
spellingShingle The impact of spatial data redundancy on SOLAP query performance
Siqueira,Thiago Luís Lopes
geographic data warehouse
index structure
SOLAP query performance
spatial data redundancy
title_short The impact of spatial data redundancy on SOLAP query performance
title_full The impact of spatial data redundancy on SOLAP query performance
title_fullStr The impact of spatial data redundancy on SOLAP query performance
title_full_unstemmed The impact of spatial data redundancy on SOLAP query performance
title_sort The impact of spatial data redundancy on SOLAP query performance
author Siqueira,Thiago Luís Lopes
author_facet Siqueira,Thiago Luís Lopes
Ciferri,Cristina Dutra de Aguiar
Times,Valéria Cesário
Oliveira,Anjolina Grisi de
Ciferri,Ricardo Rodrigues
author_role author
author2 Ciferri,Cristina Dutra de Aguiar
Times,Valéria Cesário
Oliveira,Anjolina Grisi de
Ciferri,Ricardo Rodrigues
author2_role author
author
author
author
dc.contributor.author.fl_str_mv Siqueira,Thiago Luís Lopes
Ciferri,Cristina Dutra de Aguiar
Times,Valéria Cesário
Oliveira,Anjolina Grisi de
Ciferri,Ricardo Rodrigues
dc.subject.por.fl_str_mv geographic data warehouse
index structure
SOLAP query performance
spatial data redundancy
topic geographic data warehouse
index structure
SOLAP query performance
spatial data redundancy
description Geographic Data Warehouses (GDW) are one of the main technologies used in decision-making processes and spatial analysis, and the literature proposes several conceptual and logical data models for GDW. However, little effort has been focused on studying how spatial data redundancy affects SOLAP (Spatial On-Line Analytical Processing) query performance over GDW. In this paper, we investigate this issue. Firstly, we compare redundant and non-redundant GDW schemas and conclude that redundancy is related to high performance losses. We also analyze the issue of indexing, aiming at improving SOLAP query performance on a redundant GDW. Comparisons of the SB-index approach, the star-join aided by R-tree and the star-join aided by GiST indicate that the SB-index significantly improves the elapsed time in query processing from 25% up to 99% with regard to SOLAP queries defined over the spatial predicates of intersection, enclosure and containment and applied to roll-up and drill-down operations. We also investigate the impact of the increase in data volume on the performance. The increase did not impair the performance of the SB-index, which highly improved the elapsed time in query processing. Performance tests also show that the SB-index is far more compact than the star-join, requiring only a small fraction of at most 0.20% of the volume. Moreover, we propose a specific enhancement of the SB-index to deal with spatial data redundancy. This enhancement improved performance from 80 to 91% for redundant GDW schemas.
publishDate 2009
dc.date.none.fl_str_mv 2009-06-01
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0104-65002009000200003
url http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0104-65002009000200003
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 10.1007/BF03194499
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv text/html
dc.publisher.none.fl_str_mv Sociedade Brasileira de Computação
publisher.none.fl_str_mv Sociedade Brasileira de Computação
dc.source.none.fl_str_mv Journal of the Brazilian Computer Society v.15 n.2 2009
reponame:Journal of the Brazilian Computer Society
instname:Sociedade Brasileira de Computação (SBC)
instacron:UFRGS
instname_str Sociedade Brasileira de Computação (SBC)
instacron_str UFRGS
institution UFRGS
reponame_str Journal of the Brazilian Computer Society
collection Journal of the Brazilian Computer Society
repository.name.fl_str_mv Journal of the Brazilian Computer Society - Sociedade Brasileira de Computação (SBC)
repository.mail.fl_str_mv jbcs@icmc.sc.usp.br
_version_ 1754734669994131456