Processamento distribuído da junção espacial de múltiplas bases de dados: multi-way spatial join

Bibliographic details
Main author: Cunha, Anderson Rogério
Publication date: 2014
Document type: Master's thesis
Language: por
Source title: Repositório Institucional da UFG
dARK ID: ark:/38995/0013000009pjn
Full text: http://repositorio.bc.ufg.br/tede/handle/tede/3836
Abstract: Spatial join is one of the most computationally expensive spatial operations. Its complexity increases significantly when it involves multiple databases (multi-way spatial join). Traditional strategies for processing multi-way spatial joins apply combinations of binary join algorithms in centralized computing environments. For complex queries, this approach demands computational power that is often infeasible to provide in a centralized environment. This work proposes the Distributed Synchronous Traversal (DST) algorithm, whose goal is to enable the distributed processing of multi-way spatial joins on a cluster of computers. DST is based on the Synchronous Traversal algorithm and processes the multi-way spatial join in a single synchronous descent through the levels of the R-Trees of the input databases (the final result is built incrementally, without creating temporary databases). To the best of our knowledge, no other proposal in the literature addresses this problem in a distributed fashion on a peer-to-peer architecture. Many challenges had to be overcome, such as defining data structures that map the semantics of multi-way spatial join queries and coordinating the required distributed processing. DST proved to be satisfactorily parallelizable and scalable when processing real datasets in experiments on clusters of 1, 2, 4, and 8 servers.
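The synchronous descent mentioned in the abstract can be illustrated with a minimal, centralized sketch. This is not the DST algorithm from the thesis and says nothing about its distribution or peer-to-peer coordination; it only shows the general idea of descending several R-Trees in lockstep and keeping the combinations of entries whose bounding rectangles satisfy the query. The names `Node`, `make_parent`, `intersects`, and `synchronous_traversal` are hypothetical, the query is assumed to be a list of index pairs that must overlap, and candidate combinations are enumerated by brute force rather than with the pruning a real implementation would use.

```python
from dataclasses import dataclass, field
from itertools import product
from typing import Iterator, List, Optional, Sequence, Tuple

Rect = Tuple[float, float, float, float]  # (xmin, ymin, xmax, ymax)


def intersects(a: Rect, b: Rect) -> bool:
    """True when two axis-aligned rectangles overlap or touch."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


@dataclass
class Node:
    """Hypothetical R-Tree entry: no children means it is a data object."""
    mbr: Rect
    children: List["Node"] = field(default_factory=list)
    oid: Optional[str] = None


def make_parent(children: Sequence[Node]) -> Node:
    """Build an inner node whose MBR encloses all of its children."""
    return Node(
        mbr=(
            min(c.mbr[0] for c in children),
            min(c.mbr[1] for c in children),
            max(c.mbr[2] for c in children),
            max(c.mbr[3] for c in children),
        ),
        children=list(children),
    )


def synchronous_traversal(
    nodes: Sequence[Node], edges: Sequence[Tuple[int, int]]
) -> Iterator[Tuple[Optional[str], ...]]:
    """Yield one tuple of object ids per combination satisfying every edge.

    nodes[i] is the current entry of the i-th tree; each (i, j) in edges
    requires the entries of trees i and j to intersect.
    """
    # Prune: if the MBRs already violate the query, no descendant can match.
    if not all(intersects(nodes[i].mbr, nodes[j].mbr) for i, j in edges):
        return
    # All entries are data objects: this combination is a result tuple.
    if all(not n.children for n in nodes):
        yield tuple(n.oid for n in nodes)
        return
    # Descend every tree one level at once; a tree that has already reached
    # a data object stays fixed while the others catch up.
    for combo in product(*(n.children or [n] for n in nodes)):
        yield from synchronous_traversal(combo, edges)


if __name__ == "__main__":
    a = make_parent([Node((0, 0, 2, 2), oid="a1"), Node((10, 10, 12, 12), oid="a2")])
    b = make_parent([Node((1, 1, 3, 3), oid="b1"), Node((20, 20, 21, 21), oid="b2")])
    c = make_parent([Node((2.5, 2.5, 4, 4), oid="c1"), Node((11, 11, 13, 13), oid="c2")])
    # Chain query: A overlaps B, and B overlaps C.
    print(list(synchronous_traversal([a, b, c], [(0, 1), (1, 2)])))
    # prints [('a1', 'b1', 'c1')]
```

The result tuples come straight out of the descent, with no intermediate pairwise join results stored, which is the property the abstract describes as building the outcome incrementally.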
id UFG-2_e86664f3cdfd47b8f055321757aaf146
oai_identifier_str oai:repositorio.bc.ufg.br:tede/3836
network_acronym_str UFG-2
network_name_str Repositório Institucional da UFG
repository_id_str
spelling Submitted by Erika Demachki (erikademachki@gmail.com) on 2014-12-29T15:33:04Z. No. of bitstreams: 2. license_rdf: 23148 bytes, checksum: 9da0b6dfac957114c6a7714714b86306 (MD5). Dissertação - Anderson Rogério Cunha - 2014.pdf: 4853685 bytes, checksum: d50cf557f1a067a91c2034443ee62df2 (MD5).
Approved for entry into archive by Erika Demachki (erikademachki@gmail.com) on 2014-12-29T15:39:23Z (GMT).
Made available in DSpace on 2014-12-29T15:39:23Z (GMT). Previous issue date: 2014-02-19.
Funding agency: Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - CAPES.
dc.title.por.fl_str_mv Processamento distribuído da junção espacial de múltiplas bases de dados: multi-way spatial join
title Processamento distribuído da junção espacial de múltiplas bases de dados: multi-way spatial join
author Cunha, Anderson Rogério
author_facet Cunha, Anderson Rogério
author_role author
dc.contributor.advisor1.fl_str_mv Rodrigues, Vagner José do Sacramento
dc.contributor.advisor1Lattes.fl_str_mv http://lattes.cnpq.br/4148896613580056
dc.contributor.referee1.fl_str_mv Rodrigues, Vagner José do Sacramento
dc.contributor.referee2.fl_str_mv Davis Júnior, Clodoveu Augusto
dc.contributor.referee3.fl_str_mv Cardoso, Kleber Vieira
dc.contributor.authorLattes.fl_str_mv http://lattes.cnpq.br/1051044276625162
dc.contributor.author.fl_str_mv Cunha, Anderson Rogério
contributor_str_mv Rodrigues, Vagner José do Sacramento
Rodrigues, Vagner José do Sacramento
Davis Júnior, Clodoveu Augusto
Cardoso, Kleber Vieira
dc.subject.por.fl_str_mv Processamento distribuído
Junção espacial
topic Processamento distribuído
Junção espacial
Multi-way spatial join
R-Tree
Distributed processing
Spatial join
CIENCIA DA COMPUTACAO::SISTEMAS DE COMPUTACAO
dc.subject.eng.fl_str_mv Multi-way spatial join
R-Tree
Distributed processing
Spatial join
dc.subject.cnpq.fl_str_mv CIENCIA DA COMPUTACAO::SISTEMAS DE COMPUTACAO
description Spatial join is one of the most computationally expensive spatial operations. Its complexity increases significantly when it involves multiple databases (multi-way spatial join). Traditional strategies for processing multi-way spatial joins apply combinations of binary join algorithms in centralized computing environments. For complex queries, this approach demands computational power that is often infeasible to provide in a centralized environment. This work proposes the Distributed Synchronous Traversal (DST) algorithm, whose goal is to enable the distributed processing of multi-way spatial joins on a cluster of computers. DST is based on the Synchronous Traversal algorithm and processes the multi-way spatial join in a single synchronous descent through the levels of the R-Trees of the input databases (the final result is built incrementally, without creating temporary databases). To the best of our knowledge, no other proposal in the literature addresses this problem in a distributed fashion on a peer-to-peer architecture. Many challenges had to be overcome, such as defining data structures that map the semantics of multi-way spatial join queries and coordinating the required distributed processing. DST proved to be satisfactorily parallelizable and scalable when processing real datasets in experiments on clusters of 1, 2, 4, and 8 servers.
publishDate 2014
dc.date.accessioned.fl_str_mv 2014-12-29T15:39:23Z
dc.date.issued.fl_str_mv 2014-02-19
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.citation.fl_str_mv CUNHA, Anderson Rogério. Processamento distribuído da junção espacial de múltiplas bases de dados: multi-way spatial join. 2014. 82 f. Dissertação (Mestrado em Ciência da Computação) - Universidade Federal de Goiás, Goiânia, 2014.
dc.identifier.uri.fl_str_mv http://repositorio.bc.ufg.br/tede/handle/tede/3836
dc.identifier.dark.fl_str_mv ark:/38995/0013000009pjn
identifier_str_mv CUNHA, Anderson Rogério. Processamento distribuído da junção espacial de múltiplas bases de dados: multi-way spatial join. 2014. 82 f. Dissertação (Mestrado em Ciência da Computação) - Universidade Federal de Goiás, Goiânia, 2014.
ark:/38995/0013000009pjn
url http://repositorio.bc.ufg.br/tede/handle/tede/3836
dc.language.iso.fl_str_mv por
language por
dc.relation.program.fl_str_mv -3303550325223384799
dc.relation.confidence.fl_str_mv 600
600
600
600
dc.relation.department.fl_str_mv -7712266734633644768
dc.relation.cnpq.fl_str_mv 8930092515683771531
dc.relation.sponsorship.fl_str_mv 2075167498588264571
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Universidade Federal de Goiás
dc.publisher.program.fl_str_mv Programa de Pós-graduação em Ciência da Computação (INF)
dc.publisher.initials.fl_str_mv UFG
dc.publisher.country.fl_str_mv Brasil
dc.publisher.department.fl_str_mv Instituto de Informática - INF (RG)
publisher.none.fl_str_mv Universidade Federal de Goiás
dc.source.none.fl_str_mv reponame:Repositório Institucional da UFG
instname:Universidade Federal de Goiás (UFG)
instacron:UFG
instname_str Universidade Federal de Goiás (UFG)
instacron_str UFG
institution UFG
reponame_str Repositório Institucional da UFG
collection Repositório Institucional da UFG
bitstream.url.fl_str_mv http://repositorio.bc.ufg.br/tede/bitstreams/9a34a670-80fa-4fc2-96a7-27767ce78898/download
http://repositorio.bc.ufg.br/tede/bitstreams/bfdcdbc6-af17-4ed0-8d85-58bebba11646/download
http://repositorio.bc.ufg.br/tede/bitstreams/20605fdb-9c02-4253-a065-3a4316759fc3/download
http://repositorio.bc.ufg.br/tede/bitstreams/9518698f-8ff5-4ef4-95e0-4d589354c1b3/download
http://repositorio.bc.ufg.br/tede/bitstreams/ac458a5b-29ef-4cb1-b3f0-c52619c3e821/download
http://repositorio.bc.ufg.br/tede/bitstreams/5e84331b-76bd-4d05-ba62-8409fdf4f76a/download
http://repositorio.bc.ufg.br/tede/bitstreams/baf2c42c-ab95-4dc1-9ca1-502b2b826fe1/download
bitstream.checksum.fl_str_mv d50cf557f1a067a91c2034443ee62df2
bd3efa91386c1718a7f26a329fdcb468
4afdbb8c545fd630ea7db775da747b2f
29b9d5e95be03707f9d4a2e110421c11
9da0b6dfac957114c6a7714714b86306
cdb4c87fd25369116ad56e9d4afc856c
4787eb2e881ab143822a82ccb008af18
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
MD5
MD5
MD5
MD5
repository.name.fl_str_mv Repositório Institucional da UFG - Universidade Federal de Goiás (UFG)
repository.mail.fl_str_mv tasesdissertacoes.bc@ufg.br
_version_ 1815172613540937728