A systematic comparison of spatial search strategies for open government datasets

Detalhes bibliográficos
Autor(a) principal: Teka, Brhane Bahrishum
Data de Publicação: 2019
Tipo de documento: Dissertação
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10362/67708
Resumo: Dissertation submitted in partial fulfilment of the requirements for the Degree of Master of Science in Geospatial Technologies
id RCAP_4aade23ec51bbfa078d7ca77a436b6ae
oai_identifier_str oai:run.unl.pt:10362/67708
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling A systematic comparison of spatial search strategies for open government datasetsOpen Government DataSpatial SearchRelevance JudgmentGeographic Information RetrievalQuery ExpansionDissertation submitted in partial fulfilment of the requirements for the Degree of Master of Science in Geospatial TechnologiesDatasets produced or collected by governments are being made publicly available for re-use. Open government data portals help realize such reuse by providing list of datasets and links to access those datasets. This ensures that users can search, inspect and use the data easily. With the rapidly increasing size of datasets in open government data portals, just like it is the case with the web, nding relevant datasets with a query of few keywords is a challenge. Furthermore, those data portals not only consist of textual information but also georeferenced data that needs to be searched properly. Currently, most popular open government data portals like the data.gov.uk and data.gov.ie lack the support for simultaneous thematic and spatial search. Moreover, the use of query expansion hasn't also been studied in open government datasets. In this study we have assessed di erent spatial search strategies and query expansions' performance and impact on user relevance judgment. To evaluate those strategies we harvested machine readable spatial datasets and their metadata from three English based open government data portals, performed metadata enhancement, developed a prototype and performed theoretical and user evaluation. According to the results from the evaluations keyword based search strategy returned limited number of results but the highest relevance rating. In the other hand aggregated spatial and thematic search improved the number of results of the baseline keyword based strategy with a 1 second increase in response time and but decreased relevance rating. Moreover, strategies based on WordNet Synonyms query expansion exhibited the highest relevance rated rst seven results than all other strategies except the keyword based baseline strategy in three out of the four query terms. Regarding the use of Hausdor distance and area of overlap, since documents were returned as results only if they overlap with the query, the number of results returned were the same in both spatial similarities. But strategies using Hausdor distance were of higher relevance rating and average mean than area of overlap based strategies in three of the four queries. In conclusion, while the spatial search strategies assessed in this study can be used to improve the existing keyword based OGDs search approaches, we recommend OGD developers to also consider using WordNet Synonyms based query expansion and hausdor distance as a way of improving relevant spatial data discovery in open government datasets using few keywords and tolerable response time.Degbelo, AuriolHenriques, Roberto André PereiraCasteleyn, SvenRUNTeka, Brhane Bahrishum2019-04-26T15:56:27Z2019-02-052019-02-05T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10362/67708TID:202228290enginfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-03-11T04:31:58Zoai:run.unl.pt:10362/67708Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T03:34:39.206143Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv A systematic comparison of spatial search strategies for open government datasets
title A systematic comparison of spatial search strategies for open government datasets
spellingShingle A systematic comparison of spatial search strategies for open government datasets
Teka, Brhane Bahrishum
Open Government Data
Spatial Search
Relevance Judgment
Geographic Information Retrieval
Query Expansion
title_short A systematic comparison of spatial search strategies for open government datasets
title_full A systematic comparison of spatial search strategies for open government datasets
title_fullStr A systematic comparison of spatial search strategies for open government datasets
title_full_unstemmed A systematic comparison of spatial search strategies for open government datasets
title_sort A systematic comparison of spatial search strategies for open government datasets
author Teka, Brhane Bahrishum
author_facet Teka, Brhane Bahrishum
author_role author
dc.contributor.none.fl_str_mv Degbelo, Auriol
Henriques, Roberto André Pereira
Casteleyn, Sven
RUN
dc.contributor.author.fl_str_mv Teka, Brhane Bahrishum
dc.subject.por.fl_str_mv Open Government Data
Spatial Search
Relevance Judgment
Geographic Information Retrieval
Query Expansion
topic Open Government Data
Spatial Search
Relevance Judgment
Geographic Information Retrieval
Query Expansion
description Dissertation submitted in partial fulfilment of the requirements for the Degree of Master of Science in Geospatial Technologies
publishDate 2019
dc.date.none.fl_str_mv 2019-04-26T15:56:27Z
2019-02-05
2019-02-05T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10362/67708
TID:202228290
url http://hdl.handle.net/10362/67708
identifier_str_mv TID:202228290
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799137968165748736