Geospatial database generation from digital newspapers: use case for risk and disaster domains.
Autor(a) principal: | |
---|---|
Data de Publicação: | 2010 |
Tipo de documento: | Dissertação |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10362/5637 |
Resumo: | Dissertation submitted in partial fulfilment of the requirements for the Degree of Master of Science in Geospatial Technologies. |
id |
RCAP_677867f3007191c449eaa12c3a6de38f |
---|---|
oai_identifier_str |
oai:run.unl.pt:10362/5637 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Geospatial database generation from digital newspapers: use case for risk and disaster domains.Geographic information systemsNatural languageDigital newspapersRisk domainsDisaster domainsDissertation submitted in partial fulfilment of the requirements for the Degree of Master of Science in Geospatial Technologies.The generation of geospatial databases is expensive in terms of time and money. Many geospatial users still lack spatial data. Geographic Information Extraction and Retrieval systems can alleviate this problem. This work proposes a method to populate spatial databases automatically from the Web. It applies the approach to the risk and disaster domain taking digital newspapers as a data source. News stories on digital newspapers contain rich thematic information that can be attached to places. The use case of automating spatial database generation is applied to Mexico using placenames. In Mexico, small and medium disasters occur most years. The facts about these are frequently mentioned in newspapers but rarely stored as records in national databases. Therefore, it is difficult to estimate human and material losses of those events. This work present two ways to extract information from digital news using natural languages techniques for distilling the text, and the national gazetteer codes to achieve placename-attribute disambiguation. Two outputs are presented; a general one that exposes highly relevant news, and another that attaches attributes of interest to placenames. The later achieved a 75% rate of thematic relevance under qualitative analysis.Berlanga-Llavori, RafaelKeßler, CarstenNeto, Miguel de Castro Simões FerreiraRUNPreciado López, Julio César2011-05-17T15:24:52Z2010-04-032010-04-03T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10362/5637enginfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-03-11T03:36:21Zoai:run.unl.pt:10362/5637Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T03:16:24.844869Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Geospatial database generation from digital newspapers: use case for risk and disaster domains. |
title |
Geospatial database generation from digital newspapers: use case for risk and disaster domains. |
spellingShingle |
Geospatial database generation from digital newspapers: use case for risk and disaster domains. Preciado López, Julio César Geographic information systems Natural language Digital newspapers Risk domains Disaster domains |
title_short |
Geospatial database generation from digital newspapers: use case for risk and disaster domains. |
title_full |
Geospatial database generation from digital newspapers: use case for risk and disaster domains. |
title_fullStr |
Geospatial database generation from digital newspapers: use case for risk and disaster domains. |
title_full_unstemmed |
Geospatial database generation from digital newspapers: use case for risk and disaster domains. |
title_sort |
Geospatial database generation from digital newspapers: use case for risk and disaster domains. |
author |
Preciado López, Julio César |
author_facet |
Preciado López, Julio César |
author_role |
author |
dc.contributor.none.fl_str_mv |
Berlanga-Llavori, Rafael Keßler, Carsten Neto, Miguel de Castro Simões Ferreira RUN |
dc.contributor.author.fl_str_mv |
Preciado López, Julio César |
dc.subject.por.fl_str_mv |
Geographic information systems Natural language Digital newspapers Risk domains Disaster domains |
topic |
Geographic information systems Natural language Digital newspapers Risk domains Disaster domains |
description |
Dissertation submitted in partial fulfilment of the requirements for the Degree of Master of Science in Geospatial Technologies. |
publishDate |
2010 |
dc.date.none.fl_str_mv |
2010-04-03 2010-04-03T00:00:00Z 2011-05-17T15:24:52Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10362/5637 |
url |
http://hdl.handle.net/10362/5637 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799137814167683072 |