Exploring the Sea: Heterogenous Geo-Referenced Data Repository

Bibliographic details
Main author: Inês Davim Lopes Garganta Silva
Publication date: 2016
Document type: Dissertation
Language: eng
Source title: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Full text: https://hdl.handle.net/10216/85612
Abstract: The marine environment is the subject of increasing attention. It is a dynamic, multidimensional environment that makes data collection and updating very demanding and requires large amounts of data. These data are unique because the campaigns in which they are gathered cannot be repeated, owing to a wide range of factors outside the researchers' control. Moreover, funding for these campaigns can be hard to come by. Nevertheless, the resulting datasets are often underused because they are not available to all the stakeholders involved or are stored in non-interoperable formats. Currently, metadata and some of the data are recorded on paper forms, which are later digitized or transcribed into spreadsheets, and researchers place more emphasis on publications than on managing the collected data. The data typically originate from soil, water and biological samples, as well as sensors, ship routes, photos, videos, sounds and laboratory analyses. This problem is reflected in the large BIOMETORE project, which involves several teams of marine researchers led by Instituto Português do Mar e da Atmosfera (IPMA). The ultimate goal of BIOMETORE is to achieve and maintain the Good Environmental Status (GES) of European marine waters. The project comprises eight campaigns, producing large amounts of marine data that should be organized so that different stakeholders can reuse them. The SeaBioData project, led by INESC TEC, in turn aims to develop a georeferenced database for BIOMETORE that can integrate all available data and implement existing standards for data interoperability, as specified in directives such as INSPIRE. Building the database is essential to give local researchers and the international community uniform access to the data while reducing the effort spent on data management, promoting faster and more accurate scientific results.
To comply with the INSPIRE directive, we adopted the data model of the OGC Sensor Observation Service (SOS). This data model has already been adopted by the international community, so the implementation rests on an interoperable approach. We surveyed the available technological options, as well as the datasets supplied by IPMA, and chose the open-source implementation from 52°North, since it supports most of the SOS model's concepts and provides a native REST API and Web Services. The 52°North data model, however, does not support the storage of all the data that IPMA requires for internal use. One of the main data modelling challenges was therefore to extend the existing data model without altering its original tables, thus centralizing the data while keeping the model compliant with existing services. We had to follow the metadata structure defined by SNIMAR, which required studying and implementing SNIMAR's metadata profile. We followed the Darwin Core standard to store more detail about the taxonomic rank of species. Furthermore, we extended the 52°North data model to address the local needs of BIOMETORE, since the SOS model stores only data concerning the observations and disregards information about entities such as teams, campaigns, users, documents or responsible parties.
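The idea of extending a data model without altering its original tables can be illustrated with a minimal sketch. It is not the schema used in the dissertation: the table names `observation`, `campaign` and `observation_campaign`, their columns, and the sample rows are all hypothetical, and SQLite stands in for whatever database the actual deployment uses. The core table is created once and never modified; campaign information lives in side tables that reference it by key.

```python
import sqlite3


def build_schema(conn):
    """Create a stand-in core table plus hypothetical extension tables."""
    conn.executescript(
        """
        -- Stand-in for a table owned by the SOS implementation (never altered).
        CREATE TABLE observation (
            observation_id  INTEGER PRIMARY KEY,
            phenomenon_time TEXT NOT NULL,
            value           REAL
        );
        -- Hypothetical extension: project-specific entity.
        CREATE TABLE campaign (
            campaign_id INTEGER PRIMARY KEY,
            name        TEXT NOT NULL UNIQUE
        );
        -- Hypothetical link table: attaches a campaign to an observation
        -- without adding any column to the core table.
        CREATE TABLE observation_campaign (
            observation_id INTEGER PRIMARY KEY
                REFERENCES observation(observation_id),
            campaign_id    INTEGER NOT NULL
                REFERENCES campaign(campaign_id)
        );
        """
    )


def observations_for_campaign(conn, name):
    """Return core observation rows that belong to the named campaign."""
    return conn.execute(
        """
        SELECT o.observation_id, o.phenomenon_time, o.value
        FROM observation AS o
        JOIN observation_campaign AS oc ON oc.observation_id = o.observation_id
        JOIN campaign AS c ON c.campaign_id = oc.campaign_id
        WHERE c.name = ?
        ORDER BY o.observation_id
        """,
        (name,),
    ).fetchall()


conn = sqlite3.connect(":memory:")
build_schema(conn)

# Made-up sample rows, for illustration only.
conn.execute("INSERT INTO observation VALUES (1, '2016-07-18T00:00:00Z', 17.4)")
conn.execute("INSERT INTO observation VALUES (2, '2016-07-18T01:00:00Z', 17.1)")
conn.execute("INSERT INTO campaign VALUES (1, 'campaign-A')")
conn.execute("INSERT INTO observation_campaign VALUES (1, 1)")

result = observations_for_campaign(conn, "campaign-A")
core_columns = [row[1] for row in conn.execute("PRAGMA table_info(observation)")]
print(result)
print(core_columns)
```

Because the core table keeps its original columns, a service that reads only `observation` would see the same structure as before, while internal queries can join through the link table to reach the extra entities.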
id RCAP_a141396998b63c2ccb98f940fa5782b0
oai_identifier_str oai:repositorio-aberto.up.pt:10216/85612
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
author Inês Davim Lopes Garganta Silva
author_facet Inês Davim Lopes Garganta Silva
author_role author
dc.contributor.author.fl_str_mv Inês Davim Lopes Garganta Silva
dc.subject.por.fl_str_mv Engenharia electrotécnica, electrónica e informática
Electrical engineering, Electronic engineering, Information engineering
topic Engenharia electrotécnica, electrónica e informática
Electrical engineering, Electronic engineering, Information engineering
publishDate 2016
dc.date.none.fl_str_mv 2016-07-18
2016-07-18T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://hdl.handle.net/10216/85612
TID:201305330
url https://hdl.handle.net/10216/85612
identifier_str_mv TID:201305330
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
_version_ 1799135855480143872