Exploring the Sea: Heterogenous Geo-Referenced Data Repository
Autor(a) principal: | |
---|---|
Data de Publicação: | 2016 |
Tipo de documento: | Dissertação |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | https://hdl.handle.net/10216/85612 |
Resumo: | The marine environment is subject of an increasing attention and constitutes a dynamic and multidimensional environment, that is very demanding for data collection and update, requiring large amounts of data. This data is unique since the campaigns where it is gathered are unrepeatable, due to the existence of a wide range of factors outside the researchers' control. Moreover, funding for these campaigns can be hard to come by. Nevertheless, these datasets are often underused as they are not available to all the involved stakeholders, or involve non-interoperable formats. Currently, metadata and some of the data are registered in paper-based forms, which are later digitalized or transcribed to spreadsheets, with researchers placing emphasis in the publications rather than in the management of the collected data. Data provenance often relates to soil, water and biological samples, as well as sensors, ship routes, photos, videos, sounds and laboratorial analyses. This problem is reflected in the large BIOMETORE project that involves several teams of marine researchers lead by Instituto Português do Mar e da Atmosfera. The ultimate goal of the BIOMETORE is the achievement and maintenance of the Good Environmental Status (GES) of the European Marine Waters. This project has eight campaigns, producing large amounts of marine data that should be organized in order to enable reusability by different stakeholders. On the other hand, the SeaBioData project, lead by INESC TEC, aims at developing a georeferenced database for the BIOMETORE, that can integrate all available data and implement existing standards for data interoperability, as specified in directives such as INSPIRE. Building the database is essential to allow uniform data access by local researchers as well as the international community and, at the same time, reduce the required effort allocated to data management, promoting faster and more accurate scientific results. In order to respect the INSPIRE directive, we adopted the data model from the OGC Sensor Observation Service. This data model has already been adopted by the international community, which ensures that the implementation relies on an interoperable approach. We surveyed available technological options, as well as the datasets supplied by IPMA. We decided on the open source implementation from 52º North, since it supports the majority of the SOS model's concepts and provides a native REST API and Web Services. The 52º North data model does not support the storage of all of the data required by IPMA for internal usage. One of the main data modelling challenges was to extend the existing data model without altering the original tables, thus centralizing the data, while ensuring that the model is compliant with existing services. We had to follow the metadata structure defined by SNIMAR, which implied the study and implementation of SNIMAR's metadata profile. We followed the Darwin Core standard, in order to store more details of the taxonomic rank of the species. Furthermore, we have extended the 52º North data model, in order to address the local needs of the BIOMETORE, since the SOS model simply stored data concerning the observations, disregarding information about entities such as teams, campaigns, users, documents or responsible parties. |
id |
RCAP_a141396998b63c2ccb98f940fa5782b0 |
---|---|
oai_identifier_str |
oai:repositorio-aberto.up.pt:10216/85612 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Exploring the Sea: Heterogenous Geo-Referenced Data RepositoryEngenharia electrotécnica, electrónica e informáticaElectrical engineering, Electronic engineering, Information engineeringThe marine environment is subject of an increasing attention and constitutes a dynamic and multidimensional environment, that is very demanding for data collection and update, requiring large amounts of data. This data is unique since the campaigns where it is gathered are unrepeatable, due to the existence of a wide range of factors outside the researchers' control. Moreover, funding for these campaigns can be hard to come by. Nevertheless, these datasets are often underused as they are not available to all the involved stakeholders, or involve non-interoperable formats. Currently, metadata and some of the data are registered in paper-based forms, which are later digitalized or transcribed to spreadsheets, with researchers placing emphasis in the publications rather than in the management of the collected data. Data provenance often relates to soil, water and biological samples, as well as sensors, ship routes, photos, videos, sounds and laboratorial analyses. This problem is reflected in the large BIOMETORE project that involves several teams of marine researchers lead by Instituto Português do Mar e da Atmosfera. The ultimate goal of the BIOMETORE is the achievement and maintenance of the Good Environmental Status (GES) of the European Marine Waters. This project has eight campaigns, producing large amounts of marine data that should be organized in order to enable reusability by different stakeholders. On the other hand, the SeaBioData project, lead by INESC TEC, aims at developing a georeferenced database for the BIOMETORE, that can integrate all available data and implement existing standards for data interoperability, as specified in directives such as INSPIRE. Building the database is essential to allow uniform data access by local researchers as well as the international community and, at the same time, reduce the required effort allocated to data management, promoting faster and more accurate scientific results. In order to respect the INSPIRE directive, we adopted the data model from the OGC Sensor Observation Service. This data model has already been adopted by the international community, which ensures that the implementation relies on an interoperable approach. We surveyed available technological options, as well as the datasets supplied by IPMA. We decided on the open source implementation from 52º North, since it supports the majority of the SOS model's concepts and provides a native REST API and Web Services. The 52º North data model does not support the storage of all of the data required by IPMA for internal usage. One of the main data modelling challenges was to extend the existing data model without altering the original tables, thus centralizing the data, while ensuring that the model is compliant with existing services. We had to follow the metadata structure defined by SNIMAR, which implied the study and implementation of SNIMAR's metadata profile. We followed the Darwin Core standard, in order to store more details of the taxonomic rank of the species. Furthermore, we have extended the 52º North data model, in order to address the local needs of the BIOMETORE, since the SOS model simply stored data concerning the observations, disregarding information about entities such as teams, campaigns, users, documents or responsible parties.2016-07-182016-07-18T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttps://hdl.handle.net/10216/85612TID:201305330engInês Davim Lopes Garganta Silvainfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-29T14:03:10Zoai:repositorio-aberto.up.pt:10216/85612Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T23:53:32.209101Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Exploring the Sea: Heterogenous Geo-Referenced Data Repository |
title |
Exploring the Sea: Heterogenous Geo-Referenced Data Repository |
spellingShingle |
Exploring the Sea: Heterogenous Geo-Referenced Data Repository Inês Davim Lopes Garganta Silva Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
title_short |
Exploring the Sea: Heterogenous Geo-Referenced Data Repository |
title_full |
Exploring the Sea: Heterogenous Geo-Referenced Data Repository |
title_fullStr |
Exploring the Sea: Heterogenous Geo-Referenced Data Repository |
title_full_unstemmed |
Exploring the Sea: Heterogenous Geo-Referenced Data Repository |
title_sort |
Exploring the Sea: Heterogenous Geo-Referenced Data Repository |
author |
Inês Davim Lopes Garganta Silva |
author_facet |
Inês Davim Lopes Garganta Silva |
author_role |
author |
dc.contributor.author.fl_str_mv |
Inês Davim Lopes Garganta Silva |
dc.subject.por.fl_str_mv |
Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
topic |
Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
description |
The marine environment is subject of an increasing attention and constitutes a dynamic and multidimensional environment, that is very demanding for data collection and update, requiring large amounts of data. This data is unique since the campaigns where it is gathered are unrepeatable, due to the existence of a wide range of factors outside the researchers' control. Moreover, funding for these campaigns can be hard to come by. Nevertheless, these datasets are often underused as they are not available to all the involved stakeholders, or involve non-interoperable formats. Currently, metadata and some of the data are registered in paper-based forms, which are later digitalized or transcribed to spreadsheets, with researchers placing emphasis in the publications rather than in the management of the collected data. Data provenance often relates to soil, water and biological samples, as well as sensors, ship routes, photos, videos, sounds and laboratorial analyses. This problem is reflected in the large BIOMETORE project that involves several teams of marine researchers lead by Instituto Português do Mar e da Atmosfera. The ultimate goal of the BIOMETORE is the achievement and maintenance of the Good Environmental Status (GES) of the European Marine Waters. This project has eight campaigns, producing large amounts of marine data that should be organized in order to enable reusability by different stakeholders. On the other hand, the SeaBioData project, lead by INESC TEC, aims at developing a georeferenced database for the BIOMETORE, that can integrate all available data and implement existing standards for data interoperability, as specified in directives such as INSPIRE. Building the database is essential to allow uniform data access by local researchers as well as the international community and, at the same time, reduce the required effort allocated to data management, promoting faster and more accurate scientific results. In order to respect the INSPIRE directive, we adopted the data model from the OGC Sensor Observation Service. This data model has already been adopted by the international community, which ensures that the implementation relies on an interoperable approach. We surveyed available technological options, as well as the datasets supplied by IPMA. We decided on the open source implementation from 52º North, since it supports the majority of the SOS model's concepts and provides a native REST API and Web Services. The 52º North data model does not support the storage of all of the data required by IPMA for internal usage. One of the main data modelling challenges was to extend the existing data model without altering the original tables, thus centralizing the data, while ensuring that the model is compliant with existing services. We had to follow the metadata structure defined by SNIMAR, which implied the study and implementation of SNIMAR's metadata profile. We followed the Darwin Core standard, in order to store more details of the taxonomic rank of the species. Furthermore, we have extended the 52º North data model, in order to address the local needs of the BIOMETORE, since the SOS model simply stored data concerning the observations, disregarding information about entities such as teams, campaigns, users, documents or responsible parties. |
publishDate |
2016 |
dc.date.none.fl_str_mv |
2016-07-18 2016-07-18T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://hdl.handle.net/10216/85612 TID:201305330 |
url |
https://hdl.handle.net/10216/85612 |
identifier_str_mv |
TID:201305330 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799135855480143872 |