Forecasting shellfish contamination by marine biotoxins based onmultivariate time series
Autor(a) principal: | |
---|---|
Data de Publicação: | 2022 |
Tipo de documento: | Dissertação |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10362/139558 |
Resumo: | Shellfish production has been growing in recent years, having a high impact in several Portuguese coastal regions. However, this resource can be contaminated by marine biotoxins produced by toxic phytoplankton. The consumption of contaminated shellfish can cause serious health problems, and hence the harvesting and commercialisation of this product are prohibited whenever biotoxin concentration exceeds the safety limits. Since this prohibition leads to severe economic losses, it becomes necessary to develop strategies that predict shellfish contamination. Biotoxin concentration in bivalve molluscs can be predicted using univariate and multivariate time series, by modelling past information to predict the future. These time series include historical data on in-situ measurements of biotoxin concentration in several shellfish species, as well as other biological and meteorological data. In this thesis, multiple time series were acquired from different sources, integrated and pre-processed. Afterwards, various univariate andmultivariate time series forecasting methods were developed to predict mussel contamination in multiple production areas. In this context, autoregressive models and artificial neural networks (ANNs), such as feed-forward, convolutional and long short-term memory (LSTM) networks, were tested. Additionally, various data preparation and feature engineering methods were explored to improve these models. The forecasting models were evaluated and compared in order to determine which are the most suitable to solve the problem at hand. The results showed that the ANNs, namely networks trained on data whose dimension had previously been reduced using an autoencoder and networks trained on univariate time series, outperformed the classic autoregressive models. Moreover, among the ANN models, the LSTMs were very accurate, especially at one-week ahead predictions. Finally, the multivariate models did not outperform the univariate models, which may be explained by the fact that the additional variables used in this thesis did not provide relevant information to forecast shellfish contamination. These results might be regarded as the first pivotal steps towards the development of a model-based forecasting tool, which will allow the production sector to anticipate the harvesting prohibition, enabling the development of strategies to mitigate the economic losses inherent to this situation. |
id |
RCAP_bd51bdfe05872d1779004f80dc0254d4 |
---|---|
oai_identifier_str |
oai:run.unl.pt:10362/139558 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Forecasting shellfish contamination by marine biotoxins based onmultivariate time seriesShellfish ContaminationTime SeriesForecastingArtificial Neural NetworksFeed-Forward Neural NetworksConvolutional Neural NetworksDomínio/Área Científica::Engenharia e Tecnologia::Engenharia Eletrotécnica, Eletrónica e InformáticaShellfish production has been growing in recent years, having a high impact in several Portuguese coastal regions. However, this resource can be contaminated by marine biotoxins produced by toxic phytoplankton. The consumption of contaminated shellfish can cause serious health problems, and hence the harvesting and commercialisation of this product are prohibited whenever biotoxin concentration exceeds the safety limits. Since this prohibition leads to severe economic losses, it becomes necessary to develop strategies that predict shellfish contamination. Biotoxin concentration in bivalve molluscs can be predicted using univariate and multivariate time series, by modelling past information to predict the future. These time series include historical data on in-situ measurements of biotoxin concentration in several shellfish species, as well as other biological and meteorological data. In this thesis, multiple time series were acquired from different sources, integrated and pre-processed. Afterwards, various univariate andmultivariate time series forecasting methods were developed to predict mussel contamination in multiple production areas. In this context, autoregressive models and artificial neural networks (ANNs), such as feed-forward, convolutional and long short-term memory (LSTM) networks, were tested. Additionally, various data preparation and feature engineering methods were explored to improve these models. The forecasting models were evaluated and compared in order to determine which are the most suitable to solve the problem at hand. The results showed that the ANNs, namely networks trained on data whose dimension had previously been reduced using an autoencoder and networks trained on univariate time series, outperformed the classic autoregressive models. Moreover, among the ANN models, the LSTMs were very accurate, especially at one-week ahead predictions. Finally, the multivariate models did not outperform the univariate models, which may be explained by the fact that the additional variables used in this thesis did not provide relevant information to forecast shellfish contamination. These results might be regarded as the first pivotal steps towards the development of a model-based forecasting tool, which will allow the production sector to anticipate the harvesting prohibition, enabling the development of strategies to mitigate the economic losses inherent to this situation.A produção de moluscos bivalves tem vindo a crescer nos últimos anos, possuindo um elevado impacto em diversas regiões costeiras portuguesas. No entanto, este recurso pode ser contaminado por biotoxinas produzidas por fitoplâncton tóxico, sendo que a sua apanha e comercialização é interdita quando a concentração de biotoxinas atinge valores demasiado elevados. Esta interdição leva a perdas económicas significativas, tornando-se necessário desenvolver estratégias que prevejam a contaminação de bivalves. A concentração de biotoxinas em bivalves pode ser prevista utilizando séries temporais univariadas e multivariadas, que permitem prever o futuro com base em informação sobre o passado. Estas séries temporais incluem dados históricos de medições in-situ da concentração de biotoxinas em várias espécies de bivalves, bem como outras variáveis biológicas e climatéricas. Nesta tese, múltiplas séries temporais foram adquiridas a partir de diferentes fontes, integradas e pré-processadas. De seguida, foram desenvolvidos vários métodos de previsão de séries temporais para prever a contaminação de bivalves em múltiplas áreas de produção. Neste contexto, foram testados modelos autorregressivos e redes neuronais artificiais (ANNs), tais como redes feed-forward, de convolução e de memória a curto prazo (LSTM). Adicionalmente, foram explorados vários métodos de preparação dos dados e de engenharia de features, de forma a melhorar os modelos implementados. Estes modelos foram avaliados e comparados, de forma a determinar quais os mais adequados para resolver o problema em questão. Os resultados mostraram que as ANNs, nomeadamente redes treinadas em dados com a dimensão reduzida por um autoencoder e redes treinadas em séries univariadas, obtiveram melhor performance que os modelos autorregressivos. Além disso, de entre as ANNs, as LSTMs obtiveram resultados bastante precisos, especialmente em previsões uma semana no futuro. Por fim, os modelos multivariados não tiveram melhor performance que os modelos univariados, o que pode ser devido ao facto de as variáveis adicionais usadas neste tese não fornecerem informação relevante sobre a contaminação de bivalves. Estes resultados podem ser considerados os primeiros passos para o desenvolvimento de uma ferramenta de previsão baseada em modelos, que permitirá ao setor produtivo antecipar a interdição da apanha de bivalves e desenvolver estratégias de mitigação das perdas económicas inerentes a esta situação.Lopes, MartaKrippahl, LudwigRUNCruz, Rafaela Carreira Eleutério Gregório da2022-06-07T10:09:34Z2022-022022-02-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10362/139558enginfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-03-11T05:16:50Zoai:run.unl.pt:10362/139558Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T03:49:27.369816Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Forecasting shellfish contamination by marine biotoxins based onmultivariate time series |
title |
Forecasting shellfish contamination by marine biotoxins based onmultivariate time series |
spellingShingle |
Forecasting shellfish contamination by marine biotoxins based onmultivariate time series Cruz, Rafaela Carreira Eleutério Gregório da Shellfish Contamination Time Series Forecasting Artificial Neural Networks Feed-Forward Neural Networks Convolutional Neural Networks Domínio/Área Científica::Engenharia e Tecnologia::Engenharia Eletrotécnica, Eletrónica e Informática |
title_short |
Forecasting shellfish contamination by marine biotoxins based onmultivariate time series |
title_full |
Forecasting shellfish contamination by marine biotoxins based onmultivariate time series |
title_fullStr |
Forecasting shellfish contamination by marine biotoxins based onmultivariate time series |
title_full_unstemmed |
Forecasting shellfish contamination by marine biotoxins based onmultivariate time series |
title_sort |
Forecasting shellfish contamination by marine biotoxins based onmultivariate time series |
author |
Cruz, Rafaela Carreira Eleutério Gregório da |
author_facet |
Cruz, Rafaela Carreira Eleutério Gregório da |
author_role |
author |
dc.contributor.none.fl_str_mv |
Lopes, Marta Krippahl, Ludwig RUN |
dc.contributor.author.fl_str_mv |
Cruz, Rafaela Carreira Eleutério Gregório da |
dc.subject.por.fl_str_mv |
Shellfish Contamination Time Series Forecasting Artificial Neural Networks Feed-Forward Neural Networks Convolutional Neural Networks Domínio/Área Científica::Engenharia e Tecnologia::Engenharia Eletrotécnica, Eletrónica e Informática |
topic |
Shellfish Contamination Time Series Forecasting Artificial Neural Networks Feed-Forward Neural Networks Convolutional Neural Networks Domínio/Área Científica::Engenharia e Tecnologia::Engenharia Eletrotécnica, Eletrónica e Informática |
description |
Shellfish production has been growing in recent years, having a high impact in several Portuguese coastal regions. However, this resource can be contaminated by marine biotoxins produced by toxic phytoplankton. The consumption of contaminated shellfish can cause serious health problems, and hence the harvesting and commercialisation of this product are prohibited whenever biotoxin concentration exceeds the safety limits. Since this prohibition leads to severe economic losses, it becomes necessary to develop strategies that predict shellfish contamination. Biotoxin concentration in bivalve molluscs can be predicted using univariate and multivariate time series, by modelling past information to predict the future. These time series include historical data on in-situ measurements of biotoxin concentration in several shellfish species, as well as other biological and meteorological data. In this thesis, multiple time series were acquired from different sources, integrated and pre-processed. Afterwards, various univariate andmultivariate time series forecasting methods were developed to predict mussel contamination in multiple production areas. In this context, autoregressive models and artificial neural networks (ANNs), such as feed-forward, convolutional and long short-term memory (LSTM) networks, were tested. Additionally, various data preparation and feature engineering methods were explored to improve these models. The forecasting models were evaluated and compared in order to determine which are the most suitable to solve the problem at hand. The results showed that the ANNs, namely networks trained on data whose dimension had previously been reduced using an autoencoder and networks trained on univariate time series, outperformed the classic autoregressive models. Moreover, among the ANN models, the LSTMs were very accurate, especially at one-week ahead predictions. Finally, the multivariate models did not outperform the univariate models, which may be explained by the fact that the additional variables used in this thesis did not provide relevant information to forecast shellfish contamination. These results might be regarded as the first pivotal steps towards the development of a model-based forecasting tool, which will allow the production sector to anticipate the harvesting prohibition, enabling the development of strategies to mitigate the economic losses inherent to this situation. |
publishDate |
2022 |
dc.date.none.fl_str_mv |
2022-06-07T10:09:34Z 2022-02 2022-02-01T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10362/139558 |
url |
http://hdl.handle.net/10362/139558 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799138093424443392 |