Forecasting shellfish contamination by marine biotoxins based onmultivariate time series

Detalhes bibliográficos
Autor(a) principal: Cruz, Rafaela Carreira Eleutério Gregório da
Data de Publicação: 2022
Tipo de documento: Dissertação
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10362/139558
Resumo: Shellfish production has been growing in recent years, having a high impact in several Portuguese coastal regions. However, this resource can be contaminated by marine biotoxins produced by toxic phytoplankton. The consumption of contaminated shellfish can cause serious health problems, and hence the harvesting and commercialisation of this product are prohibited whenever biotoxin concentration exceeds the safety limits. Since this prohibition leads to severe economic losses, it becomes necessary to develop strategies that predict shellfish contamination. Biotoxin concentration in bivalve molluscs can be predicted using univariate and multivariate time series, by modelling past information to predict the future. These time series include historical data on in-situ measurements of biotoxin concentration in several shellfish species, as well as other biological and meteorological data. In this thesis, multiple time series were acquired from different sources, integrated and pre-processed. Afterwards, various univariate andmultivariate time series forecasting methods were developed to predict mussel contamination in multiple production areas. In this context, autoregressive models and artificial neural networks (ANNs), such as feed-forward, convolutional and long short-term memory (LSTM) networks, were tested. Additionally, various data preparation and feature engineering methods were explored to improve these models. The forecasting models were evaluated and compared in order to determine which are the most suitable to solve the problem at hand. The results showed that the ANNs, namely networks trained on data whose dimension had previously been reduced using an autoencoder and networks trained on univariate time series, outperformed the classic autoregressive models. Moreover, among the ANN models, the LSTMs were very accurate, especially at one-week ahead predictions. Finally, the multivariate models did not outperform the univariate models, which may be explained by the fact that the additional variables used in this thesis did not provide relevant information to forecast shellfish contamination. These results might be regarded as the first pivotal steps towards the development of a model-based forecasting tool, which will allow the production sector to anticipate the harvesting prohibition, enabling the development of strategies to mitigate the economic losses inherent to this situation.
id RCAP_bd51bdfe05872d1779004f80dc0254d4
oai_identifier_str oai:run.unl.pt:10362/139558
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Forecasting shellfish contamination by marine biotoxins based onmultivariate time seriesShellfish ContaminationTime SeriesForecastingArtificial Neural NetworksFeed-Forward Neural NetworksConvolutional Neural NetworksDomínio/Área Científica::Engenharia e Tecnologia::Engenharia Eletrotécnica, Eletrónica e InformáticaShellfish production has been growing in recent years, having a high impact in several Portuguese coastal regions. However, this resource can be contaminated by marine biotoxins produced by toxic phytoplankton. The consumption of contaminated shellfish can cause serious health problems, and hence the harvesting and commercialisation of this product are prohibited whenever biotoxin concentration exceeds the safety limits. Since this prohibition leads to severe economic losses, it becomes necessary to develop strategies that predict shellfish contamination. Biotoxin concentration in bivalve molluscs can be predicted using univariate and multivariate time series, by modelling past information to predict the future. These time series include historical data on in-situ measurements of biotoxin concentration in several shellfish species, as well as other biological and meteorological data. In this thesis, multiple time series were acquired from different sources, integrated and pre-processed. Afterwards, various univariate andmultivariate time series forecasting methods were developed to predict mussel contamination in multiple production areas. In this context, autoregressive models and artificial neural networks (ANNs), such as feed-forward, convolutional and long short-term memory (LSTM) networks, were tested. Additionally, various data preparation and feature engineering methods were explored to improve these models. The forecasting models were evaluated and compared in order to determine which are the most suitable to solve the problem at hand. The results showed that the ANNs, namely networks trained on data whose dimension had previously been reduced using an autoencoder and networks trained on univariate time series, outperformed the classic autoregressive models. Moreover, among the ANN models, the LSTMs were very accurate, especially at one-week ahead predictions. Finally, the multivariate models did not outperform the univariate models, which may be explained by the fact that the additional variables used in this thesis did not provide relevant information to forecast shellfish contamination. These results might be regarded as the first pivotal steps towards the development of a model-based forecasting tool, which will allow the production sector to anticipate the harvesting prohibition, enabling the development of strategies to mitigate the economic losses inherent to this situation.A produção de moluscos bivalves tem vindo a crescer nos últimos anos, possuindo um elevado impacto em diversas regiões costeiras portuguesas. No entanto, este recurso pode ser contaminado por biotoxinas produzidas por fitoplâncton tóxico, sendo que a sua apanha e comercialização é interdita quando a concentração de biotoxinas atinge valores demasiado elevados. Esta interdição leva a perdas económicas significativas, tornando-se necessário desenvolver estratégias que prevejam a contaminação de bivalves. A concentração de biotoxinas em bivalves pode ser prevista utilizando séries temporais univariadas e multivariadas, que permitem prever o futuro com base em informação sobre o passado. Estas séries temporais incluem dados históricos de medições in-situ da concentração de biotoxinas em várias espécies de bivalves, bem como outras variáveis biológicas e climatéricas. Nesta tese, múltiplas séries temporais foram adquiridas a partir de diferentes fontes, integradas e pré-processadas. De seguida, foram desenvolvidos vários métodos de previsão de séries temporais para prever a contaminação de bivalves em múltiplas áreas de produção. Neste contexto, foram testados modelos autorregressivos e redes neuronais artificiais (ANNs), tais como redes feed-forward, de convolução e de memória a curto prazo (LSTM). Adicionalmente, foram explorados vários métodos de preparação dos dados e de engenharia de features, de forma a melhorar os modelos implementados. Estes modelos foram avaliados e comparados, de forma a determinar quais os mais adequados para resolver o problema em questão. Os resultados mostraram que as ANNs, nomeadamente redes treinadas em dados com a dimensão reduzida por um autoencoder e redes treinadas em séries univariadas, obtiveram melhor performance que os modelos autorregressivos. Além disso, de entre as ANNs, as LSTMs obtiveram resultados bastante precisos, especialmente em previsões uma semana no futuro. Por fim, os modelos multivariados não tiveram melhor performance que os modelos univariados, o que pode ser devido ao facto de as variáveis adicionais usadas neste tese não fornecerem informação relevante sobre a contaminação de bivalves. Estes resultados podem ser considerados os primeiros passos para o desenvolvimento de uma ferramenta de previsão baseada em modelos, que permitirá ao setor produtivo antecipar a interdição da apanha de bivalves e desenvolver estratégias de mitigação das perdas económicas inerentes a esta situação.Lopes, MartaKrippahl, LudwigRUNCruz, Rafaela Carreira Eleutério Gregório da2022-06-07T10:09:34Z2022-022022-02-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10362/139558enginfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-03-11T05:16:50Zoai:run.unl.pt:10362/139558Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T03:49:27.369816Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Forecasting shellfish contamination by marine biotoxins based onmultivariate time series
title Forecasting shellfish contamination by marine biotoxins based onmultivariate time series
spellingShingle Forecasting shellfish contamination by marine biotoxins based onmultivariate time series
Cruz, Rafaela Carreira Eleutério Gregório da
Shellfish Contamination
Time Series
Forecasting
Artificial Neural Networks
Feed-Forward Neural Networks
Convolutional Neural Networks
Domínio/Área Científica::Engenharia e Tecnologia::Engenharia Eletrotécnica, Eletrónica e Informática
title_short Forecasting shellfish contamination by marine biotoxins based onmultivariate time series
title_full Forecasting shellfish contamination by marine biotoxins based onmultivariate time series
title_fullStr Forecasting shellfish contamination by marine biotoxins based onmultivariate time series
title_full_unstemmed Forecasting shellfish contamination by marine biotoxins based onmultivariate time series
title_sort Forecasting shellfish contamination by marine biotoxins based onmultivariate time series
author Cruz, Rafaela Carreira Eleutério Gregório da
author_facet Cruz, Rafaela Carreira Eleutério Gregório da
author_role author
dc.contributor.none.fl_str_mv Lopes, Marta
Krippahl, Ludwig
RUN
dc.contributor.author.fl_str_mv Cruz, Rafaela Carreira Eleutério Gregório da
dc.subject.por.fl_str_mv Shellfish Contamination
Time Series
Forecasting
Artificial Neural Networks
Feed-Forward Neural Networks
Convolutional Neural Networks
Domínio/Área Científica::Engenharia e Tecnologia::Engenharia Eletrotécnica, Eletrónica e Informática
topic Shellfish Contamination
Time Series
Forecasting
Artificial Neural Networks
Feed-Forward Neural Networks
Convolutional Neural Networks
Domínio/Área Científica::Engenharia e Tecnologia::Engenharia Eletrotécnica, Eletrónica e Informática
description Shellfish production has been growing in recent years, having a high impact in several Portuguese coastal regions. However, this resource can be contaminated by marine biotoxins produced by toxic phytoplankton. The consumption of contaminated shellfish can cause serious health problems, and hence the harvesting and commercialisation of this product are prohibited whenever biotoxin concentration exceeds the safety limits. Since this prohibition leads to severe economic losses, it becomes necessary to develop strategies that predict shellfish contamination. Biotoxin concentration in bivalve molluscs can be predicted using univariate and multivariate time series, by modelling past information to predict the future. These time series include historical data on in-situ measurements of biotoxin concentration in several shellfish species, as well as other biological and meteorological data. In this thesis, multiple time series were acquired from different sources, integrated and pre-processed. Afterwards, various univariate andmultivariate time series forecasting methods were developed to predict mussel contamination in multiple production areas. In this context, autoregressive models and artificial neural networks (ANNs), such as feed-forward, convolutional and long short-term memory (LSTM) networks, were tested. Additionally, various data preparation and feature engineering methods were explored to improve these models. The forecasting models were evaluated and compared in order to determine which are the most suitable to solve the problem at hand. The results showed that the ANNs, namely networks trained on data whose dimension had previously been reduced using an autoencoder and networks trained on univariate time series, outperformed the classic autoregressive models. Moreover, among the ANN models, the LSTMs were very accurate, especially at one-week ahead predictions. Finally, the multivariate models did not outperform the univariate models, which may be explained by the fact that the additional variables used in this thesis did not provide relevant information to forecast shellfish contamination. These results might be regarded as the first pivotal steps towards the development of a model-based forecasting tool, which will allow the production sector to anticipate the harvesting prohibition, enabling the development of strategies to mitigate the economic losses inherent to this situation.
publishDate 2022
dc.date.none.fl_str_mv 2022-06-07T10:09:34Z
2022-02
2022-02-01T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10362/139558
url http://hdl.handle.net/10362/139558
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799138093424443392