Cluster-based analogue ensembles for hindcasting with multistations

Detalhes bibliográficos
Autor(a) principal: Balsa, Carlos
Data de Publicação: 2022
Outros Autores: Rodrigues, Carlos Veiga, Araújo, Leonardo Oliveira, Rufino, José
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10198/25905
Resumo: The Analogue Ensemble (AnEn) method enables the reconstruction of meteorological observations or deterministic predictions for a certain variable and station by using data from the same station or from other nearby stations. However, depending on the dimension and granularity of the historical datasets used for the reconstruction, this method may be computationally very demanding even if parallelization is used. In this work, the classical AnEn method is modified so that analogues are determined using K-means clustering. The proposed combined approach allows the use of several predictors in a dependent or independent way. As a result of the flexibility and adaptability of this new approach, it is necessary to define several parameters and algorithmic options. The effects of the critical parameters and main options were tested on a large dataset from real-world meteorological stations. The results show that adequate monitoring and tuning of the new method allows for a considerable improvement of the computational performance of the reconstruction task while keeping the accuracy of the results. Compared to the classical AnEn method, the proposed variant is at least 15-times faster when processing is serial. Both approaches benefit from parallel processing, with the K-means variant also being always faster than the classic method under that execution regime (albeit its performance advantage diminishes as more CPU threads are used).
id RCAP_ea0a71430fc08bb9952344d5603e8383
oai_identifier_str oai:bibliotecadigital.ipb.pt:10198/25905
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Cluster-based analogue ensembles for hindcasting with multistationsHindcastingMeteorological datasetAnalogue ensembleK-meansTime-seriesThe Analogue Ensemble (AnEn) method enables the reconstruction of meteorological observations or deterministic predictions for a certain variable and station by using data from the same station or from other nearby stations. However, depending on the dimension and granularity of the historical datasets used for the reconstruction, this method may be computationally very demanding even if parallelization is used. In this work, the classical AnEn method is modified so that analogues are determined using K-means clustering. The proposed combined approach allows the use of several predictors in a dependent or independent way. As a result of the flexibility and adaptability of this new approach, it is necessary to define several parameters and algorithmic options. The effects of the critical parameters and main options were tested on a large dataset from real-world meteorological stations. The results show that adequate monitoring and tuning of the new method allows for a considerable improvement of the computational performance of the reconstruction task while keeping the accuracy of the results. Compared to the classical AnEn method, the proposed variant is at least 15-times faster when processing is serial. Both approaches benefit from parallel processing, with the K-means variant also being always faster than the classic method under that execution regime (albeit its performance advantage diminishes as more CPU threads are used).This work has been partially supported by FCT—Fundação para a Ciência e Tecnologia within the Project Scope: UIDB/05757/2020.MDPIBiblioteca Digital do IPBBalsa, CarlosRodrigues, Carlos VeigaAraújo, Leonardo OliveiraRufino, José2022-09-14T14:14:36Z20222022-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10198/25905engBalsa, Carlos; Rodrigues, Carlos Veiga; Araújo, Leonardo; Rufino, José (2022). Cluster-based analogue ensembles for hindcasting with multistations. Computations. EISSN 2079-3197. 10:6, p. 1-2110.3390/computation100600912079-3197info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-21T10:57:58Zoai:bibliotecadigital.ipb.pt:10198/25905Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T23:16:29.577126Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Cluster-based analogue ensembles for hindcasting with multistations
title Cluster-based analogue ensembles for hindcasting with multistations
spellingShingle Cluster-based analogue ensembles for hindcasting with multistations
Balsa, Carlos
Hindcasting
Meteorological dataset
Analogue ensemble
K-means
Time-series
title_short Cluster-based analogue ensembles for hindcasting with multistations
title_full Cluster-based analogue ensembles for hindcasting with multistations
title_fullStr Cluster-based analogue ensembles for hindcasting with multistations
title_full_unstemmed Cluster-based analogue ensembles for hindcasting with multistations
title_sort Cluster-based analogue ensembles for hindcasting with multistations
author Balsa, Carlos
author_facet Balsa, Carlos
Rodrigues, Carlos Veiga
Araújo, Leonardo Oliveira
Rufino, José
author_role author
author2 Rodrigues, Carlos Veiga
Araújo, Leonardo Oliveira
Rufino, José
author2_role author
author
author
dc.contributor.none.fl_str_mv Biblioteca Digital do IPB
dc.contributor.author.fl_str_mv Balsa, Carlos
Rodrigues, Carlos Veiga
Araújo, Leonardo Oliveira
Rufino, José
dc.subject.por.fl_str_mv Hindcasting
Meteorological dataset
Analogue ensemble
K-means
Time-series
topic Hindcasting
Meteorological dataset
Analogue ensemble
K-means
Time-series
description The Analogue Ensemble (AnEn) method enables the reconstruction of meteorological observations or deterministic predictions for a certain variable and station by using data from the same station or from other nearby stations. However, depending on the dimension and granularity of the historical datasets used for the reconstruction, this method may be computationally very demanding even if parallelization is used. In this work, the classical AnEn method is modified so that analogues are determined using K-means clustering. The proposed combined approach allows the use of several predictors in a dependent or independent way. As a result of the flexibility and adaptability of this new approach, it is necessary to define several parameters and algorithmic options. The effects of the critical parameters and main options were tested on a large dataset from real-world meteorological stations. The results show that adequate monitoring and tuning of the new method allows for a considerable improvement of the computational performance of the reconstruction task while keeping the accuracy of the results. Compared to the classical AnEn method, the proposed variant is at least 15-times faster when processing is serial. Both approaches benefit from parallel processing, with the K-means variant also being always faster than the classic method under that execution regime (albeit its performance advantage diminishes as more CPU threads are used).
publishDate 2022
dc.date.none.fl_str_mv 2022-09-14T14:14:36Z
2022
2022-01-01T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10198/25905
url http://hdl.handle.net/10198/25905
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Balsa, Carlos; Rodrigues, Carlos Veiga; Araújo, Leonardo; Rufino, José (2022). Cluster-based analogue ensembles for hindcasting with multistations. Computations. EISSN 2079-3197. 10:6, p. 1-21
10.3390/computation10060091
2079-3197
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv MDPI
publisher.none.fl_str_mv MDPI
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799135450253754368