Cluster-based analogue ensembles for hindcasting with multistations
Autor(a) principal: | |
---|---|
Data de Publicação: | 2022 |
Outros Autores: | , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10198/25905 |
Resumo: | The Analogue Ensemble (AnEn) method enables the reconstruction of meteorological observations or deterministic predictions for a certain variable and station by using data from the same station or from other nearby stations. However, depending on the dimension and granularity of the historical datasets used for the reconstruction, this method may be computationally very demanding even if parallelization is used. In this work, the classical AnEn method is modified so that analogues are determined using K-means clustering. The proposed combined approach allows the use of several predictors in a dependent or independent way. As a result of the flexibility and adaptability of this new approach, it is necessary to define several parameters and algorithmic options. The effects of the critical parameters and main options were tested on a large dataset from real-world meteorological stations. The results show that adequate monitoring and tuning of the new method allows for a considerable improvement of the computational performance of the reconstruction task while keeping the accuracy of the results. Compared to the classical AnEn method, the proposed variant is at least 15-times faster when processing is serial. Both approaches benefit from parallel processing, with the K-means variant also being always faster than the classic method under that execution regime (albeit its performance advantage diminishes as more CPU threads are used). |
id |
RCAP_ea0a71430fc08bb9952344d5603e8383 |
---|---|
oai_identifier_str |
oai:bibliotecadigital.ipb.pt:10198/25905 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Cluster-based analogue ensembles for hindcasting with multistationsHindcastingMeteorological datasetAnalogue ensembleK-meansTime-seriesThe Analogue Ensemble (AnEn) method enables the reconstruction of meteorological observations or deterministic predictions for a certain variable and station by using data from the same station or from other nearby stations. However, depending on the dimension and granularity of the historical datasets used for the reconstruction, this method may be computationally very demanding even if parallelization is used. In this work, the classical AnEn method is modified so that analogues are determined using K-means clustering. The proposed combined approach allows the use of several predictors in a dependent or independent way. As a result of the flexibility and adaptability of this new approach, it is necessary to define several parameters and algorithmic options. The effects of the critical parameters and main options were tested on a large dataset from real-world meteorological stations. The results show that adequate monitoring and tuning of the new method allows for a considerable improvement of the computational performance of the reconstruction task while keeping the accuracy of the results. Compared to the classical AnEn method, the proposed variant is at least 15-times faster when processing is serial. Both approaches benefit from parallel processing, with the K-means variant also being always faster than the classic method under that execution regime (albeit its performance advantage diminishes as more CPU threads are used).This work has been partially supported by FCT—Fundação para a Ciência e Tecnologia within the Project Scope: UIDB/05757/2020.MDPIBiblioteca Digital do IPBBalsa, CarlosRodrigues, Carlos VeigaAraújo, Leonardo OliveiraRufino, José2022-09-14T14:14:36Z20222022-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10198/25905engBalsa, Carlos; Rodrigues, Carlos Veiga; Araújo, Leonardo; Rufino, José (2022). Cluster-based analogue ensembles for hindcasting with multistations. Computations. EISSN 2079-3197. 10:6, p. 1-2110.3390/computation100600912079-3197info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-21T10:57:58Zoai:bibliotecadigital.ipb.pt:10198/25905Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T23:16:29.577126Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Cluster-based analogue ensembles for hindcasting with multistations |
title |
Cluster-based analogue ensembles for hindcasting with multistations |
spellingShingle |
Cluster-based analogue ensembles for hindcasting with multistations Balsa, Carlos Hindcasting Meteorological dataset Analogue ensemble K-means Time-series |
title_short |
Cluster-based analogue ensembles for hindcasting with multistations |
title_full |
Cluster-based analogue ensembles for hindcasting with multistations |
title_fullStr |
Cluster-based analogue ensembles for hindcasting with multistations |
title_full_unstemmed |
Cluster-based analogue ensembles for hindcasting with multistations |
title_sort |
Cluster-based analogue ensembles for hindcasting with multistations |
author |
Balsa, Carlos |
author_facet |
Balsa, Carlos Rodrigues, Carlos Veiga Araújo, Leonardo Oliveira Rufino, José |
author_role |
author |
author2 |
Rodrigues, Carlos Veiga Araújo, Leonardo Oliveira Rufino, José |
author2_role |
author author author |
dc.contributor.none.fl_str_mv |
Biblioteca Digital do IPB |
dc.contributor.author.fl_str_mv |
Balsa, Carlos Rodrigues, Carlos Veiga Araújo, Leonardo Oliveira Rufino, José |
dc.subject.por.fl_str_mv |
Hindcasting Meteorological dataset Analogue ensemble K-means Time-series |
topic |
Hindcasting Meteorological dataset Analogue ensemble K-means Time-series |
description |
The Analogue Ensemble (AnEn) method enables the reconstruction of meteorological observations or deterministic predictions for a certain variable and station by using data from the same station or from other nearby stations. However, depending on the dimension and granularity of the historical datasets used for the reconstruction, this method may be computationally very demanding even if parallelization is used. In this work, the classical AnEn method is modified so that analogues are determined using K-means clustering. The proposed combined approach allows the use of several predictors in a dependent or independent way. As a result of the flexibility and adaptability of this new approach, it is necessary to define several parameters and algorithmic options. The effects of the critical parameters and main options were tested on a large dataset from real-world meteorological stations. The results show that adequate monitoring and tuning of the new method allows for a considerable improvement of the computational performance of the reconstruction task while keeping the accuracy of the results. Compared to the classical AnEn method, the proposed variant is at least 15-times faster when processing is serial. Both approaches benefit from parallel processing, with the K-means variant also being always faster than the classic method under that execution regime (albeit its performance advantage diminishes as more CPU threads are used). |
publishDate |
2022 |
dc.date.none.fl_str_mv |
2022-09-14T14:14:36Z 2022 2022-01-01T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10198/25905 |
url |
http://hdl.handle.net/10198/25905 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
Balsa, Carlos; Rodrigues, Carlos Veiga; Araújo, Leonardo; Rufino, José (2022). Cluster-based analogue ensembles for hindcasting with multistations. Computations. EISSN 2079-3197. 10:6, p. 1-21 10.3390/computation10060091 2079-3197 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
MDPI |
publisher.none.fl_str_mv |
MDPI |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799135450253754368 |