Multivariate time series clustering and forecasting for building energy analysis: Application to weather data quality control
Autor(a) principal: | |
---|---|
Data de Publicação: | 2021 |
Outros Autores: | , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10400.22/17149 |
Resumo: | In recent years, several tools for building energy analysis and simulation have been developed to assist in increasing building energy performance, harvesting its computing capabilities for a reliable and accurate energy performance prediction. To perform this analysis, energy tools typically require crucial data regarding the building's surrounding environment, which is acquired from neighbouring weather stations. However, these stations often experience hardware malfunctions, resulting in either erroneous or missing data. Traditionally, these values are rectified through empirical and geostatistical methods, which, while reflecting several decades of practice, may prove to be inadequate when considering a purely data-driven approach. To this end, the present study introduces a machine learning methodology proposing the application of regression algorithms to rectify the erroneous values in datasets, and the clustering of weather stations, based on their recorded climatic conditions, to enhance the regression models. A shape-based approach for clustering time series of different climatic parameters and weather stations is pursued, using the k-medoids algorithm alongside dynamic time warping as the similarity measure. Both Artificial Neural Networks (ANN) and Support Vector Regression (SVR) models are evaluated as exemplary regression algorithms, with different sets of predictors. Mean Squared Error is used as the performance metric. A data set of different climatic parameters from southeastern Brazil was used, with air temperature being chosen as the response variable, given its importance in energy consumption. Results indicate that a machine learning approach to the problem is indeed viable. ANN slightly outperforms SVR in the prediction of the studied weather variable.Building energy analysis |
id |
RCAP_877520e6860f375cd7d13903a9899010 |
---|---|
oai_identifier_str |
oai:recipp.ipp.pt:10400.22/17149 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Multivariate time series clustering and forecasting for building energy analysis: Application to weather data quality controlBuilding energy analysisWeather data quality controlTime series clusteringArtificial neural networksSupport vector regressionIn recent years, several tools for building energy analysis and simulation have been developed to assist in increasing building energy performance, harvesting its computing capabilities for a reliable and accurate energy performance prediction. To perform this analysis, energy tools typically require crucial data regarding the building's surrounding environment, which is acquired from neighbouring weather stations. However, these stations often experience hardware malfunctions, resulting in either erroneous or missing data. Traditionally, these values are rectified through empirical and geostatistical methods, which, while reflecting several decades of practice, may prove to be inadequate when considering a purely data-driven approach. To this end, the present study introduces a machine learning methodology proposing the application of regression algorithms to rectify the erroneous values in datasets, and the clustering of weather stations, based on their recorded climatic conditions, to enhance the regression models. A shape-based approach for clustering time series of different climatic parameters and weather stations is pursued, using the k-medoids algorithm alongside dynamic time warping as the similarity measure. Both Artificial Neural Networks (ANN) and Support Vector Regression (SVR) models are evaluated as exemplary regression algorithms, with different sets of predictors. Mean Squared Error is used as the performance metric. A data set of different climatic parameters from southeastern Brazil was used, with air temperature being chosen as the response variable, given its importance in energy consumption. Results indicate that a machine learning approach to the problem is indeed viable. ANN slightly outperforms SVR in the prediction of the studied weather variable.Building energy analysisThis work was partially financially supported by UID/ECI/04708/ 2019 – CONSTRUCT –Instituto de I&D em Estruturas e Construções and UIDB/04234/2020 – CISTER Research Unit, both funded by national funds through the FCT/MCTES (PIDDAC).ElsevierRepositório Científico do Instituto Politécnico do PortoSanhudo, LuísCoelho Rodrigues, João ManuelVasconcelos Filho, Ênio20212120-01-01T00:00:00Z2021-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10400.22/17149eng2352-710210.1016/j.jobe.2020.101996metadata only accessinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-03-13T13:06:26Zoai:recipp.ipp.pt:10400.22/17149Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T17:36:46.730404Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Multivariate time series clustering and forecasting for building energy analysis: Application to weather data quality control |
title |
Multivariate time series clustering and forecasting for building energy analysis: Application to weather data quality control |
spellingShingle |
Multivariate time series clustering and forecasting for building energy analysis: Application to weather data quality control Sanhudo, Luís Building energy analysis Weather data quality control Time series clustering Artificial neural networks Support vector regression |
title_short |
Multivariate time series clustering and forecasting for building energy analysis: Application to weather data quality control |
title_full |
Multivariate time series clustering and forecasting for building energy analysis: Application to weather data quality control |
title_fullStr |
Multivariate time series clustering and forecasting for building energy analysis: Application to weather data quality control |
title_full_unstemmed |
Multivariate time series clustering and forecasting for building energy analysis: Application to weather data quality control |
title_sort |
Multivariate time series clustering and forecasting for building energy analysis: Application to weather data quality control |
author |
Sanhudo, Luís |
author_facet |
Sanhudo, Luís Coelho Rodrigues, João Manuel Vasconcelos Filho, Ênio |
author_role |
author |
author2 |
Coelho Rodrigues, João Manuel Vasconcelos Filho, Ênio |
author2_role |
author author |
dc.contributor.none.fl_str_mv |
Repositório Científico do Instituto Politécnico do Porto |
dc.contributor.author.fl_str_mv |
Sanhudo, Luís Coelho Rodrigues, João Manuel Vasconcelos Filho, Ênio |
dc.subject.por.fl_str_mv |
Building energy analysis Weather data quality control Time series clustering Artificial neural networks Support vector regression |
topic |
Building energy analysis Weather data quality control Time series clustering Artificial neural networks Support vector regression |
description |
In recent years, several tools for building energy analysis and simulation have been developed to assist in increasing building energy performance, harvesting its computing capabilities for a reliable and accurate energy performance prediction. To perform this analysis, energy tools typically require crucial data regarding the building's surrounding environment, which is acquired from neighbouring weather stations. However, these stations often experience hardware malfunctions, resulting in either erroneous or missing data. Traditionally, these values are rectified through empirical and geostatistical methods, which, while reflecting several decades of practice, may prove to be inadequate when considering a purely data-driven approach. To this end, the present study introduces a machine learning methodology proposing the application of regression algorithms to rectify the erroneous values in datasets, and the clustering of weather stations, based on their recorded climatic conditions, to enhance the regression models. A shape-based approach for clustering time series of different climatic parameters and weather stations is pursued, using the k-medoids algorithm alongside dynamic time warping as the similarity measure. Both Artificial Neural Networks (ANN) and Support Vector Regression (SVR) models are evaluated as exemplary regression algorithms, with different sets of predictors. Mean Squared Error is used as the performance metric. A data set of different climatic parameters from southeastern Brazil was used, with air temperature being chosen as the response variable, given its importance in energy consumption. Results indicate that a machine learning approach to the problem is indeed viable. ANN slightly outperforms SVR in the prediction of the studied weather variable.Building energy analysis |
publishDate |
2021 |
dc.date.none.fl_str_mv |
2021 2021-01-01T00:00:00Z 2120-01-01T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10400.22/17149 |
url |
http://hdl.handle.net/10400.22/17149 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
2352-7102 10.1016/j.jobe.2020.101996 |
dc.rights.driver.fl_str_mv |
metadata only access info:eu-repo/semantics/openAccess |
rights_invalid_str_mv |
metadata only access |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Elsevier |
publisher.none.fl_str_mv |
Elsevier |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1817554332839575552 |