Multivariate time series clustering and forecasting for building energy analysis: Application to weather data quality control

Detalhes bibliográficos
Autor(a) principal: Sanhudo, Luís
Data de Publicação: 2021
Outros Autores: Coelho Rodrigues, João Manuel, Vasconcelos Filho, Ênio
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10400.22/17149
Resumo: In recent years, several tools for building energy analysis and simulation have been developed to assist in increasing building energy performance, harvesting its computing capabilities for a reliable and accurate energy performance prediction. To perform this analysis, energy tools typically require crucial data regarding the building's surrounding environment, which is acquired from neighbouring weather stations. However, these stations often experience hardware malfunctions, resulting in either erroneous or missing data. Traditionally, these values are rectified through empirical and geostatistical methods, which, while reflecting several decades of practice, may prove to be inadequate when considering a purely data-driven approach. To this end, the present study introduces a machine learning methodology proposing the application of regression algorithms to rectify the erroneous values in datasets, and the clustering of weather stations, based on their recorded climatic conditions, to enhance the regression models. A shape-based approach for clustering time series of different climatic parameters and weather stations is pursued, using the k-medoids algorithm alongside dynamic time warping as the similarity measure. Both Artificial Neural Networks (ANN) and Support Vector Regression (SVR) models are evaluated as exemplary regression algorithms, with different sets of predictors. Mean Squared Error is used as the performance metric. A data set of different climatic parameters from southeastern Brazil was used, with air temperature being chosen as the response variable, given its importance in energy consumption. Results indicate that a machine learning approach to the problem is indeed viable. ANN slightly outperforms SVR in the prediction of the studied weather variable.Building energy analysis
id RCAP_877520e6860f375cd7d13903a9899010
oai_identifier_str oai:recipp.ipp.pt:10400.22/17149
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Multivariate time series clustering and forecasting for building energy analysis: Application to weather data quality controlBuilding energy analysisWeather data quality controlTime series clusteringArtificial neural networksSupport vector regressionIn recent years, several tools for building energy analysis and simulation have been developed to assist in increasing building energy performance, harvesting its computing capabilities for a reliable and accurate energy performance prediction. To perform this analysis, energy tools typically require crucial data regarding the building's surrounding environment, which is acquired from neighbouring weather stations. However, these stations often experience hardware malfunctions, resulting in either erroneous or missing data. Traditionally, these values are rectified through empirical and geostatistical methods, which, while reflecting several decades of practice, may prove to be inadequate when considering a purely data-driven approach. To this end, the present study introduces a machine learning methodology proposing the application of regression algorithms to rectify the erroneous values in datasets, and the clustering of weather stations, based on their recorded climatic conditions, to enhance the regression models. A shape-based approach for clustering time series of different climatic parameters and weather stations is pursued, using the k-medoids algorithm alongside dynamic time warping as the similarity measure. Both Artificial Neural Networks (ANN) and Support Vector Regression (SVR) models are evaluated as exemplary regression algorithms, with different sets of predictors. Mean Squared Error is used as the performance metric. A data set of different climatic parameters from southeastern Brazil was used, with air temperature being chosen as the response variable, given its importance in energy consumption. Results indicate that a machine learning approach to the problem is indeed viable. ANN slightly outperforms SVR in the prediction of the studied weather variable.Building energy analysisThis work was partially financially supported by UID/ECI/04708/ 2019 – CONSTRUCT –Instituto de I&D em Estruturas e Construções and UIDB/04234/2020 – CISTER Research Unit, both funded by national funds through the FCT/MCTES (PIDDAC).ElsevierRepositório Científico do Instituto Politécnico do PortoSanhudo, LuísCoelho Rodrigues, João ManuelVasconcelos Filho, Ênio20212120-01-01T00:00:00Z2021-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10400.22/17149eng2352-710210.1016/j.jobe.2020.101996metadata only accessinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-03-13T13:06:26Zoai:recipp.ipp.pt:10400.22/17149Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T17:36:46.730404Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Multivariate time series clustering and forecasting for building energy analysis: Application to weather data quality control
title Multivariate time series clustering and forecasting for building energy analysis: Application to weather data quality control
spellingShingle Multivariate time series clustering and forecasting for building energy analysis: Application to weather data quality control
Sanhudo, Luís
Building energy analysis
Weather data quality control
Time series clustering
Artificial neural networks
Support vector regression
title_short Multivariate time series clustering and forecasting for building energy analysis: Application to weather data quality control
title_full Multivariate time series clustering and forecasting for building energy analysis: Application to weather data quality control
title_fullStr Multivariate time series clustering and forecasting for building energy analysis: Application to weather data quality control
title_full_unstemmed Multivariate time series clustering and forecasting for building energy analysis: Application to weather data quality control
title_sort Multivariate time series clustering and forecasting for building energy analysis: Application to weather data quality control
author Sanhudo, Luís
author_facet Sanhudo, Luís
Coelho Rodrigues, João Manuel
Vasconcelos Filho, Ênio
author_role author
author2 Coelho Rodrigues, João Manuel
Vasconcelos Filho, Ênio
author2_role author
author
dc.contributor.none.fl_str_mv Repositório Científico do Instituto Politécnico do Porto
dc.contributor.author.fl_str_mv Sanhudo, Luís
Coelho Rodrigues, João Manuel
Vasconcelos Filho, Ênio
dc.subject.por.fl_str_mv Building energy analysis
Weather data quality control
Time series clustering
Artificial neural networks
Support vector regression
topic Building energy analysis
Weather data quality control
Time series clustering
Artificial neural networks
Support vector regression
description In recent years, several tools for building energy analysis and simulation have been developed to assist in increasing building energy performance, harvesting its computing capabilities for a reliable and accurate energy performance prediction. To perform this analysis, energy tools typically require crucial data regarding the building's surrounding environment, which is acquired from neighbouring weather stations. However, these stations often experience hardware malfunctions, resulting in either erroneous or missing data. Traditionally, these values are rectified through empirical and geostatistical methods, which, while reflecting several decades of practice, may prove to be inadequate when considering a purely data-driven approach. To this end, the present study introduces a machine learning methodology proposing the application of regression algorithms to rectify the erroneous values in datasets, and the clustering of weather stations, based on their recorded climatic conditions, to enhance the regression models. A shape-based approach for clustering time series of different climatic parameters and weather stations is pursued, using the k-medoids algorithm alongside dynamic time warping as the similarity measure. Both Artificial Neural Networks (ANN) and Support Vector Regression (SVR) models are evaluated as exemplary regression algorithms, with different sets of predictors. Mean Squared Error is used as the performance metric. A data set of different climatic parameters from southeastern Brazil was used, with air temperature being chosen as the response variable, given its importance in energy consumption. Results indicate that a machine learning approach to the problem is indeed viable. ANN slightly outperforms SVR in the prediction of the studied weather variable.Building energy analysis
publishDate 2021
dc.date.none.fl_str_mv 2021
2021-01-01T00:00:00Z
2120-01-01T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10400.22/17149
url http://hdl.handle.net/10400.22/17149
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 2352-7102
10.1016/j.jobe.2020.101996
dc.rights.driver.fl_str_mv metadata only access
info:eu-repo/semantics/openAccess
rights_invalid_str_mv metadata only access
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Elsevier
publisher.none.fl_str_mv Elsevier
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1817554332839575552