Robust and automatic data cleansing method for short-term load forecasting of distribution feeders
Article
Huyghues-Beaufond, N., Tindemans, S., Falugi, P., Sun, M. and Strbac, G. 2020. Robust and automatic data cleansing method for short-term load forecasting of distribution feeders. Applied Energy. 261 (Art. 114405). https://doi.org/10.1016/j.apenergy.2019.114405
Authors | Huyghues-Beaufond, N., Tindemans, S., Falugi, P., Sun, M. and Strbac, G. |
---|---|
Abstract | Distribution networks are undergoing fundamental changes at medium voltage level. To support growing planning and control decision-making, the need for large numbers of short-term load forecasts has emerged. Data-driven modelling of medium voltage feeders can be affected by (1) data quality issues, namely, large gross errors and missing observations (2) the presence of structural breaks in the data due to occasional network reconfiguration and load transfers. The present work investigates and reports on the effects of advanced data cleansing techniques on forecast accuracy. A hybrid framework to detect and remove outliers in large datasets is proposed; this automatic procedure combines the Tukey labelling rule and the binary segmentation algorithm to cleanse data more efficiently, it is fast and easy to implement. Various approaches for missing value imputation are investigated, including unconditional mean, Hot Deck via k-nearest neighbour and Kalman smoothing. A combination of the automatic detection/removal of outliers and the imputation methods mentioned above are implemented to cleanse time series of 342 medium-voltage feeders. A nested rolling-origin-validation technique is used to evaluate the feed-forward deep neural network models. The proposed data cleansing framework efficiently removes outliers from the data, and the accuracy of forecasts is improved. It is found that Hot Deck (k-NN) imputation performs best in balancing the bias-variance trade-off for short-term forecasting. |
Journal | Applied Energy |
Journal citation | 261 (Art. 114405) |
ISSN | 0306-2619 |
Year | 2020 |
Publisher | Elsevier |
Accepted author manuscript | License File Access Level Anyone |
Digital Object Identifier (DOI) | https://doi.org/10.1016/j.apenergy.2019.114405 |
Publication dates | |
Online | 07 Jan 2020 |
07 Jan 2020 | |
Publication process dates | |
Accepted | 14 Dec 2019 |
Deposited | 21 Aug 2023 |
Copyright holder | © 2020, The Author(s) |
https://repository.uel.ac.uk/item/8w422
Download files
90
total views99
total downloads9
views this month5
downloads this month