Model4
收藏IEEE2020-11-01 更新2026-04-17 收录
下载链接:
https://ieee-dataport.org/analysis/model4-2
下载链接
链接失效反馈官方服务:
资源简介:
We have followed the three-step approach to tackle this challenge.Data preparation – We started with the aggregation of the energy consumption data at a daily level (originally given at a half-hourly interval). Missing values were observed in temperature metadata, which we imputed using daily level linear interpolation.We also performed fuzzy clustering of time series using December daily consumption data then daily consumption data for the whole year for missing months was imputed using cluster's average consumption and probability of meter-id to fall in each cluster and took summation.Next was feature engineering which was essential to bring out the best accuracy in the model.There are mainly three features we have tried to work with:a) Average weather data – used as is (we have used avg. weather data only)b) Time Series features created using date1. Month (1,2,3,…..12) and2. Weekday (Mon, Tue,……Sun)3. WeekofYear(1,2,…52)c) Cyclic encoding of periodic features for the month and weekday1. Month gets mapped to Month_x = sin(2*pi*month/12) and Month_y=cos(2*pi*month/12)2. Weekday gets mapped to weekday_x = sin(2*pi*weekday/6) and weekday_y=cos(2*pi*weekday /6).Post multiple iterations, we settled with the XgBoost model since it performed the best among all. We trained the model on individual meter_ID and forecasted energy consumption at a daily level. The outcome was then rolled-up, to a monthly level to get our final submission.Update- We hyper-tuned the existing model to improve the overall accuracy.
创建时间:
2020-11-01



