five

Model18

收藏
DataCite Commons2020-11-06 更新2025-04-16 收录
下载链接:
https://ieee-dataport.org/analysis/model18
下载链接
链接失效反馈
官方服务:
资源简介:
Data preparation – We started with the aggregation of the energy consumption data at a daily level (originally given at a half-hourly interval). Missing values were observed in consumption and temperature metadata, which we imputed using daily level linear interpolation.Next was feature engineering which was essential to bring out the best accuracy in the model.There are mainly three features we have tried to work with:a) Average weather data, Three lag variable for average weather data (day1, day2, day3 lag)b) Time-Series features created using date1. Month (1,2,3,…..12) and2. Weekday (Mon, Tue,……Sun)3. WeekofYear(1,2,…52)c) Cyclic encoding of periodic features for the month and weekday1. Month gets mapped to Month_x = sin(2*pi*month/12) and Month_y=cos(2*pi*month/12)2. Weekday gets mapped to weekday_x = sin(2*pi*weekday/6) and weekday_y=cos(2*pi*weekday /6).Post multiple iterations, we settled with the XGBoost model since it performed the best among all. We trained the model on individual meter_ID and forecasted energy consumption at a daily level. The outcome was then rolled-up, to a monthly level to get our final submission.Update- we identified those customers where our model was not performing well and used the Moving average to forecast for these customers.
提供机构:
IEEE DataPort
创建时间:
2020-11-06
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作