five

Preprocessed Spatial Dynamic Wind Power Forecasting (SDWPF) Dataset from Baidu KDD Cup 2022 Challenge

收藏
Figshare2025-12-04 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Preprocessed_Spatial_Dynamic_Wind_Power_Forecasting_SDWPF_Dataset_from_Baidu_KDD_Cup_2022_Challenge/30787586
下载链接
链接失效反馈
官方服务:
资源简介:
This repository contains a preprocessed dataset derived from the Spatial Dynamic Wind Power Forecasting (SDWPF) challenge, hosted by Baidu KDD Cup 2022. The original challenge aimed to facilitate the progress of data-driven machine learning methods for wind power forecasting.The raw data underwent a comprehensive preprocessing pipeline to enhance its quality and applicability for machine learning tasks. Key preprocessing steps include:1. **Basic Preprocessing:** Handling invalid data (e.g., negative power, zero power with high wind speed, abnormal pitch/nacelle angles), filling missing values (bfill then ffill), and engineering time-related features (Day, Hour, Minute) from 'Tmstamp'. A new feature 'Pab_max' (maximum of Pab1, Pab2, Pab3) was created, and original pitch angle columns were dropped.2. **Wavelet Denoising:** Application of Discrete Wavelet Transform (db4 wavelet, level 1, soft thresholding) to reduce noise and highlight underlying trends in the time-series data.3. **Standardization:** Independent standardization of features and the target variable ('Patv') using StandardScaler, with the scalers saved for consistent future use.The original dataset fields included: TurbID, Day, Tmstamp, Wspd, Wdir, Etmp, Itmp, Ndir, Pab1, Pab2, Pab3, Prtv, Patv.The final features used for model input after preprocessing are: Pab_max, Wspd, Wdir, Etmp, Itmp, Ndir, Prtv, Patv, Day, Hour, Minute.This processed dataset is suitable for research and development in wind power forecasting models, offering a clean and standardized input for machine learning algorithms.
创建时间:
2025-12-04
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作