five

Wind Turbine SCADA Data For Early Fault Detection

收藏
DataCite Commons2026-02-24 更新2026-05-05 收录
下载链接:
https://ldm.kisski.de/dataset/ec72bdf3-c16d-4bbf-865b-8f35157f4ab0
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset is published together with the [paper](https://doi.org/10.3390/data9120138) "CARE to Compare: A real-world dataset for anomaly detection in wind turbine data" which explains the dataset in detail and defines the CARE score that can be used to evaluate anomaly detection algorithms on this dataset. When referring to this dataset, please cite the paper mentioned in the related work section. The data consists of 95 datasets, containing 89 years of SCADA time series distributed across 36 different wind turbines from the three wind farms A, B and C. The number of features depends on the wind farm; Wind farm A has 86 features, wind farm B has 257 features and wind farm C has 957 features. The overall dataset is balanced, as 45 out the 95 datasets contain a labeled anomaly event that leads up to a turbine fault and the other 50 datasets represent normal behavior. Additionally, the quality of training data is ensured by turbine-status-based labels for each data point and further information about some of the given turbine faults are included. The data for Wind farm A is based on data from the EDP open data platform (https://www.edp.com/en/innovation/open-data/data), and consists of 5 wind turbines of an onshore wind farm in Portugal. It contains SCADA data and information derived by a given fault logbook which defines start timestamps for specified faults. From this data 22 datasets were selected to be included in this data collection. The other two wind farms are offshore wind farms located in Germany. All three datasets were anonymized due to confidentiality reasons for the wind farms B and C. Each dataset is provided in form of a csv-file with columns defining the features and rows representing the data points of the time series. Files More detailed information can be found in the included README-file. **Notes** In wind farm A status_type_id labels can be ignored while evaluating prediction time frames of error events with metrics like the CARE-score since the status_type_id is of wind farm A is based on the EDP failure logbook and it is intended to be used for filtering of the training data.
提供机构:
TIB
创建时间:
2026-02-24
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作