five

Wind Turbine Accident News (1980-2013)

收藏
Mendeley Data2024-03-27 更新2024-06-26 收录
下载链接:
https://data.mendeley.com/datasets/jkjvmn9tz3
下载链接
链接失效反馈
官方服务:
资源简介:
This data sets includes 216 news on 240 wind turbine accidents between the years 1980 and 2013. The analysis of this data set and the insights obtained are reported in the following research paper: Asian, S., Ertek, G., Haksoz, C., Pakter, S. and Ulun, S., 2017. Wind turbine accidents: A data mining study. IEEE Systems Journal, 11(3), pp.1567-1578. As of now, the most extensive data available on the Internet on wind turbines accidents is published by the Caithness Windfarm Information Forum (CWIF), a UK-based grassroots organization opposing wind turbine installations. While the Caithness list is impressive in magnitude, the quality and reliability of the list is open to discussion because of the following reason: * Many of the web links to the news sources are not valid, and some of the accidents appear in multiple lines of the data. In spite of containing much more magnitude of data, the data available in other online sources also exhibit similar deficiencies. So, there are problems when it comes to using the Caithness data or other data in research studies. To this end, we collected data on wind turbine accidents ourselves, also using the data from Caithness and we share our collected data on this page (please click the link at the top of the page to download the data). The data we collected consists of three folders, and a MS Excel file. The folder News.txt contains the accident news, with each news in a separate text file: The folder News.doc contains news, with each news in a separate MS Word file: Finally, the folder News.doc.with.notes contains news, with each news in a separate MS Word file, but with extensive comments, explaining how the database in the MS Excel file was constructed: The MS Excel file News.Database.xlsx contains the structured data created based on the detailed reading of the accident news text: The MS Excel file is the file that was analyzed in our research paper.

本数据集收录了1980年至2013年间240起风力涡轮机事故相关的216条新闻报道。本数据集的分析过程与所得结论已发表于下述学术论文:Asian, S.、Ertek, G.、Haksoz, C.、Pakter, S.及Ulun, S.于2017年发表于《IEEE系统期刊(IEEE Systems Journal)》第11卷第3期的《风力涡轮机事故:一项数据挖掘研究》,页码范围为1567至1578。 截至目前,互联网上可获取的规模最大的风力涡轮机事故数据集由凯斯内斯风电场信息论坛(Caithness Windfarm Information Forum, CWIF)发布,该组织是英国本土反对风力涡轮机安装的草根公益机构。尽管凯斯内斯数据集的规模可观,但其数据质量与可靠性仍存在争议,原因如下: - 多数指向新闻来源的网页链接已失效,且部分事故条目在数据集中重复出现。 其他在线数据源即便数据规模更大,也存在类似的缺陷。因此,在研究中使用凯斯内斯数据集或其他公开在线数据源时,均存在一定问题。 有鉴于此,我们在参考凯斯内斯数据集的基础上,自主收集了风力涡轮机事故相关数据,并将所采集的数据集发布于本页面(点击页面顶部的链接即可下载该数据集)。本次发布的数据集包含3个文件夹与1个Microsoft Excel(MS Excel)文件: 其中,`News.txt`文件夹存储事故新闻的纯文本版本,每条新闻对应一个独立的文本文件;`News.doc`文件夹存储事故新闻的Microsoft Word(MS Word)版本,每条新闻对应一个独立的Word文档;最后,`News.doc.with.notes`文件夹同样存储每条新闻对应的独立Word文档,但附加了详细注释,用于说明本研究中Excel格式数据库的构建过程。 `News.Database.xlsx`为结构化数据文件,其内容基于对事故新闻文本的逐篇精读整理而成,该Excel文件正是我们在上述学术论文中进行分析的数据源。
创建时间:
2024-01-23
搜集汇总
数据集介绍
main_image_url
背景与挑战
背景概述
该数据集整理了1980-2013年间全球240起风力涡轮机事故的216条新闻报道,包含文本、Word和Excel三种格式文件,主要用于风力涡轮机安全性和可靠性的数据挖掘研究。数据集特别针对现有公开数据(如CWIF)的链接失效和重复问题进行了清理和结构化处理。
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作