3W数据集 石油勘探中不良真实事件数据
收藏帕依提提2024-03-04 收录
下载链接:
https://www.payititi.com/opendatasets/show-25919.html
下载链接
链接失效反馈官方服务:
资源简介:
据作者所知,这是第一个现实和公开的数据集,具有油井中罕见的不良真实事件,可以作为基准数据集,用于开发与实际数据固有困难相关的机器学习技术。 关于该数据集背后的理论的更多信息,可在《石油科学与工程杂志》(Journal of Petroleum Science and Engineering)上发表的论文《油井中罕见不良真实事件的现实和公共数据集》中找到。本文定义并提出了从业人员和研究人员可以与3W数据集一起使用的具体挑战(基准)。 3W数据集由1984个CSV文件组成,其结构如下。由于GitHub的限制,此数据集保存在7z文件中,自动拆分并保存在数据目录中。在使用3W数据集之前,必须对其进行解压缩。然后,子目录名就是实例的标签。每个文件代表一个实例。文件名显示其来源。所有文件的标准化如下。每行一次观测,每列一次系列观测。列由逗号分隔,小数由句点分隔。第一列包含时间戳,最后一列显示观察值的标签,其他列是多变量时间序列(MTS)(即实例本身)。 3W数据集的文件在[Web link]中,但我们相信,在UCI机器学习存储库中发布3W数据集有利于机器学习社区。 Attribute Information: Pressure at the Permanent Downhole Gauge (PDG); Pressure at the Temperature and Pressure Transducer (TPT); Temperature at the TPT; Pressure upstream of the Production Choke (PCK); Temperature downstream of the PCK; Pressure downstream of the Gas Lift Choke (GLCK); Temperature downstream of the GLCK; Gas Lift flow. Relevant Papers: 'A realistic and public dataset with rare undesirable real events in oil wells' published in the Journal of Petroleum Science and Engineering ([Web link]). Citation Request: If you have no special citation requests, please leave this field blank.
To the best of our knowledge, this is the first realistic and public dataset containing rare undesirable real events in oil wells, which can serve as a benchmark dataset for developing machine learning techniques related to the inherent difficulties of real-world data. More information about the theory underlying this dataset can be found in the paper *A Realistic and Public Dataset with Rare Undesirable Real Events in Oil Wells* published in the *Journal of Petroleum Science and Engineering*. This paper defines and proposes specific benchmark challenges that practitioners and researchers can use alongside the 3W Dataset. The 3W Dataset consists of 1984 CSV files, structured as follows. Due to GitHub's storage restrictions, this dataset is stored in automatically split 7z files saved in the data directory. The 3W Dataset must be decompressed before use. The names of the subdirectories correspond to the labels of the instances. Each file represents one instance, and the filename indicates its source. All files are standardized as follows: one observation per row, one sequence of observations per column, with columns separated by commas and decimals separated by periods. The first column contains timestamps, the last column displays the label of the observation, and the remaining columns are multivariate time series (MTS), i.e., the instance itself. The files of the 3W Dataset are available at [Web link]; however, we believe that publishing the 3W Dataset in the UCI Machine Learning Repository will benefit the machine learning community. Attribute Information: Pressure at the Permanent Downhole Gauge (PDG); Pressure at the Temperature and Pressure Transducer (TPT); Temperature at the TPT; Pressure upstream of the Production Choke (PCK); Temperature downstream of the PCK; Pressure downstream of the Gas Lift Choke (GLCK); Temperature downstream of the GLCK; Gas Lift flow. Relevant Papers: 'A realistic and public dataset with rare undesirable real events in oil wells' published in the Journal of Petroleum Science and Engineering ([Web link]). Citation Request: If you have no special citation requests, please leave this field blank.
提供机构:
帕依提提
搜集汇总
数据集介绍

背景与挑战
背景概述
3W数据集是石油勘探领域首个公开的不良真实事件数据集,包含1984个标准化CSV文件,记录8种油井传感器数据,专门用于开发处理实际数据困难的机器学习技术。数据集具有明确的多变量时间序列结构和学术论文支持,可作为行业基准。
以上内容由遇见数据集搜集并总结生成



