five

Replication data for: Assessment of Data-Driven Techniques for Flow Rate Predictions in Sub-sea Oil Production

收藏
DataCite Commons2026-02-24 更新2026-04-25 收录
下载链接:
https://dataverse.no/citation?persistentId=doi:10.18710/KIJEWJ
下载链接
链接失效反馈
官方服务:
资源简介:
<p>The data set consists of simulated time‑series measurements from two gas‑lifted subsea oil wells, used to develop and evaluate data‑driven virtual flow metering (VFM) models for oil and gas flow rate prediction.</p> ​ <p>Purpose: To assess a range of machine learning algorithms (10 methods, including LSTM, MLP, XGBoost, SVR, tree‑based and linear methods) for predicting multiphase flow rates in subsea oil production, and identify which give the lowest prediction error.</p> ​ <p>To study the impact of measurement noise, the effect of noise filtering (median filter), and the quantification of prediction uncertainty (via 95% confidence intervals in XGBoost) in a VFM context.</p> ​ <p>Scope: Two wells (Well 1 and Well 2) are considered, each represented by an open‑loop simulation model of a gas‑lifted oil well derived from Janatian et al. (2022).</p> ​ <p>For each well, 5 762 samples of process data are generated and split into 70% training and 30% test sets using a time‑series split; key input variables include bottom‑hole and wellhead pressures and temperatures plus choke opening, with oil and gas flow rates as targets.</p> ​ <p>The study covers the full workflow: data collection from the simulator, preprocessing (scaling, time‑series splitting, noise injection and filtering), model training and hyperparameter tuning, performance comparison via MAPE, and uncertainty quantification.</p> ​ <p>Nature of the data: Synthetic, model‑generated process data rather than field measurements: data come from a validated dynamic model of gas‑lifted wells, not directly from a physical asset.</p> ​ <p>Multivariate, time‑series data at sample‑level resolution, comprising sensor‑like inputs (pressures, temperatures, choke openings) and corresponding oil and gas flow rates over time for each well.</p> ​ <p>Used primarily as a benchmarking set for supervised learning: different regression algorithms are trained and tested on identical data to compare prediction accuracy, robustness to impulse noise, and the effect of noise reduction and uncertainty quantification techniques.</p>
提供机构:
DataverseNO
创建时间:
2026-02-21
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作