five

Machine Learning-Based Retention Time Prediction Tool for Routine LC-MS Data Analysis

收藏
Figshare2025-07-16 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Machine_Learning-Based_Retention_Time_Prediction_Tool_for_Routine_LC-MS_Data_Analysis/29582197
下载链接
链接失效反馈
官方服务:
资源简介:
Accurate retention time (RT) prediction models can significantly improve liquid chromatography–mass spectrometry (LC-MS) data analysis widely used in chemical synthesis. As hundreds of thousands of syntheses are performed annually at Enamine, a large amount of experimental data has been generated internally. In this paper, we present the development of an RT prediction model based on the GATv2Conv + DL graph neural network (NN) architecture, trained on the internal data and further evaluated using the METLIN SMRT data set. The final model achieved a mean absolute error (MAE) of 2.48 s for the 120 s LC-MS method. We also conducted a detailed analysis of RT prediction errors and determined that the interval between RT – 7.12 s and RT + 9.58 s contained over 95% of the data. The developed model has been successfully integrated into the existing in-house LC-MS analysis toolkit, enhancing its predictive and analytical capabilities. Additionally, we have published a curated subset of 20,000 data points from our internal data set to support community benchmarking and further research.
创建时间:
2025-07-16
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作