five

Implementing Annotation Confidence Scoring in Untargeted Mass Spectrometry Workflows for Small Molecule Analysis

收藏
Figshare2026-04-01 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Implementing_Annotation_Confidence_Scoring_in_Untargeted_Mass_Spectrometry_Workflows_for_Small_Molecule_Analysis/31915206
下载链接
链接失效反馈
官方服务:
资源简介:
Untargeted small molecule analysis by high-resolution mass spectrometry is integral to environmental and biological research, enabling comprehensive characterization of complex samples. However, data interpretation and reporting remain challenging due to the complexity and high dimensionality of molecular features in untargeted data sets. Current data analysis platforms provide integrated tools for processing and annotation yet lack a standardized framework for assigning and reporting annotation confidence. Communicating varying levels of confidence in untargeted data sets continues to pose challenges without an automated ranking system. To address this, a custom scripting node was developed that assigns annotation confidence levels based on the widely adopted Schymanski et al. scoring scheme. While implemented here for metabolomics, the scoring approach is broadly applicable to other untargeted small molecule workflows. The script can be incorporated into a commercial data analysis software package and functions as a standalone postprocessing node, expanding the original five-level system with four new sublevels (levels 3a/3b and 4a/4b) to improve specificity and distinguish cases that fall between established categories. Annotation confidence is assessed using available information from all compound identification workflow nodes (e.g., Predicted Composition, mzVault, mzCloud, ChemSpider), and the consensus scoring algorithm is used to evaluate agreement among search nodes for greater accuracy. Validation using NIST SRM 1950 plasma samples demonstrated 100% scoring accuracy in negative mode and >99.5% in positive mode across both RPLC and HILIC separations. This tool enhances data reporting, improves transparency, and promotes consistency across studies, facilitating standardization and comparability of untargeted metabolomics results.
创建时间:
2026-04-01
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作