Implementing Annotation Confidence Scoring in Untargeted Mass Spectrometry Workflows for Small Molecule Analysis
收藏Figshare2026-04-01 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Implementing_Annotation_Confidence_Scoring_in_Untargeted_Mass_Spectrometry_Workflows_for_Small_Molecule_Analysis/31915206
下载链接
链接失效反馈官方服务:
资源简介:
Untargeted small molecule analysis by high-resolution mass spectrometry is integral to environmental and biological research, enabling comprehensive characterization of complex samples. However, data interpretation and reporting remain challenging due to the complexity and high dimensionality of molecular features in untargeted data sets. Current data analysis platforms provide integrated tools for processing and annotation yet lack a standardized framework for assigning and reporting annotation confidence. Communicating varying levels of confidence in untargeted data sets continues to pose challenges without an automated ranking system. To address this, a custom scripting node was developed that assigns annotation confidence levels based on the widely adopted Schymanski et al. scoring scheme. While implemented here for metabolomics, the scoring approach is broadly applicable to other untargeted small molecule workflows. The script can be incorporated into a commercial data analysis software package and functions as a standalone postprocessing node, expanding the original five-level system with four new sublevels (levels 3a/3b and 4a/4b) to improve specificity and distinguish cases that fall between established categories. Annotation confidence is assessed using available information from all compound identification workflow nodes (e.g., Predicted Composition, mzVault, mzCloud, ChemSpider), and the consensus scoring algorithm is used to evaluate agreement among search nodes for greater accuracy. Validation using NIST SRM 1950 plasma samples demonstrated 100% scoring accuracy in negative mode and >99.5% in positive mode across both RPLC and HILIC separations. This tool enhances data reporting, improves transparency, and promotes consistency across studies, facilitating standardization and comparability of untargeted metabolomics results.
创建时间:
2026-04-01



