
Automated Metadata Extraction from mzML Files with RunAssessor

NIAID Data Ecosystem, indexed 2026-05-10
Download link:
https://figshare.com/articles/dataset/Automated_Metadata_Extraction_from_mzML_Files_with_RunAssessor/32034001
Description:
The reusability of proteomics data sets depends on the ability to obtain accurate metadata to guide reprocessing pipelines. However, many data sets deposited in public data repositories lack sufficient and reliable annotation, limiting large-scale reanalyses. To address this challenge, we developed RunAssessor, a tool that systematically extracts and summarizes sample preparation and instrument acquisition parameters directly from mass spectrometry data files, where possible, prior to peptide identification analysis. Using one complete data set and test files from 18 other data sets as examples, we demonstrate RunAssessor’s ability to extract instrument models, isobaric labels, phosphoenrichment, and precursor and fragment ion tolerances, along with the dynamic exclusion time used by the instrument. These extracted metadata are stored in a comprehensive output file and summarized in a standard Sample and Data Relationship Format (SDRF) file. This reduces the burden of manual curation, improves the reliability of proteomics data set metadata, and facilitates the reuse of public data.
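
To illustrate what reading metadata directly from a data file can look like, here is a minimal sketch in Python that lists the controlled-vocabulary terms attached to each instrument configuration in an mzML header. It is not RunAssessor's implementation or API: the function name `instrument_cv_params` is hypothetical, only the standard library is used, and the sketch assumes the usual mzML layout in which `referenceableParamGroup` and `instrumentConfiguration` elements appear before `run`.

```python
import xml.etree.ElementTree as ET

# Namespace used by mzML documents.
MZML_NS = "{http://psi.hupo.org/ms/mzml}"


def instrument_cv_params(source):
    """Map each instrumentConfiguration id to its (accession, name) cvParams.

    `source` is a file path or a binary file object. Only the header is read:
    parsing stops when the <run> element (which holds the spectra) begins.
    Hypothetical helper for illustration, not part of RunAssessor.
    """
    groups = {}   # referenceableParamGroup id -> list of (accession, name)
    configs = {}  # instrumentConfiguration id -> list of (accession, name)

    for event, elem in ET.iterparse(source, events=("start", "end")):
        if event == "start":
            if elem.tag == MZML_NS + "run":
                break  # header is complete; skip the spectra
            continue

        if elem.tag == MZML_NS + "referenceableParamGroup":
            groups[elem.get("id")] = [
                (p.get("accession"), p.get("name"))
                for p in elem.findall(MZML_NS + "cvParam")
            ]
        elif elem.tag == MZML_NS + "instrumentConfiguration":
            # cvParams that are direct children of the configuration ...
            params = [
                (p.get("accession"), p.get("name"))
                for p in elem.findall(MZML_NS + "cvParam")
            ]
            # ... plus those pulled in through a referenced parameter group.
            for ref in elem.findall(MZML_NS + "referenceableParamGroupRef"):
                params.extend(groups.get(ref.get("ref"), []))
            configs[elem.get("id")] = params

    return configs
```

Calling `instrument_cv_params("example.mzML")` (the file name is a placeholder) returns a dictionary keyed by configuration id. Deciding which of the returned terms names the instrument model would additionally require a lookup against the PSI-MS controlled vocabulary, which this sketch leaves out.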
Created:
2026-04-16