five

DuReS: An R Package for Denoising Experimental Tandem Mass Spectra and Metabolite Annotation

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://figshare.com/articles/dataset/DuReS_An_R_Package_for_Denoising_Experimental_Tandem_Mass_Spectra_and_Metabolite_Annotation/29255780
下载链接
链接失效反馈
官方服务:
资源简介:
Mass spectrometry-based untargeted metabolomics is a powerful technique for profiling small molecules in biological samples, yet accurate metabolite identification remains challenging. The presence of random noise peaks in tandem mass spectra can lead to false annotations and necessitate time-consuming manual verification. A common method for removing noise from mass spectra is intensity thresholding, where low-intensity peaks are discarded by applying a user-defined cutoff. However, determining an optimal threshold is often data set-specific and may still retain many noisy peaks. We hypothesize that true signal peaks consistently recur across replicate tandem spectra generated from the same precursor ion, unlike random noise. Here, we present a freely available R package, Denoising Using Replicate Spectra (DuReS) (https://github.com/BiosystemEngineeringLab-IITB/dures), which accepts mzML files and feature lists and returns high-quality annotations and denoised mzML files, enabling users to integrate the denoising pipeline into their workflow seamlessly. This package is designed for data-dependent acquisition mode (DDA) data. It has (i) the main denoising module and (i) an optional tuning module to determine each data set’s optimal recurrence frequency cutoff (Fthreshold), considering variations in the intrinsic noise characteristics. We tested the tool on eight representative data sets selected from those available in metabolomics repositories. Our approach minimizes signal loss while maximizing noise reduction, effectively preserving diagnostically significant low-intensity fragments that would otherwise be lost through conventional intensity thresholding. This improves spectral matching metrics, leading to more accurate annotations and fewer false positives.
创建时间:
2025-06-06
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作