Unassigned Mass Spectrometry Data for Machine Learning
收藏DataCite Commons2025-05-01 更新2025-05-17 收录
下载链接:
https://data.mendeley.com/datasets/ycw25mpjb6
下载链接
链接失效反馈官方服务:
资源简介:
The development of machine learning methods for the intelligent analysis of raw mass spec-trometric HPLC-MS/MS measurements without data preprocessing and identification seems promising. In this study, we tested the application of neural networks of two types, 1D-Residual CNN and 3D-CNN, to the combined metabolomic and proteomic HPLC-MS/MS data for the classification of three cancer phenotypes. Both neural networks are capable of classifying as gender-mixed oncological phenotypes (kidney cancer) as gender-specific phenotypes (ovarian cancer) and recognize healthy condition accuracy of 0.95 by analyzing ‘omics’ data in the ‘mgf’ data format. The neural network makes possible to determine their similarity degree (distance matrix) between submitted phenotypes, thus overcoming algorithmic barriers in identifying HPLC-MS/MS spectra. The closest distance was shown between ovarian cancer, kidney cancer, and prostate cancer/kidney cancer, whereas the healthy phenotype was the most outer from cancer phenotypes. Neural networks are versatile and can be applied to standard experimental data formats of different analytical platforms.
提供机构:
Mendeley
创建时间:
2021-11-22



