Employing fingerprinting of medicinal plants by means of LC-MS and machine learning for species identification task
收藏NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://www.omicsdi.org/dataset/metabolights_dataset/MTBLS688
下载链接
链接失效反馈官方服务:
资源简介:
A dataset of liquid chromatography-mass spectrometry measurements of medicinal plant extracts from 76 species was generated and used for training and validating plant species identification algorithms. Various strategies for data handling and feature space selection were tested. Constrained Tucker decomposition, large-scale (more than 1500 variables) discrete Bayesian Networks and autoencoder based dimensionality reduction coupled with continuous Bayes classifier and logistic regression were optimized to achieve the best accuracy. Classification algorithms based on Tucker decomposition of original data and logistic regression on representation learned with autoencoder showed identification accuracy of up to 96%, outperforming various implementations of Bayesian Networks. Benefits and drawbacks of used approaches were discussed. Tolerance to changes in data created by using different extraction methods and equipment was tentatively tested.
Main study is reported in the current study MTBLS688
Helianthus tuberosus assay is reported in MTBLS759
创建时间:
2022-04-15



