five

Sequence-Based Prediction of Plant Allergenic Proteins: Machine Learning Classification Approach

收藏
NIAID Data Ecosystem2026-03-14 收录
下载链接:
https://figshare.com/articles/dataset/Sequence-Based_Prediction_of_Plant_Allergenic_Proteins_Machine_Learning_Classification_Approach/21936415
下载链接
链接失效反馈
官方服务:
资源简介:
This Article proposes a novel chemometric approach to understanding and exploring the allergenic nature of food proteins. Using machine learning methods (supervised and unsupervised), this work aims to predict the allergenicity of plant proteins. The strategy is based on scoring descriptors and testing their classification performance. Partitioning was based on support vector machines (SVM), and a k-nearest neighbor (KNN) classifier was applied. A fivefold cross-validation approach was used to validate the KNN classifier in the variable selection step as well as the final classifier. To overcome the problem of food allergies, a robust and efficient method for protein classification is needed.

本文提出一种新颖的化学计量学方法,用于解析与探究食品蛋白质的致敏特性。本研究采用监督与无监督机器学习方法,旨在预测植物蛋白质的致敏性。该研究策略通过对描述符进行评分并检验其分类性能得以实现。本研究通过支持向量机(Support Vector Machines, SVM)完成数据集划分,并应用了k近邻(k-nearest neighbor, KNN)分类器。研究采用五折交叉验证方法,在变量选择阶段与最终分类器构建环节均对KNN分类器进行了验证。为应对食品过敏问题,亟需一种稳健且高效的蛋白质分类方法。
创建时间:
2023-01-21
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作