five

PhyCysID: Plant Cystatin Protein Prediction by an Artificial Intelligence Approach

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://figshare.com/articles/dataset/PhyCysID_Plant_Cystatin_Protein_Prediction_by_an_Artificial_Intelligence_Approach/30053372
下载链接
链接失效反馈
官方服务:
资源简介:
Phytocystatins are proteinaceous inhibitors found in plants that competitively target various classes of cysteine proteinases, including papain-like enzymes, cathepsins, and legumains. Based on structural characteristics and gene organization, phytocystatins can be classified into four subtypes: intronless (I1 and I2), intron-containing (IwI), and multidomain cystatins containing more than one inhibitory region (II). This work presents PhyCysID, a dedicated web server designed for the rapid classification of phytocystatin subtypes. PhyCysID uses a set of 21 features derived from amino acid composition, in combination with 15 distinct machine learning algorithms, to classify phytocystatin sequences into one of the four subtypes. Initially, the input sequence is analyzed to verify if it comprises a true phytocystatin sequence. If so, the input sequence is further analyzed using a specialized classification pipeline called PhyCysID 12M, which integrates 12 machine learning models to assign it to one of the four defined phytocystatin classes. As a case study, a curated dataset of phytocystatin sequences from the UniProt database was used to evaluate the algorithm’s performance. The PhyCysID web server enables rapid classification of both individual and batch-submitted sequences in less than 15 s, providing high-throughput analysis for an accurate identification of phytocystatin class and function. PhyCysID is freely available at https://www.ufrgs.br/labec/phycysid.
创建时间:
2025-09-04
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作