five

UdonPred

收藏
Figshare2026-03-02 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/UdonPred/31444642
下载链接
链接失效反馈
官方服务:
资源简介:
Datasets used to train the UdonPred Models. UdonPred is available at: https://github.com/DavidWagemann/UdonPredThe UdonPred Preprint is available at: https://www.biorxiv.org/content/10.64898/2026.01.26.701679v2Motivation Regions in intrinsic disordered proteins (IDPs) constitute important continuous aspects of protein function. While their existence on a structural continuum is widely accepted, most computational predictions have, nevertheless, focused on binary classifications. Existing datasets are severely limited in size and experimental evidence for continuous disorder.Results Building on recently released datasets of continuous protein disorder and flexibility, we introduce UdonPred, a lightweight neural network exclusively inputting embeddings from the protein Language Model (pLM) ProstT5 to predict per-residue protein disorder from sequence alone. Training and evaluating UdonPred on seven datasets with divergent definitions of disorder and flexibility suggests that not model capacity, but agreement and nuance of disorder annotations, remains the main driver of performance. Binary disorder annotations can be reliably predicted from a multitude of different disorder and flexibility datasets, but there is still room for improvement in predicting continuous disorder.
创建时间:
2026-03-02
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作