UdonPred
Source: Figshare. Updated: 2026-03-02. Indexed: 2026-04-28.
Download link:
https://figshare.com/articles/dataset/UdonPred/31444642
Description:
Datasets used to train the UdonPred models.
UdonPred is available at: https://github.com/DavidWagemann/UdonPred
The UdonPred preprint is available at: https://www.biorxiv.org/content/10.64898/2026.01.26.701679v2

Motivation: Regions in intrinsically disordered proteins (IDPs) constitute important continuous aspects of protein function. Although it is widely accepted that they exist on a structural continuum, most computational predictions have nevertheless focused on binary classification. Existing datasets are severely limited in size and in experimental evidence for continuous disorder.

Results: Building on recently released datasets of continuous protein disorder and flexibility, we introduce UdonPred, a lightweight neural network that takes only embeddings from the protein Language Model (pLM) ProstT5 as input to predict per-residue protein disorder from sequence alone. Training and evaluating UdonPred on seven datasets with divergent definitions of disorder and flexibility suggests that the agreement and nuance of disorder annotations, rather than model capacity, remain the main driver of performance. Binary disorder annotations can be reliably predicted from a multitude of different disorder and flexibility datasets, but there is still room for improvement in predicting continuous disorder.
Created:
2026-03-02



