jeanq1/IDP-Euka-90
Source: Hugging Face | Updated: 2025-11-20 | Indexed: 2025-11-15
Download link:
https://hf-mirror.com/datasets/jeanq1/IDP-Euka-90
Description:
IDP-Euka-90 is a dataset of curated eukaryotic protein sequences for representation learning and downstream analysis of intrinsically disordered proteins/regions (IDPs/IDRs). It consists of IDP regions extracted from all available eukaryotic proteomes in UniProt, then clustered at 90% to remove near-duplicate sequences.
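The near-duplicate removal step described above can be illustrated with a small sketch. The listing does not say which clustering tool or similarity measure was used to build the dataset, so everything here is an assumption for illustration only: `difflib.SequenceMatcher.ratio()` stands in for sequence identity, the greedy longest-first strategy is one common approach rather than the dataset's actual pipeline, and the example sequences are made up.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Toy stand-in for sequence identity (an assumption, not the dataset's metric)."""
    return SequenceMatcher(None, a, b, autojunk=False).ratio()


def greedy_cluster(sequences, threshold=0.9):
    """Keep one representative per cluster, visiting sequences longest first.

    A sequence becomes a new representative only if its similarity to every
    existing representative is below `threshold`.
    """
    representatives = []
    for seq in sorted(sequences, key=len, reverse=True):
        if all(similarity(seq, rep) < threshold for rep in representatives):
            representatives.append(seq)
    return representatives


# Hypothetical disordered-region sequences, invented for this example.
regions = [
    "MSEEKPKEGSSSSPEKKDDGSSEEKPAAS",
    "MSEEKPKEGSSSSPEKKDDGSSEEKPAAT",  # differs from the first by one residue
    "GRGGFGGRGGFRGGRGGDRGGFGGRGG",
]

reps = greedy_cluster(regions, threshold=0.9)
print(len(reps), "representatives kept out of", len(regions))
```

This quadratic comparison only works for toy inputs. At proteome scale, dedicated sequence-clustering tools are the practical choice.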
Provided by:
jeanq1



