keiwoo/TCRdb2
收藏Hugging Face2025-10-17 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/keiwoo/TCRdb2
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是从TCRdb2.0收集的T细胞受体序列数据集,经过去重处理后包含约290,315,598条非重复序列,另有一个经过cd-hit筛选后保留约1,782,927条高相似度序列的版本。数据集适用于特征提取任务,数据量在100M到1B之间。
This dataset is a collection of T cell receptor sequences from TCRdb2.0, containing approximately 290,315,598 non-redundant sequences after deduplication, and another version with about 1,782,927 sequences remaining after cd-hit filtering for high similarity. It is suitable for feature extraction tasks and the dataset size is between 100M and 1B.
提供机构:
keiwoo



