honicky/genept-composable-embeddings-source-data
收藏Hugging Face2025-01-14 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/honicky/genept-composable-embeddings-source-data
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是基于GenePT项目和论文的扩展,以更易于使用的格式重现了原始数据。它旨在让用户能够跨维度组合嵌入,以针对特定任务进行优化,并通过生成新的描述并将其嵌入相同空间来扩展现有的基础嵌入。数据集包括来自NCBI和UniProt的基因描述、基因信息表以及AI生成的基因描述。
This dataset reproduces and expands upon the GenePT project and paper, providing the original data in a more user-friendly format. It aims to enable users to compose embeddings across dimensions for task specialization and to augment existing base embeddings by generating new descriptions and incorporating them into the same space. The dataset includes gene descriptions from NCBI and UniProt, a gene information table, and AI-generated gene descriptions.
提供机构:
honicky



