hyunnnnnnnn/rxrx3_smiles_embedding
Source: Hugging Face. Updated 2025-12-19; indexed 2025-12-20.
Download link:
https://hf-mirror.com/datasets/hyunnnnnnnn/rxrx3_smiles_embedding
Description:
The dataset consists of two parts: 1) Cell Images (RxRx3): SMILES embedding data and metadata, together with PyTorch-saved objects containing the keys embeddings, smiles, and well_id; 2) Protein sequence (TDC DTI): ID, sequence, and feature data for 5222 proteins, with feature columns numbered 0 to 2559. Data source: https://huggingface.co/datasets/recursionpharma/rxrx3-core/tree/main.
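The PyTorch-saved objects described above could be consumed roughly as follows. This is a minimal sketch, not the dataset author's code: only the key names embeddings, smiles, and well_id come from the listing. The helper `index_by_well`, the toy values, and the assumption that the three entries are row-aligned sequences of equal length are illustrative and should be checked against the actual files.

```python
REQUIRED_KEYS = ("embeddings", "smiles", "well_id")


def index_by_well(obj):
    """Map each well_id to its (smiles, embedding) pair.

    Assumption (not confirmed by the listing): the three entries are
    row-aligned sequences of the same length.
    """
    missing = [k for k in REQUIRED_KEYS if k not in obj]
    if missing:
        raise KeyError(f"missing keys: {missing}")
    n = len(obj["embeddings"])
    if len(obj["smiles"]) != n or len(obj["well_id"]) != n:
        raise ValueError("embeddings, smiles and well_id differ in length")
    return {
        well: (smi, emb)
        for well, smi, emb in zip(obj["well_id"], obj["smiles"], obj["embeddings"])
    }


# Toy stand-in with made-up values. With the real data, `toy` would be
# replaced by the object returned from torch.load on one of the dataset's
# saved files (file names are not given in the listing).
toy = {
    "embeddings": [[0.1, 0.2], [0.3, 0.4]],
    "smiles": ["CCO", "c1ccccc1"],
    "well_id": ["well_A", "well_B"],
}
lookup = index_by_well(toy)
print(lookup["well_B"][0])
```

Validating the keys and lengths before building the lookup makes a mismatch between the file and the listing's description fail loudly instead of silently truncating in `zip`.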
Provided by:
hyunnnnnnnn



