GleghornLab/amplify_embeddings
Source: Hugging Face | Updated: 2024-10-08 | Indexed: 2024-12-14
Download link:
https://hf-mirror.com/datasets/GleghornLab/amplify_embeddings
Overview:
The dataset has two main features, seqs and vectors, which hold sequences and vectors respectively. It is divided into two parts, AMPLIFY_120M and AMPLIFY_350M, each containing 757,774 examples: AMPLIFY_120M occupies 2,142,475,517 bytes and AMPLIFY_350M occupies 3,112,426,237 bytes. The total dataset size is 5,254,901,754 bytes, which is exactly the sum of the two parts, and the total download size is 6,188,735,081 bytes. The configuration specifies the paths to the data files.
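As a rough illustration of the figures above, the sketch below records the per-part sizes, checks that they add up to the stated dataset size, and estimates the average bytes per example. The helper names (`total_bytes`, `avg_bytes_per_example`, `stream_part`) are hypothetical. `stream_part` additionally assumes that the two parts are exposed as Hugging Face split names and that the `datasets` library is installed; the listing itself does not confirm either point.

```python
# Sizes and counts copied from the description above.
PARTS = {
    "AMPLIFY_120M": {"num_examples": 757_774, "num_bytes": 2_142_475_517},
    "AMPLIFY_350M": {"num_examples": 757_774, "num_bytes": 3_112_426_237},
}
DATASET_SIZE = 5_254_901_754   # total dataset size in bytes
DOWNLOAD_SIZE = 6_188_735_081  # total download size in bytes


def total_bytes(parts):
    """Sum the byte sizes of all parts."""
    return sum(info["num_bytes"] for info in parts.values())


def avg_bytes_per_example(parts, name):
    """Average size of one example (seqs + vectors) in the given part."""
    info = parts[name]
    return info["num_bytes"] / info["num_examples"]


def stream_part(name, repo_id="GleghornLab/amplify_embeddings"):
    """Lazily iterate over one part without downloading it in full.

    Assumption: the part names are valid split names for this repository.
    """
    from datasets import load_dataset  # imported here so the rest runs offline

    return load_dataset(repo_id, split=name, streaming=True)


if __name__ == "__main__":
    # The two parts should account for the whole dataset size.
    print(total_bytes(PARTS) == DATASET_SIZE)
    for part_name in PARTS:
        print(part_name, round(avg_bytes_per_example(PARTS, part_name), 1))
```

If the split-name assumption holds, `next(iter(stream_part("AMPLIFY_120M")))` would return a single record with the `seqs` and `vectors` fields, which is a cheap way to inspect the data before committing to the multi-gigabyte download.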
Provided by:
GleghornLab



