colinhorger/random_embeddings_temp
收藏Hugging Face2024-12-06 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/colinhorger/random_embeddings_temp
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含两个主要特征:seqs(字符串类型)和vectors(浮点数序列类型)。数据集被分为训练集、测试集和验证集,分别包含1511、190和190个样本。训练集大小为3612119字节,测试集大小为456835字节,验证集大小为459068字节。总下载大小为6095257字节,数据集总大小为4528022字节。数据文件的路径分别为:训练集路径为data/train-*,测试集路径为data/test-*,验证集路径为data/validation-*。
The dataset contains two main features: seqs (string type) and vectors (sequence of float32). The dataset is divided into training, test, and validation sets, containing 1511, 190, and 190 samples respectively. The training set size is 3612119 bytes, the test set size is 456835 bytes, and the validation set size is 459068 bytes. The total download size is 6095257 bytes, and the total dataset size is 4528022 bytes. The data file paths are: training set path is data/train-*, test set path is data/test-*, and validation set path is data/validation-*.
提供机构:
colinhorger



