thusinh1969/Rerank-LLaMA-3.2-3B-30Nov2024-11M
收藏Hugging Face2024-12-04 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/thusinh1969/Rerank-LLaMA-3.2-3B-30Nov2024-11M
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含四个主要特征:input_ids(序列类型,int32)、labels(序列类型,int64)、len(int64)和text(字符串类型)。数据集分为训练集、测试集和评估集,分别包含11052047、40000和10000个样本。训练集的大小为150874169089字节,测试集为549128717字节,评估集为136608158字节。整个数据集的下载大小为32789230071字节,总大小为151559905964字节。数据文件路径分别为data/train-*、data/test-*和data/eval-*。
The dataset includes four main features: input_ids (sequence type, int32), labels (sequence type, int64), len (int64), and text (string type). The dataset is divided into training, test, and evaluation sets, containing 11052047, 40000, and 10000 samples respectively. The training set size is 150874169089 bytes, the test set is 549128717 bytes, and the evaluation set is 136608158 bytes. The total download size of the dataset is 32789230071 bytes, with an overall size of 151559905964 bytes. The data file paths are data/train-*, data/test-*, and data/eval-*.
提供机构:
thusinh1969



