bclavie/msmarco-2m-triplets
Source: Hugging Face. Updated 2024-05-21; indexed 2024-06-12.
Download link:
https://hf-mirror.com/datasets/bclavie/msmarco-2m-triplets
Resource description:
---
dataset_info:
  features:
  - name: query
    dtype: string
  - name: positive
    dtype: string
  - name: negative
    dtype: string
  splits:
  - name: train
    num_bytes: 1487444945.3028998
    num_examples: 2000000
  download_size: 946860523
  dataset_size: 1487444945.3028998
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
---
The dataset has three features, query, positive, and negative, all of type string. It has a single split, train, with 2,000,000 examples totaling 1487444945.3028998 bytes. The download size is 946860523 bytes. The only configuration is named default, and it reads the training data from files matching data/train-*.
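A minimal sketch of how the schema described above might be consumed, assuming the Hugging Face `datasets` library is installed and the repository is reachable under the ID in the page title. The helper names `is_valid_triplet` and `stream_triplets` and the placeholder row are hypothetical and not part of the dataset card.

```python
from typing import Iterator

# Feature names as listed in the dataset card.
EXPECTED_FEATURES = ("query", "positive", "negative")


def is_valid_triplet(row: dict) -> bool:
    """Return True if the row has exactly the three string fields from the card."""
    return set(row) == set(EXPECTED_FEATURES) and all(
        isinstance(row[name], str) for name in EXPECTED_FEATURES
    )


def stream_triplets(repo_id: str = "bclavie/msmarco-2m-triplets") -> Iterator[dict]:
    """Lazily yield rows from the train split instead of downloading it all first."""
    # Imported here so the schema check above works without the library installed.
    from datasets import load_dataset

    dataset = load_dataset(repo_id, split="train", streaming=True)
    for row in dataset:
        yield row


# Hand-written placeholder row (not taken from the dataset) to exercise the check.
example_row = {"query": "q", "positive": "p", "negative": "n"}
print(is_valid_triplet(example_row))
```

Streaming is used here only because the download is close to 1 GB; calling `load_dataset` without `streaming=True` would fetch the full split instead.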
Provider:
bclavie
Summary of original information
Dataset overview
Dataset features
- query: string.
- positive: string.
- negative: string.
Dataset splits
- Training set (train):
  - Number of records: 2,000,000
  - Storage size: 1,487,444,945.3028998 bytes
Dataset size
- Download size: 946,860,523 bytes
- Total dataset size: 1,487,444,945.3028998 bytes
Configuration
- Default configuration (default):
  - Training set path: data/train-*
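The size figures listed above can be put in perspective with a little arithmetic. The sketch below derives the average size per example and the ratio of download size to dataset size. It assumes, as is usual for Hugging Face dataset cards, that the dataset size is the in-memory (Arrow) footprint and the download size is the footprint of the stored files; the card itself does not say so.

```python
# Figures copied from the dataset card.
NUM_EXAMPLES = 2_000_000
DATASET_SIZE_BYTES = 1487444945.3028998
DOWNLOAD_SIZE_BYTES = 946_860_523

# Average footprint of one (query, positive, negative) triplet.
bytes_per_example = DATASET_SIZE_BYTES / NUM_EXAMPLES

# Fraction of the dataset size that actually has to be downloaded.
download_ratio = DOWNLOAD_SIZE_BYTES / DATASET_SIZE_BYTES

print(f"{bytes_per_example:.1f} bytes per example")
print(f"download is {download_ratio:.1%} of the dataset size")
```

Under these assumptions each triplet averages roughly 744 bytes, and the download is a little under two thirds of the dataset size.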



