sbhargav/reddit_tomt_mteb
Source: Hugging Face. Updated: 2026-01-06. Indexed: 2026-03-29.
Download link:
https://hf-mirror.com/datasets/sbhargav/reddit_tomt_mteb
Resource overview:
---
dataset_info:
- config_name: corpus
  features:
  - name: _id
    dtype: string
  - name: title
    dtype: string
  - name: text
    dtype: string
  splits:
  - name: corpus
    num_bytes: 9772890386
    num_examples: 3185312
  download_size: 4789138669
  dataset_size: 9772890386
- config_name: default
  features:
  - name: query-id
    dtype: string
  - name: corpus-id
    dtype: string
  - name: score
    dtype: int64
  splits:
  - name: test
    num_bytes: 34706
    num_examples: 1180
  - name: train
    num_bytes: 277704
    num_examples: 9455
  - name: val
    num_bytes: 34817
    num_examples: 1186
  download_size: 202741
  dataset_size: 347227
- config_name: queries
  features:
  - name: _id
    dtype: string
  - name: text
    dtype: string
  splits:
  - name: queries
    num_bytes: 7870264
    num_examples: 11821
  download_size: 4733975
  dataset_size: 7870264
configs:
- config_name: corpus
  data_files:
  - split: corpus
    path: corpus/corpus-*
- config_name: default
  data_files:
  - split: test
    path: data/test-*
  - split: train
    path: data/train-*
  - split: val
    path: data/val-*
- config_name: queries
  data_files:
  - split: queries
    path: queries/queries-*
---
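
The metadata above describes three configs: `corpus`, `queries`, and `default`. The `default` config has `query-id`, `corpus-id`, and `score` columns, which looks like a table of relevance judgments, although the card does not say so explicitly. The following is a minimal Python sketch of how the three configs might be loaded with the Hugging Face `datasets` library. The helper names `build_qrels` and `load_tomt` are hypothetical and not part of the dataset, and reading `default` as relevance judgments is an assumption.

```python
from collections import defaultdict

# Repository ID taken from the page title.
REPO_ID = "sbhargav/reddit_tomt_mteb"

# Split sizes of the `default` config, copied from the metadata above.
SPLIT_SIZES = {"test": 1180, "train": 9455, "val": 1186}
# Number of rows in the `queries` config, copied from the metadata above.
NUM_QUERIES = 11821


def build_qrels(rows):
    """Group rows into {query_id: {corpus_id: score}} (hypothetical helper)."""
    qrels = defaultdict(dict)
    for row in rows:
        qrels[row["query-id"]][row["corpus-id"]] = int(row["score"])
    return dict(qrels)


def load_tomt(split="test", stream_corpus=True):
    """Load judgments, queries, and corpus (hypothetical helper).

    The corpus download is about 4.8 GB according to the metadata, so it is
    streamed by default instead of being downloaded in full.
    """
    # Imported lazily so build_qrels works without `datasets` installed.
    from datasets import load_dataset

    qrels = build_qrels(load_dataset(REPO_ID, "default", split=split))
    queries = load_dataset(REPO_ID, "queries", split="queries")
    corpus = load_dataset(
        REPO_ID, "corpus", split="corpus", streaming=stream_corpus
    )
    return qrels, queries, corpus


if __name__ == "__main__":
    # Made-up rows for illustration only; real IDs come from the dataset.
    demo = [
        {"query-id": "q1", "corpus-id": "d1", "score": 1},
        {"query-id": "q2", "corpus-id": "d7", "score": 1},
    ]
    print(build_qrels(demo))
```

The three split sizes of `default` add up to 1180 + 9455 + 1186 = 11821, the same as the row count of `queries`. That is consistent with one judgment row per query, but the metadata alone does not confirm it.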
Provider:
sbhargav



