yvngexe/stack-exchange-paired-v0
收藏Hugging Face2025-09-03 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/yvngexe/stack-exchange-paired-v0
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: rl_deduplicate_shuffled
features:
- name: qid
dtype: int64
- name: question
dtype: string
- name: date
dtype: string
- name: metadata
sequence: string
- name: response_j
dtype: string
- name: response_k
dtype: string
splits:
- name: train
num_bytes: 8012790573
num_examples: 2817470
download_size: 4407573092
dataset_size: 8012790573
- config_name: rl_deduplicate_shuffled_20000
features:
- name: qid
dtype: int64
- name: question
dtype: string
- name: date
dtype: string
- name: metadata
sequence: string
- name: response_j
dtype: string
- name: response_k
dtype: string
splits:
- name: train
num_bytes: 56944706
num_examples: 20000
download_size: 31272569
dataset_size: 56944706
configs:
- config_name: rl_deduplicate_shuffled
data_files:
- split: train
path: rl_deduplicate_shuffled/train-*
- config_name: rl_deduplicate_shuffled_20000
data_files:
- split: train
path: rl_deduplicate_shuffled_20000/train-*
---
数据集信息:
- 配置名称:rl_deduplicate_shuffled
特征列:
- 字段名称:qid,数据类型:64位整数(int64)
- 字段名称:question,数据类型:字符串(string)
- 字段名称:date,数据类型:字符串(string)
- 字段名称:metadata,数据类型:字符串序列(sequence<string>)
- 字段名称:response_j,数据类型:字符串(string)
- 字段名称:response_k,数据类型:字符串(string)
数据划分:
- 划分名称:train,字节数:8012790573,样本数量:2817470
下载大小:4407573092
数据集存储大小:8012790573
- 配置名称:rl_deduplicate_shuffled_20000
特征列:
- 字段名称:qid,数据类型:64位整数(int64)
- 字段名称:question,数据类型:字符串(string)
- 字段名称:date,数据类型:字符串(string)
- 字段名称:metadata,数据类型:字符串序列(sequence<string>)
- 字段名称:response_j,数据类型:字符串(string)
- 字段名称:response_k,数据类型:字符串(string)
数据划分:
- 划分名称:train,字节数:56944706,样本数量:20000
下载大小:31272569
数据集存储大小:56944706
配置项:
- 配置名称:rl_deduplicate_shuffled
数据文件:
- 数据划分:train,文件路径:rl_deduplicate_shuffled/train-*
- 配置名称:rl_deduplicate_shuffled_20000
数据文件:
- 数据划分:train,文件路径:rl_deduplicate_shuffled_20000/train-*
提供机构:
yvngexe



