YYYYYYibo/ultrafeedback_binarized_dataset_offline_pairrm_part1_mini_2
Source: Hugging Face | Updated: 2024-05-06 | Indexed: 2024-06-12
Download link:
https://hf-mirror.com/datasets/YYYYYYibo/ultrafeedback_binarized_dataset_offline_pairrm_part1_mini_2
Resource description:
---
dataset_info:
  features:
  - name: prompt
    dtype: string
  - name: prompt_id
    dtype: string
  - name: messages
    list:
    - name: content
      dtype: string
    - name: role
      dtype: string
  - name: score_chosen
    dtype: float64
  - name: score_rejected
    dtype: float64
  - name: reference_response
    dtype: string
  - name: chosen
    list:
    - name: content
      dtype: string
    - name: role
      dtype: string
  - name: rejected
    list:
    - name: content
      dtype: string
    - name: role
      dtype: string
  splits:
  - name: train_prefs
    num_bytes: 38911554
    num_examples: 4900
  download_size: 21718104
  dataset_size: 38911554
configs:
- config_name: default
  data_files:
  - split: train_prefs
    path: data/train_prefs-*
---
# Dataset Card for "ultrafeedback_binarized_dataset_offline_pairrm_part1_mini_2"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
The dataset has eight features: prompt, prompt_id, messages, score_chosen, score_rejected, reference_response, chosen, and rejected. The prompt, prompt_id, and reference_response features are strings; score_chosen and score_rejected are float64 values; messages, chosen, and rejected are lists whose entries each have a string content field and a string role field. There is a single configuration, default, with one split, train_prefs, of 4,900 examples. The dataset occupies 38,911,554 bytes, and the download is 21,718,104 bytes.
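The feature list above can be turned into a quick sanity check. The following is a minimal sketch, not part of the original card: `schema_problems` and `load_train_prefs` are hypothetical helper names, and the loader assumes the Hugging Face `datasets` package is installed and the repository is reachable.

```python
from typing import Any

REPO_ID = "YYYYYYibo/ultrafeedback_binarized_dataset_offline_pairrm_part1_mini_2"

# Top-level features and the Python types they are expected to map to,
# following the card metadata (string -> str, float64 -> float, list -> list).
EXPECTED_FIELDS = {
    "prompt": str,
    "prompt_id": str,
    "messages": list,
    "score_chosen": float,
    "score_rejected": float,
    "reference_response": str,
    "chosen": list,
    "rejected": list,
}


def load_train_prefs():
    """Load the train_prefs split (needs network access and `datasets`)."""
    # Imported lazily so the schema check below works without the package.
    from datasets import load_dataset

    return load_dataset(REPO_ID, split="train_prefs")


def schema_problems(record: dict[str, Any]) -> list[str]:
    """Return human-readable schema violations; an empty list means none."""
    problems = []
    for name, expected in EXPECTED_FIELDS.items():
        if name not in record:
            problems.append(f"missing field: {name}")
        elif not isinstance(record[name], expected):
            problems.append(f"{name}: expected {expected.__name__}")
    # Each list feature holds entries with string `content` and `role` fields.
    for name in ("messages", "chosen", "rejected"):
        value = record.get(name)
        if not isinstance(value, list):
            continue
        for i, turn in enumerate(value):
            if not (
                isinstance(turn, dict)
                and isinstance(turn.get("content"), str)
                and isinstance(turn.get("role"), str)
            ):
                problems.append(
                    f"{name}[{i}]: expected dict with string content and role"
                )
    return problems
```

Calling `schema_problems(load_train_prefs()[0])` would check the first record; the check itself runs on any plain dictionary, so it can be tried offline on hand-written records.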
Provided by:
YYYYYYibo
Summary of original information
Dataset overview
Dataset name
- ultrafeedback_binarized_dataset_offline_pairrm_part1_mini_2
Dataset features
- prompt (string)
- prompt_id (string)
- messages (list with the following sub-features)
  - content (string)
  - role (string)
- score_chosen (float)
- score_rejected (float)
- reference_response (string)
- chosen (list with the following sub-features)
  - content (string)
  - role (string)
- rejected (list with the following sub-features)
  - content (string)
  - role (string)
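Because chosen and rejected are lists of content/role entries, preference-tuning code usually needs them flattened to plain text. The sketch below shows one way to do that. It assumes, without confirmation from the card, that the final entry of each list holds the response being compared; `last_content` and `to_preference_triple` are hypothetical helper names.

```python
def last_content(turns):
    """Return the content of the final turn, or "" for an empty list."""
    return turns[-1]["content"] if turns else ""


def to_preference_triple(record):
    """Flatten one record into prompt / chosen / rejected text plus a margin.

    Assumption: the last entry of `chosen` and `rejected` is the response
    under comparison. The card does not state this explicitly.
    """
    return {
        "prompt": record["prompt"],
        "chosen": last_content(record["chosen"]),
        "rejected": last_content(record["rejected"]),
        # Score gap between the preferred and the rejected response.
        "margin": record["score_chosen"] - record["score_rejected"],
    }
```

The margin field is only a convenience for filtering pairs with small score gaps; the card does not say how the two scores were produced.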
Dataset splits
- train_prefs
  - Data size: 38911554 bytes
  - Number of examples: 4900
Dataset size
- Download size: 21718104 bytes
- Dataset size: 38911554 bytes
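The byte counts above are easier to interpret as derived quantities. As a small illustrative sketch (the function name `size_summary` is hypothetical), the average record size and the ratio of download size to dataset size follow directly from the three reported numbers:

```python
# Figures reported in the card metadata.
DATASET_BYTES = 38_911_554
DOWNLOAD_BYTES = 21_718_104
NUM_EXAMPLES = 4_900


def size_summary(dataset_bytes, download_bytes, num_examples):
    """Derive per-example size, download ratio, and MiB figures."""
    return {
        "bytes_per_example": dataset_bytes / num_examples,
        "download_ratio": download_bytes / dataset_bytes,
        "dataset_mib": dataset_bytes / 2**20,
        "download_mib": download_bytes / 2**20,
    }


summary = size_summary(DATASET_BYTES, DOWNLOAD_BYTES, NUM_EXAMPLES)
for key, value in summary.items():
    print(f"{key}: {value:.3f}")
```

On these figures each example averages roughly 7.9 kB, and the download is a little over half the size of the dataset.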



