mcding-org/CorrectDPO-Dataset-U0
收藏Hugging Face2024-05-21 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/mcding-org/CorrectDPO-Dataset-U0
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: source
dtype: string
- name: instruction
dtype: string
- name: models
sequence: string
- name: completions
list:
- name: annotations
struct:
- name: helpfulness
struct:
- name: Rating
dtype: string
- name: Rationale
dtype: string
- name: Rationale For Rating
dtype: string
- name: Type
sequence: string
- name: honesty
struct:
- name: Rating
dtype: string
- name: Rationale
dtype: string
- name: instruction_following
struct:
- name: Rating
dtype: string
- name: Rationale
dtype: string
- name: truthfulness
struct:
- name: Rating
dtype: string
- name: Rationale
dtype: string
- name: Rationale For Rating
dtype: string
- name: Type
sequence: string
- name: critique
dtype: string
- name: custom_system_prompt
dtype: string
- name: fine-grained_score
dtype: float64
- name: model
dtype: string
- name: overall_score
dtype: float64
- name: principle
dtype: string
- name: response
dtype: string
- name: correct_answers
sequence: string
- name: incorrect_answers
sequence: string
- name: prompt
dtype: string
- name: chosen
dtype: string
- name: rejected
dtype: string
splits:
- name: train
num_bytes: 972373441
num_examples: 60435
- name: eval
num_bytes: 51047585
num_examples: 3187
download_size: 437066514
dataset_size: 1023421026
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: eval
path: data/eval-*
---
This dataset contains multiple features, primarily used for evaluating and training models in various aspects such as helpfulness, honesty, instruction following, and truthfulness. The dataset is divided into training and evaluation parts, with 60435 examples in the training part and 3187 examples in the evaluation part.
提供机构:
mcding-org
原始信息汇总
数据集信息
特征
- source: 字符串类型
- instruction: 字符串类型
- models: 字符串序列
- completions: 列表类型,包含以下结构:
- annotations: 结构体,包含以下字段:
- helpfulness: 结构体,包含以下字段:
- Rating: 字符串类型
- Rationale: 字符串类型
- Rationale For Rating: 字符串类型
- Type: 字符串序列
- honesty: 结构体,包含以下字段:
- Rating: 字符串类型
- Rationale: 字符串类型
- instruction_following: 结构体,包含以下字段:
- Rating: 字符串类型
- Rationale: 字符串类型
- truthfulness: 结构体,包含以下字段:
- Rating: 字符串类型
- Rationale: 字符串类型
- Rationale For Rating: 字符串类型
- Type: 字符串序列
- helpfulness: 结构体,包含以下字段:
- critique: 字符串类型
- custom_system_prompt: 字符串类型
- fine-grained_score: 浮点数类型
- model: 字符串类型
- overall_score: 浮点数类型
- principle: 字符串类型
- response: 字符串类型
- annotations: 结构体,包含以下字段:
- correct_answers: 字符串序列
- incorrect_answers: 字符串序列
- prompt: 字符串类型
- chosen: 字符串类型
- rejected: 字符串类型
数据分割
- train: 包含972373441字节,60435个样本
- eval: 包含51047585字节,3187个样本
数据大小
- 下载大小: 437066514字节
- 数据集大小: 1023421026字节
配置
- default: 包含以下数据文件:
- train: 路径为
data/train-* - eval: 路径为
data/eval-*
- train: 路径为



