argilla/ultrafeedback-prompts-with-ultrajudge

Name: argilla/ultrafeedback-prompts-with-ultrajudge
Creator: argilla
Published: 2023-11-24 12:27:36
License: 暂无描述

Hugging Face2023-11-24 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/argilla/ultrafeedback-prompts-with-ultrajudge

下载链接

链接失效反馈

官方服务：

资源简介：

--- dataset_info: features: - name: source dtype: string - name: input dtype: string - name: models sequence: string - name: completions list: - name: annotations struct: - name: helpfulness struct: - name: Rating dtype: string - name: Rationale dtype: string - name: Rationale For Rating dtype: string - name: Type sequence: string - name: honesty struct: - name: Rating dtype: string - name: Rationale dtype: string - name: instruction_following struct: - name: Rating dtype: string - name: Rationale dtype: string - name: truthfulness struct: - name: Rating dtype: string - name: Rationale dtype: string - name: Rationale For Rating dtype: string - name: Type sequence: string - name: critique dtype: string - name: custom_system_prompt dtype: string - name: model dtype: string - name: overall_score dtype: float64 - name: principle dtype: string - name: response dtype: string - name: correct_answers sequence: string - name: incorrect_answers sequence: string - name: generation_model dtype: string - name: generation_prompt dtype: string - name: raw_generation_responses sequence: string - name: generations sequence: string - name: labelling_model dtype: string - name: labelling_prompt list: - name: content dtype: string - name: role dtype: string - name: raw_labelling_response dtype: string - name: rating sequence: float64 - name: areas list: - name: Authenticity & Reliability struct: - name: rating dtype: string - name: rationale dtype: string - name: Clarity & Transparency struct: - name: rating dtype: string - name: rationale dtype: string - name: Compliance with Intent struct: - name: rating dtype: string - name: rationale dtype: string - name: Practical Accuracy struct: - name: rating dtype: string - name: rationale dtype: string splits: - name: train num_bytes: 1844998918 num_examples: 63967 download_size: 0 dataset_size: 1844998918 configs: - config_name: default data_files: - split: train path: data/train-* ---

数据集信息（dataset_info）：特征字段（features）： - 字段名称：source，数据类型（dtype）：字符串（string） - 字段名称：input，数据类型（dtype）：字符串（string） - 字段名称：models，数据类型（dtype）：字符串序列（sequence of string） - 字段名称：completions，数据类型（dtype）：列表（list），列表项为结构体（struct），包含以下子字段： - 子字段annotations（标注信息），结构体包含： - 子字段helpfulness（有用性），结构体包含： - Rating（评分）：字符串（string）类型 - Rationale（理由）：字符串（string）类型 - Rationale For Rating（评分理由）：字符串（string）类型 - Type（类型）：字符串序列（sequence of string） - 子字段honesty（诚实性），结构体包含： - Rating（评分）：字符串（string）类型 - Rationale（理由）：字符串（string）类型 - 子字段instruction_following（指令遵循性），结构体包含： - Rating（评分）：字符串（string）类型 - Rationale（理由）：字符串（string）类型 - 子字段truthfulness（真实性），结构体包含： - Rating（评分）：字符串（string）类型 - Rationale（理由）：字符串（string）类型 - Rationale For Rating（评分理由）：字符串（string）类型 - Type（类型）：字符串序列（sequence of string） - 子字段critique（批判性评价）：字符串（string）类型 - 子字段custom_system_prompt（自定义系统提示词）：字符串（string）类型 - 子字段model（生成模型）：字符串（string）类型 - 子字段overall_score（总体得分）：双精度浮点数（float64）类型 - 子字段principle（遵循原则）：字符串（string）类型 - 子字段response（模型响应）：字符串（string）类型 - 字段名称：correct_answers（正确答案集）：字符串序列（sequence of string） - 字段名称：incorrect_answers（错误答案集）：字符串序列（sequence of string） - 字段名称：generation_model（生成模型）：字符串（string）类型 - 字段名称：generation_prompt（生成提示词）：字符串（string）类型 - 字段名称：raw_generation_responses（原始生成响应集）：字符串序列（sequence of string） - 字段名称：generations（生成结果集）：字符串序列（sequence of string） - 字段名称：labelling_model（标注模型）：字符串（string）类型 - 字段名称：labelling_prompt（标注提示词）：列表（list）类型，包含以下子字段： - content（内容）：字符串（string）类型 - role（角色）：字符串（string）类型 - 字段名称：raw_labelling_response（原始标注响应）：字符串（string）类型 - 字段名称：rating（评分集）：浮点数序列（sequence of float64） - 字段名称：areas（评估领域）：列表（list）类型，列表项为结构体（struct），包含以下项： - Authenticity & Reliability（真实性与可靠性），结构体包含： - rating（评分）：字符串（string）类型 - rationale（理由）：字符串（string）类型 - Clarity & Transparency（清晰性与透明度），结构体包含： - rating（评分）：字符串（string）类型 - rationale（理由）：字符串（string）类型 - Compliance with Intent（意图遵循性），结构体包含： - rating（评分）：字符串（string）类型 - rationale（理由）：字符串（string）类型 - Practical Accuracy（实际准确性），结构体包含： - rating（评分）：字符串（string）类型 - rationale（理由）：字符串（string）类型拆分集（splits）： - 拆分名称：train（训练集），字节大小：1844998918，样本数量：63967 下载大小（download_size）：0 数据集总大小（dataset_size）：1844998918 配置项（configs）： - 配置名称：default（默认配置），数据文件： - 拆分：train（训练集），路径：data/train-*

提供机构：

argilla

原始信息汇总

数据集概述

数据特征

source: 字符串类型
input: 字符串类型
models: 字符串序列
completions: 列表类型，包含以下结构：
- annotations: 结构体，包含以下字段：
  - helpfulness: 结构体，包含以下字段：
    - Rating: 字符串类型
    - Rationale: 字符串类型
    - Rationale For Rating: 字符串类型
    - Type: 字符串序列
  - honesty: 结构体，包含以下字段：
    - Rating: 字符串类型
    - Rationale: 字符串类型
  - instruction_following: 结构体，包含以下字段：
    - Rating: 字符串类型
    - Rationale: 字符串类型
  - truthfulness: 结构体，包含以下字段：
    - Rating: 字符串类型
    - Rationale: 字符串类型
    - Rationale For Rating: 字符串类型
    - Type: 字符串序列
- critique: 字符串类型
- custom_system_prompt: 字符串类型
- model: 字符串类型
- overall_score: 浮点数类型
- principle: 字符串类型
- response: 字符串类型
correct_answers: 字符串序列
incorrect_answers: 字符串序列
generation_model: 字符串类型
generation_prompt: 字符串类型
raw_generation_responses: 字符串序列
generations: 字符串序列
labelling_model: 字符串类型
labelling_prompt: 列表类型，包含以下字段：
- content: 字符串类型
- role: 字符串类型
raw_labelling_response: 字符串类型
rating: 浮点数序列
areas: 列表类型，包含以下结构：
- Authenticity & Reliability: 结构体，包含以下字段：
  - rating: 字符串类型
  - rationale: 字符串类型
- Clarity & Transparency: 结构体，包含以下字段：
  - rating: 字符串类型
  - rationale: 字符串类型
- Compliance with Intent: 结构体，包含以下字段：
  - rating: 字符串类型
  - rationale: 字符串类型
- Practical Accuracy: 结构体，包含以下字段：
  - rating: 字符串类型
  - rationale: 字符串类型

数据分割

train: 包含63967个样本，占用1844998918字节

数据集大小

下载大小: 0字节
数据集大小: 1844998918字节

配置

default: 包含以下数据文件：
- train: 路径为data/train-*

5,000+

优质数据集

54 个

任务类型

进入经典数据集