plaguss/curation-ultrafeedback-scores

Name: plaguss/curation-ultrafeedback-scores
Creator: plaguss
Published: 2023-12-05 12:59:16
License: 暂无描述

Hugging Face2023-12-05 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/plaguss/curation-ultrafeedback-scores

下载链接

链接失效反馈

官方服务：

资源简介：

--- dataset_info: features: - name: source dtype: string - name: instruction dtype: string - name: best_rated_is_different_from_best_overall dtype: bool - name: best_overall_model dtype: string - name: score_best_overall dtype: float64 - name: best_rated_model dtype: string - name: score_best_rated dtype: float64 - name: best_overall_score_response struct: - name: annotations struct: - name: helpfulness struct: - name: Rating dtype: string - name: Rationale dtype: string - name: Rationale For Rating dtype: string - name: Type sequence: string - name: honesty struct: - name: Rating dtype: string - name: Rationale dtype: string - name: instruction_following struct: - name: Rating dtype: string - name: Rationale dtype: string - name: truthfulness struct: - name: Rating dtype: string - name: Rationale dtype: string - name: Rationale For Rating dtype: string - name: Type sequence: string - name: average_rating dtype: float64 - name: critique dtype: string - name: custom_system_prompt dtype: string - name: model dtype: string - name: overall_score dtype: float64 - name: principle dtype: string - name: response dtype: string - name: random_response_for_best_overall struct: - name: annotations struct: - name: helpfulness struct: - name: Rating dtype: string - name: Rationale dtype: string - name: Rationale For Rating dtype: string - name: Type sequence: string - name: honesty struct: - name: Rating dtype: string - name: Rationale dtype: string - name: instruction_following struct: - name: Rating dtype: string - name: Rationale dtype: string - name: truthfulness struct: - name: Rating dtype: string - name: Rationale dtype: string - name: Rationale For Rating dtype: string - name: Type sequence: string - name: average_rating dtype: float64 - name: critique dtype: string - name: custom_system_prompt dtype: string - name: model dtype: string - name: overall_score dtype: float64 - name: principle dtype: string - name: response dtype: string - name: best_rated_response struct: - name: annotations struct: - name: helpfulness struct: - name: Rating dtype: string - name: Rationale dtype: string - name: Rationale For Rating dtype: string - name: Type sequence: string - name: honesty struct: - name: Rating dtype: string - name: Rationale dtype: string - name: instruction_following struct: - name: Rating dtype: string - name: Rationale dtype: string - name: truthfulness struct: - name: Rating dtype: string - name: Rationale dtype: string - name: Rationale For Rating dtype: string - name: Type sequence: string - name: average_rating dtype: float64 - name: critique dtype: string - name: custom_system_prompt dtype: string - name: model dtype: string - name: overall_score dtype: float64 - name: principle dtype: string - name: response dtype: string - name: random_response_for_best_rated struct: - name: annotations struct: - name: helpfulness struct: - name: Rating dtype: string - name: Rationale dtype: string - name: Rationale For Rating dtype: string - name: Type sequence: string - name: honesty struct: - name: Rating dtype: string - name: Rationale dtype: string - name: instruction_following struct: - name: Rating dtype: string - name: Rationale dtype: string - name: truthfulness struct: - name: Rating dtype: string - name: Rationale dtype: string - name: Rationale For Rating dtype: string - name: Type sequence: string - name: average_rating dtype: float64 - name: critique dtype: string - name: custom_system_prompt dtype: string - name: model dtype: string - name: overall_score dtype: float64 - name: principle dtype: string - name: response dtype: string - name: score_random_response_for_best_overall dtype: float64 - name: score_random_response_for_rated dtype: float64 - name: completions list: - name: annotations struct: - name: helpfulness struct: - name: Rating dtype: string - name: Rationale dtype: string - name: Rationale For Rating dtype: string - name: Type sequence: string - name: honesty struct: - name: Rating dtype: string - name: Rationale dtype: string - name: instruction_following struct: - name: Rating dtype: string - name: Rationale dtype: string - name: truthfulness struct: - name: Rating dtype: string - name: Rationale dtype: string - name: Rationale For Rating dtype: string - name: Type sequence: string - name: average_rating dtype: float64 - name: critique dtype: string - name: custom_system_prompt dtype: string - name: model dtype: string - name: overall_score dtype: float64 - name: principle dtype: string - name: response dtype: string - name: random_response_for_rated dtype: float64 - name: rating-distilabel-gpt4 sequence: float64 - name: rationale-distilabel-gpt4 sequence: string splits: - name: train num_bytes: 46976035 num_examples: 2405 download_size: 18006660 dataset_size: 46976035 configs: - config_name: default data_files: - split: train path: data/train-* ---

提供机构：

plaguss

原始信息汇总

数据集概述

数据集特征

基本特征

source: 数据来源，类型为字符串。
instruction: 指令，类型为字符串。
best_rated_is_different_from_best_overall: 最佳评级与最佳总体是否不同，类型为布尔值。
best_overall_model: 最佳总体模型，类型为字符串。
score_best_overall: 最佳总体模型得分，类型为浮点数。
best_rated_model: 最佳评级模型，类型为字符串。
score_best_rated: 最佳评级模型得分，类型为浮点数。

复杂特征

best_overall_score_response: 最佳总体模型得分响应，包含以下结构：
- annotations: 注释，包含以下子结构：
  - helpfulness: 帮助性，包含以下子结构：
    - Rating: 评级，类型为字符串。
    - Rationale: 理由，类型为字符串。
    - Rationale For Rating: 评级理由，类型为字符串。
    - Type: 类型，类型为字符串序列。
  - honesty: 诚实性，包含以下子结构：
    - Rating: 评级，类型为字符串。
    - Rationale: 理由，类型为字符串。
  - instruction_following: 遵循指令，包含以下子结构：
    - Rating: 评级，类型为字符串。
    - Rationale: 理由，类型为字符串。
  - truthfulness: 真实性，包含以下子结构：
    - Rating: 评级，类型为字符串。
    - Rationale: 理由，类型为字符串。
    - Rationale For Rating: 评级理由，类型为字符串。
    - Type: 类型，类型为字符串序列。
- average_rating: 平均评级，类型为浮点数。
- critique: 批评，类型为字符串。
- custom_system_prompt: 自定义系统提示，类型为字符串。
- model: 模型，类型为字符串。
- overall_score: 总体得分，类型为浮点数。
- principle: 原则，类型为字符串。
- response: 响应，类型为字符串。
random_response_for_best_overall: 最佳总体模型的随机响应，包含以下结构：
- annotations: 注释，包含以下子结构：
  - helpfulness: 帮助性，包含以下子结构：
    - Rating: 评级，类型为字符串。
    - Rationale: 理由，类型为字符串。
    - Rationale For Rating: 评级理由，类型为字符串。
    - Type: 类型，类型为字符串序列。
  - honesty: 诚实性，包含以下子结构：
    - Rating: 评级，类型为字符串。
    - Rationale: 理由，类型为字符串。
  - instruction_following: 遵循指令，包含以下子结构：
    - Rating: 评级，类型为字符串。
    - Rationale: 理由，类型为字符串。
  - truthfulness: 真实性，包含以下子结构：
    - Rating: 评级，类型为字符串。
    - Rationale: 理由，类型为字符串。
    - Rationale For Rating: 评级理由，类型为字符串。
    - Type: 类型，类型为字符串序列。
- average_rating: 平均评级，类型为浮点数。
- critique: 批评，类型为字符串。
- custom_system_prompt: 自定义系统提示，类型为字符串。
- model: 模型，类型为字符串。
- overall_score: 总体得分，类型为浮点数。
- principle: 原则，类型为字符串。
- response: 响应，类型为字符串。
best_rated_response: 最佳评级响应，包含以下结构：
- annotations: 注释，包含以下子结构：
  - helpfulness: 帮助性，包含以下子结构：
    - Rating: 评级，类型为字符串。
    - Rationale: 理由，类型为字符串。
    - Rationale For Rating: 评级理由，类型为字符串。
    - Type: 类型，类型为字符串序列。
  - honesty: 诚实性，包含以下子结构：
    - Rating: 评级，类型为字符串。
    - Rationale: 理由，类型为字符串。
  - instruction_following: 遵循指令，包含以下子结构：
    - Rating: 评级，类型为字符串。
    - Rationale: 理由，类型为字符串。
  - truthfulness: 真实性，包含以下子结构：
    - Rating: 评级，类型为字符串。
    - Rationale: 理由，类型为字符串。
    - Rationale For Rating: 评级理由，类型为字符串。
    - Type: 类型，类型为字符串序列。
- average_rating: 平均评级，类型为浮点数。
- critique: 批评，类型为字符串。
- custom_system_prompt: 自定义系统提示，类型为字符串。
- model: 模型，类型为字符串。
- overall_score: 总体得分，类型为浮点数。
- principle: 原则，类型为字符串。
- response: 响应，类型为字符串。
random_response_for_best_rated: 最佳评级模型的随机响应，包含以下结构：
- annotations: 注释，包含以下子结构：
  - helpfulness: 帮助性，包含以下子结构：
    - Rating: 评级，类型为字符串。
    - Rationale: 理由，类型为字符串。
    - Rationale For Rating: 评级理由，类型为字符串。
    - Type: 类型，类型为字符串序列。
  - honesty: 诚实性，包含以下子结构：
    - Rating: 评级，类型为字符串。
    - Rationale: 理由，类型为字符串。
  - instruction_following: 遵循指令，包含以下子结构：
    - Rating: 评级，类型为字符串。
    - Rationale: 理由，类型为字符串。
  - truthfulness: 真实性，包含以下子结构：
    - Rating: 评级，类型为字符串。
    - Rationale: 理由，类型为字符串。
    - Rationale For Rating: 评级理由，类型为字符串。
    - Type: 类型，类型为字符串序列。
- average_rating: 平均评级，类型为浮点数。
- critique: 批评，类型为字符串。
- custom_system_prompt: 自定义系统提示，类型为字符串。
- model: 模型，类型为字符串。
- overall_score: 总体得分，类型为浮点数。
- principle: 原则，类型为字符串。
- response: 响应，类型为字符串。
score_random_response_for_best_overall: 最佳总体模型的随机响应得分，类型为浮点数。
score_random_response_for_rated: 评级模型的随机响应得分，类型为浮点数。
completions: 完成，类型为列表，包含以下结构：
- annotations: 注释，包含以下子结构：
  - helpfulness: 帮助性，包含以下子结构：
    - Rating: 评级，类型为字符串。
    - Rationale: 理由，类型为字符串。
    - Rationale For Rating: 评级理由，类型为字符串。
    - Type: 类型，类型为字符串序列。
  - honesty: 诚实性，包含以下子结构：
    - Rating: 评级，类型为字符串。
    - Rationale: 理由，类型为字符串。
  - instruction_following: 遵循指令，包含以下子结构：
    - Rating: 评级，类型为字符串。
    - Rationale: 理由，类型为字符串。
  - truthfulness: 真实性，包含以下子结构：
    - Rating: 评级，类型为字符串。
    - Rationale: 理由，类型为字符串。
    - Rationale For Rating: 评级理由，类型为字符串。
    - Type: 类型，类型为字符串序列。
- average_rating: 平均评级，类型为浮点数。
- critique: 批评，类型为字符串。
- custom_system_prompt: 自定义系统提示，类型为字符串。
- model: 模型，类型为字符串。
- overall_score: 总体得分，类型为浮点数。
- principle: 原则，类型为字符串。
- response: 响应，类型为字符串。
random_response_for_rated: 评级模型的随机响应，类型为浮点数。
rating-distilabel-gpt4: 评级分布，类型为浮点数序列。
rationale-distilabel-gpt4: 理由分布，类型为字符串序列。

数据集拆分

train: 训练集，包含46,976,035字节，2,405个样本。

数据集大小

下载大小: 18,006,660字节
数据集大小: 46,976,035字节

配置

default: 默认配置，包含以下数据文件：
- train: 训练集，路径为data/train-*。

5,000+

优质数据集

54 个

任务类型

进入经典数据集