when2rl/UltraFeedback_binarized_cleaned_annotated

Name: when2rl/UltraFeedback_binarized_cleaned_annotated
Creator: when2rl
Published: 2024-04-17 00:42:19
License: 暂无描述

Hugging Face2024-04-17 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/when2rl/UltraFeedback_binarized_cleaned_annotated

下载链接

链接失效反馈

官方服务：

资源简介：

--- dataset_info: features: - name: prompt dtype: string - name: prompt_id dtype: string - name: chosen list: - name: content dtype: string - name: role dtype: string - name: rejected list: - name: content dtype: string - name: role dtype: string - name: messages list: - name: content dtype: string - name: role dtype: string - name: score_chosen dtype: float64 - name: score_rejected dtype: float64 - name: other_info struct: - name: chosen_annotations struct: - name: annotations struct: - name: helpfulness struct: - name: Rating dtype: string - name: Rationale dtype: string - name: Rationale For Rating dtype: string - name: Type sequence: string - name: honesty struct: - name: Rating dtype: string - name: Rationale dtype: string - name: instruction_following struct: - name: Rating dtype: string - name: Rationale dtype: string - name: truthfulness struct: - name: Rating dtype: string - name: Rationale dtype: string - name: Rationale For Rating dtype: string - name: Type sequence: string - name: critique dtype: string - name: fine_grained_score dtype: float64 - name: model dtype: string - name: overall_score dtype: float64 - name: correct_answers sequence: string - name: incorrect_answers sequence: string - name: rejected_annotations struct: - name: annotations struct: - name: helpfulness struct: - name: Rating dtype: string - name: Rationale dtype: string - name: Rationale For Rating dtype: string - name: Type sequence: string - name: honesty struct: - name: Rating dtype: string - name: Rationale dtype: string - name: instruction_following struct: - name: Rating dtype: string - name: Rationale dtype: string - name: truthfulness struct: - name: Rating dtype: string - name: Rationale dtype: string - name: Rationale For Rating dtype: string - name: Type sequence: string - name: critique dtype: string - name: fine_grained_score dtype: float64 - name: model dtype: string - name: overall_score dtype: float64 - name: source dtype: string splits: - name: train_prefs num_bytes: 610449160.9601701 num_examples: 60700 - name: test_prefs num_bytes: 19882677.836 num_examples: 1988 download_size: 326664614 dataset_size: 630331838.7961701 configs: - config_name: default data_files: - split: train_prefs path: data/train_prefs-* - split: test_prefs path: data/test_prefs-* --- # Dataset Card for UltraFeedback Binarized, Cleaned, and Annotated  This basically comes from: 1. start from UltraFeedback Binarized 2. recover metadata information such as `source` and `annotations` by matching prompts from the original `UltraFeedback` dataset 3. augment the original dset with metadata information stored in `other_info` 4. *(new)* removed all rows where the `chosen` is the same as `rejected`. This removed 435 rows from the training set, and 12 rows from test set. ## Dataset Details Same usage as `HuggingFaceH4/ultrafeedback_binarized`, but added the `other_info` which contains information such as `source` and `annotations`. ### Dataset Description  - **Curated by:** [More Information Needed] - **Funded by [optional]:** [More Information Needed] - **Shared by [optional]:** [More Information Needed] - **Language(s) (NLP):** [More Information Needed] - **License:** [More Information Needed] ### Dataset Sources [optional]  - **Repository:** [More Information Needed] - **Paper [optional]:** [More Information Needed] - **Demo [optional]:** [More Information Needed] ## Uses  ### Direct Use  [More Information Needed] ### Out-of-Scope Use  [More Information Needed] ## Dataset Structure  [More Information Needed] ## Dataset Creation ### Curation Rationale  [More Information Needed] ### Source Data  #### Data Collection and Processing  [More Information Needed] #### Who are the source data producers?  [More Information Needed] ### Annotations [optional]  #### Annotation process  [More Information Needed] #### Who are the annotators?  [More Information Needed] #### Personal and Sensitive Information  [More Information Needed] ## Bias, Risks, and Limitations  [More Information Needed] ### Recommendations  Users should be made aware of the risks, biases and limitations of the dataset. More information needed for further recommendations. ## Citation [optional]  **BibTeX:** [More Information Needed] **APA:** [More Information Needed] ## Glossary [optional]  [More Information Needed] ## More Information [optional] [More Information Needed] ## Dataset Card Authors [optional] [More Information Needed] ## Dataset Card Contact [More Information Needed]

提供机构：

when2rl

原始信息汇总

数据集概述

数据集信息

特征

prompt: 字符串类型
prompt_id: 字符串类型
chosen: 列表类型，包含以下字段：
- content: 字符串类型
- role: 字符串类型
rejected: 列表类型，包含以下字段：
- content: 字符串类型
- role: 字符串类型
messages: 列表类型，包含以下字段：
- content: 字符串类型
- role: 字符串类型
score_chosen: 浮点数类型
score_rejected: 浮点数类型
other_info: 结构体类型，包含以下字段：
- chosen_annotations: 结构体类型，包含以下字段：
  - annotations: 结构体类型，包含以下字段：
    - helpfulness: 结构体类型，包含以下字段：
      - Rating: 字符串类型
      - Rationale: 字符串类型
      - Rationale For Rating: 字符串类型
      - Type: 字符串序列类型
    - honesty: 结构体类型，包含以下字段：
      - Rating: 字符串类型
      - Rationale: 字符串类型
    - instruction_following: 结构体类型，包含以下字段：
      - Rating: 字符串类型
      - Rationale: 字符串类型
    - truthfulness: 结构体类型，包含以下字段：
      - Rating: 字符串类型
      - Rationale: 字符串类型
      - Rationale For Rating: 字符串类型
      - Type: 字符串序列类型
  - critique: 字符串类型
  - fine_grained_score: 浮点数类型
  - model: 字符串类型
  - overall_score: 浮点数类型
- correct_answers: 字符串序列类型
- incorrect_answers: 字符串序列类型
- rejected_annotations: 结构体类型，包含以下字段：
  - annotations: 结构体类型，包含以下字段：
    - helpfulness: 结构体类型，包含以下字段：
      - Rating: 字符串类型
      - Rationale: 字符串类型
      - Rationale For Rating: 字符串类型
      - Type: 字符串序列类型
    - honesty: 结构体类型，包含以下字段：
      - Rating: 字符串类型
      - Rationale: 字符串类型
    - instruction_following: 结构体类型，包含以下字段：
      - Rating: 字符串类型
      - Rationale: 字符串类型
    - truthfulness: 结构体类型，包含以下字段：
      - Rating: 字符串类型
      - Rationale: 字符串类型
      - Rationale For Rating: 字符串类型
      - Type: 字符串序列类型
  - critique: 字符串类型
  - fine_grained_score: 浮点数类型
  - model: 字符串类型
  - overall_score: 浮点数类型
- source: 字符串类型

数据分割

train_prefs: 包含60700个样本，总字节数为610449160.9601701
test_prefs: 包含1988个样本，总字节数为19882677.836

数据集大小

下载大小: 326664614字节
数据集大小: 630331838.7961701字节

配置

default: 包含以下数据文件：
- train_prefs: 路径为data/train_prefs-*
- test_prefs: 路径为data/test_prefs-*

5,000+

优质数据集

54 个

任务类型

进入经典数据集