when2rl/OpenHermesPreferences_reformatted

Name: when2rl/OpenHermesPreferences_reformatted
Creator: when2rl
Published: 2024-04-17 02:00:53
License: 暂无描述

Hugging Face2024-04-17 更新2024-06-12 收录

下载链接：

https://hf-mirror.com/datasets/when2rl/OpenHermesPreferences_reformatted

下载链接

链接失效反馈

官方服务：

资源简介：

--- dataset_info: features: - name: prompt dtype: string - name: prompt_id dtype: string - name: chosen list: - name: content dtype: string - name: role dtype: string - name: rejected list: - name: content dtype: string - name: role dtype: string - name: messages list: - name: content dtype: string - name: role dtype: string - name: score_chosen dtype: float64 - name: score_rejected dtype: float64 - name: other_info struct: - name: candidate_policies sequence: string - name: candidates_completions sequence: string - name: category dtype: string - name: chosen_policy dtype: string - name: chosen_rank dtype: int64 - name: rank_str dtype: string - name: ranks sequence: int64 - name: rejected_policy dtype: string - name: rejected_rank dtype: int64 - name: source dtype: string splits: - name: train num_bytes: 9039102268.48677 num_examples: 985186 download_size: 4418340168 dataset_size: 9039102268.48677 configs: - config_name: default data_files: - split: train path: data/train-* --- # Dataset Card for OpenHermesPreferences_reformatted  This is a reformatted version of argilla's OpenHermesPreference: 1. reformatted the daatset to be consistent with ultrafeedback_binarized. 2. *(new)* removed all rows where the `chosen` is the same as `rejected`. This removed 4214 rows from the training set. Note that the `score_chosen` and `score_rejected` are dummy scores, since the original dataset did not provide any scoring. ## Dataset Details ### Dataset Description  - **Curated by:** [More Information Needed] - **Funded by [optional]:** [More Information Needed] - **Shared by [optional]:** [More Information Needed] - **Language(s) (NLP):** [More Information Needed] - **License:** [More Information Needed] ### Dataset Sources [optional]  - **Repository:** [More Information Needed] - **Paper [optional]:** [More Information Needed] - **Demo [optional]:** [More Information Needed] ## Uses  ### Direct Use  [More Information Needed] ### Out-of-Scope Use  [More Information Needed] ## Dataset Structure  [More Information Needed] ## Dataset Creation ### Curation Rationale  [More Information Needed] ### Source Data  #### Data Collection and Processing  [More Information Needed] #### Who are the source data producers?  [More Information Needed] ### Annotations [optional]  #### Annotation process  [More Information Needed] #### Who are the annotators?  [More Information Needed] #### Personal and Sensitive Information  [More Information Needed] ## Bias, Risks, and Limitations  [More Information Needed] ### Recommendations  Users should be made aware of the risks, biases and limitations of the dataset. More information needed for further recommendations. ## Citation [optional]  **BibTeX:** [More Information Needed] **APA:** [More Information Needed] ## Glossary [optional]  [More Information Needed] ## More Information [optional] [More Information Needed] ## Dataset Card Authors [optional] [More Information Needed] ## Dataset Card Contact [More Information Needed]

提供机构：

when2rl

原始信息汇总

数据集概述

数据集名称

名称: OpenHermesPreferences_reformatted

数据集修改信息

修改内容:
- 格式化数据集以与ultrafeedback_binarized保持一致。
- 移除所有chosen与rejected相同的行，共移除4214行。

数据集特征

特征列表:
- prompt: 字符串类型
- prompt_id: 字符串类型
- chosen: 列表类型，包含content和role，均为字符串类型
- rejected: 列表类型，包含content和role，均为字符串类型
- messages: 列表类型，包含content和role，均为字符串类型
- score_chosen: 浮点数类型
- score_rejected: 浮点数类型
- other_info: 结构类型，包含多个字段如candidate_policies, candidates_completions, category, chosen_policy, chosen_rank, rank_str, ranks, rejected_policy, rejected_rank, source等，类型包括字符串、整数和序列。

数据集大小

下载大小: 4418340168字节
数据集大小: 9039102268.48677字节

数据集分割

分割信息:
- train: 985186个示例，大小为9039102268.48677字节

数据集配置

配置名称: default
数据文件:
- train: 路径为data/train-*

5,000+

优质数据集

54 个

任务类型

进入经典数据集