alvarobartt/openhermes-preferences-metamath
收藏Hugging Face2024-02-29 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/alvarobartt/openhermes-preferences-metamath
下载链接
链接失效反馈官方服务:
资源简介:
---
license: other
task_categories:
- text-generation
language:
- en
source_datasets:
- argilla/OpenHermesPreferences
annotations_creators:
- Argilla
- HuggingFaceH4
tags:
- dpo
- synthetic
- metamath
size_categories:
- 10K<n<100K
dataset_info:
features:
- name: chosen
list:
- name: content
dtype: string
- name: role
dtype: string
- name: rejected
list:
- name: content
dtype: string
- name: role
dtype: string
splits:
- name: train
num_bytes: 169676613.83305642
num_examples: 50799
- name: test
num_bytes: 18855183.863611557
num_examples: 5645
download_size: 44064373
dataset_size: 188531797.69666797
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: test
path: data/test-*
---
# Dataset Card for OpenHermes Preferences - MetaMath
This dataset is a subset from [`argilla/OpenHermesPreferences`](https://hf.co/datasets/argilla/OpenHermesPreferences),
only keeping the preferences of `metamath`, and removing all the columns besides the `chosen` and `rejected` ones, that
come in OpenAI chat formatting, so that's easier to fine-tune a model using tools like: [`huggingface/alignment-handbook`](https://github.com/huggingface/alignment-handbook)
or [`axolotl`](https://github.com/OpenAccess-AI-Collective/axolotl), among others.
## Reference
[`argilla/OpenHermesPreferences`](https://hf.co/datasets/argilla/OpenHermesPreferences) dataset created as a collaborative
effort between Argilla and the HuggingFaceH4 team from HuggingFace.
提供机构:
alvarobartt
原始信息汇总
数据集卡片 - OpenHermes Preferences - MetaMath
数据集概述
- 许可证: other
- 任务类别: 文本生成
- 语言: 英语
- 源数据集:
argilla/OpenHermesPreferences - 标注创建者: Argilla, HuggingFaceH4
- 标签: dpo, synthetic, metamath
- 大小类别: 10K<n<100K
数据集信息
- 特征:
- chosen:
- content: string
- role: string
- rejected:
- content: string
- role: string
- chosen:
- 分割:
- train:
- 字节数: 169676613.83305642
- 示例数: 50799
- test:
- 字节数: 18855183.863611557
- 示例数: 5645
- train:
- 下载大小: 44064373
- 数据集大小: 188531797.69666797
配置
- 配置名称: default
- 数据文件:
- train: data/train-*
- test: data/test-*
- 数据文件:



