Modified dpo-mix-7k dataset

Name: Modified dpo-mix-7k dataset
Creator: Authors of the paper
License: 暂无描述

arXiv2025-09-30 收录

下载链接：

https://huggingface.co/datasets/argilla/dpo-mix-7k

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集用于对gemma-7b模型进行微调，其中一半的回复被替换为来自gemma-2b的低质量回复。此外，该数据集模拟了训练过程中遇到的各类数据集质量参差不齐的挑战。这项任务的目的是对大型语言模型进行微调。

This dataset is intended for fine-tuning the gemma-7b model, with half of its responses replaced by low-quality outputs generated by gemma-2b. Additionally, this dataset simulates a range of challenges arising from uneven data quality encountered during the model training process. The objective of this task is to fine-tune large language models.

提供机构：

Authors of the paper

5,000+

优质数据集

54 个

任务类型

进入经典数据集