Modified dpo-mix-7k dataset
收藏arXiv2025-09-30 收录
下载链接:
https://huggingface.co/datasets/argilla/dpo-mix-7k
下载链接
链接失效反馈官方服务:
资源简介:
该数据集用于对gemma-7b模型进行微调,其中一半的回复被替换为来自gemma-2b的低质量回复。此外,该数据集模拟了训练过程中遇到的各类数据集质量参差不齐的挑战。这项任务的目的是对大型语言模型进行微调。
This dataset is intended for fine-tuning the gemma-7b model, with half of its responses replaced by low-quality outputs generated by gemma-2b. Additionally, this dataset simulates a range of challenges arising from uneven data quality encountered during the model training process. The objective of this task is to fine-tune large language models.
提供机构:
Authors of the paper



