HumanLLMs/Human-Like-DPO-Dataset
收藏Hugging Face2026-01-15 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/HumanLLMs/Human-Like-DPO-Dataset
下载链接
链接失效反馈官方服务:
资源简介:
Human-Like-DPO-Dataset是一个包含10,884个样本,涵盖256个主题的数据集,旨在通过Direct Preference Optimization(DPO)等格式,帮助大型语言模型生成更类似于人类的对话响应。每个样本包含一个自然对话式的问题和回答,以及一个正式的AI式回答。
The Human-Like-DPO-Dataset consists of 10,884 samples across 256 topics, designed to assist large language models in generating more human-like conversational responses through formats like Direct Preference Optimization (DPO). Each sample includes a natural conversational question and answer, as well as a formal AI-style response.
提供机构:
HumanLLMs



