allenai/Dolci-DPO-Model-Response-Pool
Source: Hugging Face. Updated 2026-01-05. Indexed 2025-12-20.
Download link:
https://hf-mirror.com/datasets/allenai/Dolci-DPO-Model-Response-Pool
Resource description:
This dataset contains up to 2.5 million responses for each model in the Olmo 3 DPO model pool, totalling about 71 million prompt-response pairs. Prompts are sourced from [allenai/Dolci-Instruct-SFT](https://huggingface.co/datasets/allenai/Dolci-Instruct-SFT), with additional data from [allenai/WildChat](https://huggingface.co/datasets/allenai/WildChat). The dataset is organized into multiple configurations, one per model, and each configuration includes fields such as a unique identifier, the source dataset, the prompt, the model response, and the model name. It is intended for research and educational use and is released under the ODC-BY license.
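Because each configuration holds one model's responses, a common first step when preparing preference data is to collect the responses from different models for the same prompt. The following is a minimal sketch of that grouping, not the dataset authors' pipeline. The key names `id`, `prompt`, `response`, and `model` are assumptions (the description lists the fields but not their exact keys), and the sketch further assumes that the identifier is shared across configurations for the same prompt. The sample rows and model names are invented for illustration.

```python
from collections import defaultdict

# Assumed key names; check the actual dataset schema before relying on them.
ID_KEY, MODEL_KEY, RESPONSE_KEY = "id", "model", "response"


def pool_by_prompt(records):
    """Group rows into {prompt id: {model name: response}}."""
    pools = defaultdict(dict)
    for rec in records:
        pools[rec[ID_KEY]][rec[MODEL_KEY]] = rec[RESPONSE_KEY]
    return dict(pools)


# Invented rows standing in for two model configurations.
sample = [
    {"id": "p1", "prompt": "What is 2 + 2?", "response": "4", "model": "model-a"},
    {"id": "p1", "prompt": "What is 2 + 2?", "response": "Four.", "model": "model-b"},
    {"id": "p2", "prompt": "Name a prime.", "response": "7", "model": "model-a"},
]

pools = pool_by_prompt(sample)

# Prompts answered by at least two models can yield a preference pair.
pairable = sorted(pid for pid, by_model in pools.items() if len(by_model) >= 2)
```

Rows loaded with the Hugging Face `datasets` library could be passed to `pool_by_prompt` in the same way, provided the real column names match the assumed keys.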
Provider:
allenai



