sl-alex/openai-prm800k-stepwise-critic

Name: sl-alex/openai-prm800k-stepwise-critic
Creator: sl-alex
Published: 2023-07-12 16:00:16
License: 暂无描述

Hugging Face2023-07-12 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/sl-alex/openai-prm800k-stepwise-critic

下载链接

链接失效反馈

官方服务：

资源简介：

--- license: mit --- Denormalized dataset created by processing OpenAI's [PRM800K](https://github.com/openai/prm800k/tree/main) process supervision dataset via [prm800k-denorm](https://github.com/scottlogic-alex/prm800k-denorm). We include every conversation turn (i.e. "what's been said so far" + "the next step in the conversation"), good and bad. Plus the human evaluator's rating of whether it was a good or bad response. You could use this for training a classifier. Dataset description and usage instructions in [prm800k-denorm README](https://github.com/scottlogic-alex/prm800k-denorm/blob/main/README.md).

提供机构：

sl-alex

原始信息汇总

数据集概述

数据来源

该数据集是通过处理OpenAI的PRM800K过程监督数据集，使用prm800k-denorm工具创建的非规范化数据集。

数据内容

数据集包含每个对话轮次（即“到目前为止所说的话”和“对话的下一步”），包括好的和坏的对话。
还包括人类评估者对响应是好是坏的评级。

数据用途

该数据集可用于训练分类器。

详细描述和使用说明

数据集的详细描述和使用说明可在prm800k-denorm README中找到。

5,000+

优质数据集

54 个

任务类型

进入经典数据集