btrabucco/insta-150k-v2-grpo-n1
收藏Hugging Face2025-04-09 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/btrabucco/insta-150k-v2-grpo-n1
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含输入提示、输出和成功率的训练数据集。输入提示分为内容和角色两部分,输出是模型的响应,成功率表示模型响应的正确性。数据集仅包含训练集,共有8758个示例。
This is a training dataset containing input prompts, outputs, and success rates. Input prompts are divided into content and role parts, the output is the models response, and the success rate indicates the correctness of the models response. The dataset only contains a training set with a total of 8758 examples.
提供机构:
btrabucco



