selfcorrexp2/llama3_sft_2ep_math_first_wrong_prompt
收藏Hugging Face2025-01-06 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/selfcorrexp2/llama3_sft_2ep_math_first_wrong_prompt
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个包含对话和解决方案的集合,适用于机器学习模型训练。数据集中的每个样本都包含一个唯一的索引、一个解决方案字符串、一个标识是否为首选奖励的布尔值、一个目标字符串、一系列消息(包括内容和角色)以及一系列对话(也包括内容和角色)。数据集被分割为训练集,并提供了相关文件的大小信息。
This dataset is a collection of dialogues and solutions, suitable for machine learning model training. Each sample in the dataset contains a unique index, a solution string, a boolean indicating the preferred reward, a target string, a series of messages (including content and role), and a series of conversations (including content and role as well). The dataset is split into a training set, and information about the size of the related files is provided.
提供机构:
selfcorrexp2



