selfcorrexp2/llama3_sft_first_corr_prompt_generation_all_merged
Source: Hugging Face. Updated 2024-12-28. Indexed 2025-04-26.
Download link:
https://hf-mirror.com/datasets/selfcorrexp2/llama3_sft_first_corr_prompt_generation_all_merged
Description:
This dataset includes fields such as idx (index), prompt (prompt), answers (answer sequence), first_round (whether the sample belongs to the first round), gt (ground-truth label), and rewards (reward sequence). It has a single training split of 60,000 samples, with a total size of 336 MB.
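The field list above can be turned into a small schema check. The following is a minimal sketch, not an official loader: the helper names `missing_fields` and `load_train_split` are hypothetical, the repository ID is taken from the title of this listing, and the sketch assumes the Hugging Face `datasets` library is installed and that the split is named "train". Field types are not stated in the description, so the check only looks at field names.

```python
from typing import Any, List, Mapping

# Repository ID as given in the title of this listing.
REPO_ID = "selfcorrexp2/llama3_sft_first_corr_prompt_generation_all_merged"

# Field names listed in the description; value types are not documented there.
EXPECTED_FIELDS = ("idx", "prompt", "answers", "first_round", "gt", "rewards")


def missing_fields(record: Mapping[str, Any]) -> List[str]:
    """Return the expected field names that are absent from a record."""
    return [name for name in EXPECTED_FIELDS if name not in record]


def load_train_split():
    """Load the training split (assumption: it is exposed as "train").

    Requires network access and the third-party `datasets` package,
    so the import is kept local and this function is not called here.
    """
    from datasets import load_dataset

    return load_dataset(REPO_ID, split="train")
```

A typical use would be to call `load_train_split()` once and run `missing_fields` on the first record to confirm that the downloaded data matches the description before building anything on top of it.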
Provider:
selfcorrexp2



