selfcorrexp2/llama3_sft_first_wrong_processed_old_format
收藏Hugging Face2024-12-28 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/selfcorrexp2/llama3_sft_first_wrong_processed_old_format
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含对话和回答的数据集,其中有索引、提示、是否为第一轮对话、目标、奖励、解决方案、标志、轮数等信息。数据集分为训练集,提供了字节数和示例数。
This dataset contains dialogues and responses, including information such as index, prompt, whether it is the first round of conversation, target, reward, solution, flag, round number, etc. The dataset is split into a training set, providing the number of bytes and examples.
提供机构:
selfcorrexp2



