selfcorrexp2/llama3_sft_balanced_rr60k
收藏Hugging Face2024-12-28 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/selfcorrexp2/llama3_sft_balanced_rr60k
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个对话数据集,包含了对话的索引、提示信息、是否为第一轮对话的标记、真实答案、奖励信息、用户的解决方案、是否为正确标记、对话轮数、对话内容(包括内容和角色)、对话难度级别、对话类型、预测答案以及用户提示的对话内容。数据集提供了训练集,并给出了数据集的字节大小和示例数量。
The dataset is a dialogue dataset, which includes dialogue index, prompt information, whether it is the first round of dialogue, ground truth, reward information, users solution, correct flag, dialogue turn, dialogue content (including content and role), dialogue difficulty level, dialogue type, predicted answer, and user prompt dialogue content. The dataset provides a training set and gives the byte size and number of examples of the dataset.
提供机构:
selfcorrexp2



