selfrew/llama3_8b_self_gen_n40_packed_reward_tmp10
Source: Hugging Face. Updated: 2024-12-15. Indexed: 2024-12-21.
Download link:
https://hf-mirror.com/datasets/selfrew/llama3_8b_self_gen_n40_packed_reward_tmp10
Description:
The llama3_8b_self_gen_n40_packed_reward_tmp10 dataset contains eight fields: index, ground-truth label, prompt text, difficulty level, type, solution, rewards, and user solution. It has a single training split of 5,000 examples. Each example stores a task prompt with its solutions and the associated reward information, which suggests the data may be intended for reinforcement learning or for training a generative model.
Provider:
selfrew
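
Example:
The description above names the fields only in prose, so the following is a minimal sketch rather than a reference for the dataset's actual schema. It assumes the reward column is called `rewards` and holds either a single number or a list of numbers per example; both the column names and that shape are assumptions to verify against the real data (for instance by inspecting `column_names` after loading). `load_train_split` and `mean_reward` are hypothetical helper names introduced for illustration.

```python
from statistics import mean

# Hypothetical column names: the listing gives the fields only in prose,
# so check the real names before relying on these.
PROMPT_KEY = "prompt"
REWARDS_KEY = "rewards"


def load_train_split():
    """Load the train split from the Hugging Face Hub.

    Needs the `datasets` package and network access; not called below.
    """
    from datasets import load_dataset  # lazy import so the helper below works offline

    return load_dataset(
        "selfrew/llama3_8b_self_gen_n40_packed_reward_tmp10", split="train"
    )


def mean_reward(record):
    """Return a record's mean reward.

    Assumption: the rewards field is either one number or a list of numbers.
    """
    rewards = record[REWARDS_KEY]
    if isinstance(rewards, (int, float)):
        return float(rewards)
    return mean(rewards)


# Toy records in the assumed shape, not real rows from the dataset.
toy = [
    {PROMPT_KEY: "2 + 2 = ?", REWARDS_KEY: [1.0, 0.0, 1.0, 1.0]},
    {PROMPT_KEY: "3 * 3 = ?", REWARDS_KEY: 0.5},
]
toy_means = [mean_reward(r) for r in toy]
```

With the real data, one would call `load_train_split()` and apply `mean_reward` to each row after confirming the column names and the reward type.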



