selfrew/llama3_8b_self_gen_n40_packed_reward_tmp0
收藏Hugging Face2024-12-15 更新2024-12-21 收录
下载链接:
https://hf-mirror.com/datasets/selfrew/llama3_8b_self_gen_n40_packed_reward_tmp0
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个特征字段,包括索引(idx)、目标(gt)、提示(prompt)、级别(level)、类型(type)、解决方案(solution)、奖励(rewards)和我的解决方案(my_solu)。数据集主要用于训练,包含5000个示例,总大小为21559392字节。这些字段可能用于某种任务或挑战的解决方案评估和奖励分配。
This dataset includes multiple feature fields such as index (idx), goal (gt), prompt (prompt), level (level), type (type), solution (solution), rewards (rewards), and my solution (my_solu). The dataset is primarily used for training, containing 5000 examples with a total size of 21559392 bytes. These fields are likely used for the evaluation of solutions and allocation of rewards in some task or challenge.
提供机构:
selfrew



