simonycl/self-play-sft-llama3-Qwen-Qwen3-32B-52k
收藏Hugging Face2025-11-13 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/simonycl/self-play-sft-llama3-Qwen-Qwen3-32B-52k
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含对话和元数据信息的数据集。对话部分包含内容和角色信息,元数据部分包含行为、环境ID、剧集ID、最终奖励、模型名称、玩家ID、步骤和模板类型等信息。数据集分为训练集,包含50997个示例,数据大小为718MB。
This is a dataset containing conversation and metadata information. The conversation part includes content and role information, while the metadata part includes action, environment ID, episode ID, final reward, model name, player ID, step, and template type, etc. The dataset is split into a training set, containing 50997 examples, with a total data size of 718MB.
提供机构:
simonycl



