mfirth/sciworld-rollout-dataset-llama-3.2-3b-instruct
收藏Hugging Face2025-10-18 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/mfirth/sciworld-rollout-dataset-llama-3.2-3b-instruct
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含任务名称、任务描述、轨迹、奖励、摘要、问答等字段的数据集。轨迹记录了动作、库存和观察等信息,摘要和问答部分包含了内容和角色信息。数据集分为训练集,提供了详细的大小和示例数量信息。
This dataset includes fields such as task name, task description, trajectory, reward, summarization, QA, rtl, and rb. The trajectory, summarization, and QA fields contain sub-fields for content and role. The dataset is split into a training set, with detailed information on size and number of examples provided.
提供机构:
mfirth



