mfirth/sciworld-rollout-dataset
收藏Hugging Face2025-10-16 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/mfirth/sciworld-rollout-dataset
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含用户轨迹、奖励、摘要和问答等信息的数据集。具体包含动作、库存、观察、摘要内容、摘要角色、问答内容、问答角色、rtl和rb等字段。数据集分为训练集,包含3个示例,总共19831字节。
This dataset includes user trajectories, rewards, summaries, and QA information. It specifically contains fields for actions, inventory, observations, summary content, summary roles, QA content, QA roles, rtl, and rb. The dataset is split into a training set with 3 examples, totaling 19831 bytes.
提供机构:
mfirth



