thuml/webarena-world-model-cot
收藏Hugging Face2025-05-26 更新2025-07-05 收录
下载链接:
https://hf-mirror.com/datasets/thuml/webarena-world-model-cot
下载链接
链接失效反馈官方服务:
资源简介:
RLVR-World数据集是一个用于强化学习训练世界模型的数据集。它可能包含了用于训练和测试的虚拟环境或场景,以便模型能够通过强化学习来预测和模拟环境中的行为。
RLVR-World dataset is a collection of environments or scenarios for training world models with reinforcement learning. It likely includes data for both training and testing purposes, allowing models to predict and simulate behaviors within the environment through reinforcement learning.
提供机构:
thuml



