LeoFan01/RoboBench
收藏Hugging Face2025-10-24 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/LeoFan01/RoboBench
下载链接
链接失效反馈官方服务:
资源简介:
RoboBench是一个全面的评估基准,旨在评估多模态大型语言模型在具身智能任务中的能力。该基准提供了一个系统的框架,用于评估这些模型在理解机器人场景方面的表现。数据集包含数千个精心策划的例子,支持文本、图像和视频数据,特别适合机器人应用。
RoboBench is a comprehensive evaluation benchmark designed to assess the capabilities of Multimodal Large Language Models (MLLMs) in embodied intelligence tasks. This benchmark provides a systematic framework for evaluating how well these models can understand and reason about robotic scenarios, containing thousands of carefully curated examples, supporting text, images, and video data, specifically tailored for robotic applications.
提供机构:
LeoFan01



