opencsg/autohub-benchmark

Name: opencsg/autohub-benchmark
Creator: opencsg
Published: 2025-03-09 06:19:13
License: No description available

Source: Hugging Face. Updated 2025-03-09; indexed 2025-04-12.

Download link:

https://hf-mirror.com/datasets/opencsg/autohub-benchmark

Resource description:

The autohub-benchmark project evaluates the localization performance of vision-language models (VLMs) in one specialized scenario: web-based hosting platforms for code, models, and datasets. It provides evaluation results for different models on different platforms, reported with four metrics: Accuracy, Error, Invalid, and Completion Rate.

Provided by:

opencsg
