Gradygu3u/VSI-Super-Wild-Benchmark

Name: Gradygu3u/VSI-Super-Wild-Benchmark
Creator: Gradygu3u
Published: 2026-04-17 08:49:50
License: 暂无描述

Hugging Face2026-04-17 更新2026-04-26 收录

下载链接：

https://hf-mirror.com/datasets/Gradygu3u/VSI-Super-Wild-Benchmark

下载链接

链接失效反馈

官方服务：

资源简介：

--- pretty_name: VSI-Super-Wild license: mit task_categories: - visual-question-answering language: - en tags: - video - benchmark - lmms-eval size_categories: - 10K<n<100K --- # VSI-Super-Wild This repository stores the public video assets and the current lmms-eval benchmark release for **Toward Supersensing**. ## Current benchmark release - Version: `cambw_v2_recheck_20260409_contentfix_mc4` - Location: `benchmarks/cambw_v2_recheck_20260409_contentfix_mc4/` - QA count: `12021` - Split count: - `part1_long`: `511` - `part2_3_short`: `11510` ## Layout - `videos/`: public video files - `benchmarks/cambw_v2_recheck_20260409_contentfix_mc4/data/`: materialized lmms-eval jsonl files - `benchmarks/cambw_v2_recheck_20260409_contentfix_mc4/source/`: source QA json files used to build the benchmark - `benchmarks/cambw_v2_recheck_20260409_contentfix_mc4/manifest.json`: benchmark manifest ## Code The corresponding lmms-eval integration and runner scripts are published in: - GitHub: `Grady10086/Toward-Supersensing` ## Notes - This benchmark release is the corrected content-fix version with balanced 4-way multiple-choice ordering. - Existing model results from older benchmark versions should be backfilled or delta-rerun against this version rather than mixed directly.

提供机构：

Gradygu3u

5,000+

优质数据集

54 个

任务类型

进入经典数据集