Gradygu3u/VSI-Super-Wild-Benchmark
收藏Hugging Face2026-04-17 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/Gradygu3u/VSI-Super-Wild-Benchmark
下载链接
链接失效反馈官方服务:
资源简介:
---
pretty_name: VSI-Super-Wild
license: mit
task_categories:
- visual-question-answering
language:
- en
tags:
- video
- benchmark
- lmms-eval
size_categories:
- 10K<n<100K
---
# VSI-Super-Wild
This repository stores the public video assets and the current lmms-eval benchmark release for **Toward Supersensing**.
## Current benchmark release
- Version: `cambw_v2_recheck_20260409_contentfix_mc4`
- Location: `benchmarks/cambw_v2_recheck_20260409_contentfix_mc4/`
- QA count: `12021`
- Split count:
- `part1_long`: `511`
- `part2_3_short`: `11510`
## Layout
- `videos/`: public video files
- `benchmarks/cambw_v2_recheck_20260409_contentfix_mc4/data/`: materialized lmms-eval jsonl files
- `benchmarks/cambw_v2_recheck_20260409_contentfix_mc4/source/`: source QA json files used to build the benchmark
- `benchmarks/cambw_v2_recheck_20260409_contentfix_mc4/manifest.json`: benchmark manifest
## Code
The corresponding lmms-eval integration and runner scripts are published in:
- GitHub: `Grady10086/Toward-Supersensing`
## Notes
- This benchmark release is the corrected content-fix version with balanced 4-way multiple-choice ordering.
- Existing model results from older benchmark versions should be backfilled or delta-rerun against this version rather than mixed directly.
提供机构:
Gradygu3u



