five

Gradygu3u/VSI-Super-Wild-Benchmark

收藏
Hugging Face2026-04-17 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/Gradygu3u/VSI-Super-Wild-Benchmark
下载链接
链接失效反馈
官方服务:
资源简介:
--- pretty_name: VSI-Super-Wild license: mit task_categories: - visual-question-answering language: - en tags: - video - benchmark - lmms-eval size_categories: - 10K<n<100K --- # VSI-Super-Wild This repository stores the public video assets and the current lmms-eval benchmark release for **Toward Supersensing**. ## Current benchmark release - Version: `cambw_v2_recheck_20260409_contentfix_mc4` - Location: `benchmarks/cambw_v2_recheck_20260409_contentfix_mc4/` - QA count: `12021` - Split count: - `part1_long`: `511` - `part2_3_short`: `11510` ## Layout - `videos/`: public video files - `benchmarks/cambw_v2_recheck_20260409_contentfix_mc4/data/`: materialized lmms-eval jsonl files - `benchmarks/cambw_v2_recheck_20260409_contentfix_mc4/source/`: source QA json files used to build the benchmark - `benchmarks/cambw_v2_recheck_20260409_contentfix_mc4/manifest.json`: benchmark manifest ## Code The corresponding lmms-eval integration and runner scripts are published in: - GitHub: `Grady10086/Toward-Supersensing` ## Notes - This benchmark release is the corrected content-fix version with balanced 4-way multiple-choice ordering. - Existing model results from older benchmark versions should be backfilled or delta-rerun against this version rather than mixed directly.
提供机构:
Gradygu3u
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作