PanoVQA, PanoVQA-mini
收藏数据集概述
数据集基本信息
- 数据集名称:PanoVQA
- 相关论文:More than the Sum: Panorama-Language Models for Adverse Omni-Scenes
- 论文状态:CVPR 2026,预印本发布于arXiv
- arXiv ID:2603.09573
- 论文链接:https://arxiv.org/abs/2603.09573
- PDF链接:https://arxiv.org/pdf/2603.09573
数据集构成与获取
- 主要数据集:PanoVQA
- 精简版数据集:PanoVQA-mini
- 数据来源:数据集包含来自三个子数据集的图像:
- BlendPASS
- DeepAccident
- NuScenes
- 下载方式:
- PanoVQA下载链接:https://drive.google.com/drive/folders/1NOpXK-oR6P4JEm4ewuwkF29xV3kS-zE4?usp=drive_link
- PanoVQA-mini下载链接:https://drive.google.com/drive/folders/1jtoEJtUBpen3OS4G_udl2zODKSKYKT4m?usp=drive_link
- 数据准备:下载后需解压文件。
数据集目录结构
建议的工作空间组织方式如下:
Workspace/ ├── PanoVQA/ │ ├── BlendPASS/ │ ├── DeepAccident/ │ └── NuScenes/ ├── PanoVQA_mini/ │ ├── BlendPASS/ │ ├── DeepAccident/ │ └── NuScenes/ └── Panorama/ └── images/
相关任务与用途
该数据集用于全景-语言建模,旨在处理不利的全场景。具体任务涉及视觉问答,模型需要理解全景图像内容并回答相关问题。
引用信息
如需在学术工作中使用此数据集,请引用以下论文: bibtex @article{fan2026PanoVQA, title={More than the Sum: Panorama-Language Models for Adverse Omni-Scenes}, author={Fan, Weijia and Liu, Ruiping and Wei, Jiale and Chen, Yufan and Zheng, Junwei and Zeng, Zichao and Zhang, Jiaming and Li, Qiufu and Shen, Linlin and Stiefelhagen, Rainer}, journal={arXiv preprint arXiv:2603.09573}, year={2026} }
或 bibtex @article{fan2026PanoVQA, title={More than the Sum: Panorama-Language Models for Adverse Omni-Scenes}, author={Fan, Weijia and Liu, Ruiping and Wei, Jiale and Chen, Yufan and Zheng, Junwei and Zeng, Zichao and Zhang, Jiaming and Li, Qiufu and Shen, Linlin and Stiefelhagen, Rainer}, booktitle={2026 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)}, year={2026} }




