VizWiz-Caps
收藏魔搭社区2025-11-12 更新2024-10-12 收录
下载链接:
https://modelscope.cn/datasets/lmms-lab/VizWiz-Caps
下载链接
链接失效反馈官方服务:
资源简介:
# Dataset Card for "VizWiz-Caps"
<p align="center" width="100%">
<img src="https://i.postimg.cc/g0QRgMVv/WX20240228-113337-2x.png" width="100%" height="80%">
</p>
# Large-scale Multi-modality Models Evaluation Suite
> Accelerating the development of large-scale multi-modality models (LMMs) with `lmms-eval`
🏠 [Homepage](https://lmms-lab.github.io/) | 📚 [Documentation](docs/README.md) | 🤗 [Huggingface Datasets](https://huggingface.co/lmms-lab)
# This Dataset
This is a formatted version of [VizWiz-Caps](https://arxiv.org/abs/2002.08565v2). It is used in our `lmms-eval` pipeline to allow for one-click evaluations of large multi-modality models.
```
@inproceedings{gurari2020captioning,
title={Captioning images taken by people who are blind},
author={Gurari, Danna and Zhao, Yinan and Zhang, Meng and Bhattacharya, Nilavra},
booktitle={Computer Vision--ECCV 2020: 16th European Conference, Glasgow, UK, August 23--28, 2020, Proceedings, Part XVII 16},
pages={417--434},
year={2020},
organization={Springer}
}
```
# 「VizWiz-Caps」数据集卡片
<p align="center" width="100%">
<img src="https://i.postimg.cc/g0QRgMVv/WX20240228-113337-2x.png" width="100%" height="80%">
</p>
# 大规模多模态模型评测套件
> 通过`lmms-eval`加速大规模多模态模型(Large-scale Multi-modality Models, LMMs)的研发
🏠 [主页](https://lmms-lab.github.io/) | 📚 [文档](docs/README.md) | 🤗 [Huggingface数据集平台](https://huggingface.co/lmms-lab)
# 本数据集
本数据集是[VizWiz-Caps](https://arxiv.org/abs/2002.08565v2)的格式化版本,被应用于我们的`lmms-eval`流水线中,以实现大规模多模态模型的一键评测。
@inproceedings{gurari2020captioning,
title={视障人群拍摄图像的字幕生成},
author={Gurari, Danna and Zhao, Yinan and Zhang, Meng and Bhattacharya, Nilavra},
booktitle={《计算机视觉——2020年第16届欧洲计算机视觉会议(ECCV 2020)论文集》,英国格拉斯哥,2020年8月23日至28日,第XVII卷 第16部分},
pages={417--434},
year={2020},
organization={施普林格(Springer)}
}
提供机构:
maas
创建时间:
2024-10-07



