
pixmo-cap-qa

ModelScope community. Updated 2025-12-05. Indexed 2025-02-15.
Download link:
https://modelscope.cn/datasets/allenai/pixmo-cap-qa
Description:
# PixMo-CapQA

PixMo-CapQA is a synthetic dataset of question/answer pairs about images. The data was generated by using the [Claude](https://www.anthropic.com/claude) large language model to build Q/A pairs from [dense captions of images](https://huggingface.co/datasets/allenai/pixmo-cap) (the model did not see the actual images).

PixMo-CapQA is part of the [PixMo dataset collection](https://huggingface.co/collections/allenai/pixmo-674746ea613028006285687b) and was used to train the [Molmo family of models](https://huggingface.co/collections/allenai/molmo-66f379e6fe3b8ef090a8ca19).

Quick links:
- 📃 [Paper](https://molmo.allenai.org/paper.pdf)
- 🎥 [Blog with Videos](https://molmo.allenai.org/blog)

## Loading

```python
import datasets

# Load the training split from the Hugging Face Hub
data = datasets.load_dataset("allenai/pixmo-cap-qa", split="train")
```

## Data Format

Images are stored as URLs that need to be downloaded separately. Image URLs can repeat, since many images have multiple Q/A pairs.

- The `question` field contains the input text, including "[USER]" and "[ASSISTANT]" tags.
- The `answer` field contains the final target output text.
- The `messages` field contains the same data in a list-of-messages format. The first message is from the user, and subsequent messages alternate between user and assistant. This text does not contain "[USER]" and "[ASSISTANT]" tags.

## License

This dataset is licensed under ODC-BY-1.0. It is intended for research and educational use in accordance with Ai2's [Responsible Use Guidelines](https://allenai.org/responsible-use). This dataset includes data generated from Claude, which is subject to Anthropic's [terms of service](https://www.anthropic.com/legal/commercial-terms) and [usage policy](https://www.anthropic.com/legal/aup).
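## Example: working with the fields

The Data Format notes mention repeated image URLs and a tagged `question` string. The following is a minimal sketch of how one might handle both. It makes two assumptions that the card does not confirm: the column holding the URL is called `image_url`, and each "[USER]" or "[ASSISTANT]" tag directly precedes that speaker's text. The helper names `split_tagged_question` and `group_by_image` are hypothetical and not part of any library.

```python
import re
from collections import defaultdict

# Assumption: the tags appear literally as "[USER]" and "[ASSISTANT]",
# and each tag introduces the text of that speaker's turn.
TAG_PATTERN = re.compile(r"\[(USER|ASSISTANT)\]")


def split_tagged_question(question):
    """Split a tagged string into a list of {"role", "content"} dicts."""
    # re.split with one capture group yields
    # [prefix, tag, text, tag, text, ...].
    parts = TAG_PATTERN.split(question)
    messages = []
    for tag, text in zip(parts[1::2], parts[2::2]):
        text = text.strip()
        # A trailing tag with no text (for example a final "[ASSISTANT]")
        # is skipped, since it only marks where the answer would go.
        if text:
            messages.append({"role": tag.lower(), "content": text})
    return messages


def group_by_image(rows, url_key="image_url"):
    """Map each image URL to the rows that reference it.

    Assumption: `url_key` names the column holding the image URL.
    Grouping first lets each distinct image be downloaded only once.
    """
    grouped = defaultdict(list)
    for row in rows:
        grouped[row[url_key]].append(row)
    return dict(grouped)
```

Both helpers accept plain Python values, so they can be tried on a handful of hand-written rows before being applied to the loaded dataset. If the real column name or tag layout differs, only `url_key` and `TAG_PATTERN` should need adjusting.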

Provider:
maas
Created:
2025-05-29