yuyq96/R1-Vision-PixMo-Cap-QA

Name: yuyq96/R1-Vision-PixMo-Cap-QA
Creator: yuyq96
Published: 2025-02-08 08:48:29
License: 暂无描述

Hugging Face2025-02-08 更新2025-02-15 收录

下载链接：

https://hf-mirror.com/datasets/yuyq96/R1-Vision-PixMo-Cap-QA

下载链接

链接失效反馈

官方服务：

资源简介：

R1-Vision项目使用的数据集包括文本数据、文本渲染数据和多模态数据。文本数据来自Bespoke-Stratos-17k数据集，文本渲染数据同样来自Bespoke-Stratos-17k，经过重格式化和图像渲染处理。多模态数据来自AI2D、ScienceQA和PixMo-Cap-QA数据集。这些数据集用于训练一个能够处理文本和图像的双模态模型。

The R1-Vision project uses datasets that include text data, text rendering data, and multimodal data. The text data is from the Bespoke-Stratos-17k dataset, the text rendering data is also from Bespoke-Stratos-17k after reformatting and image rendering, and the multimodal data is from the AI2D, ScienceQA, and PixMo-Cap-QA datasets. These datasets are used to train a bimodal model capable of processing both text and images.

提供机构：

yuyq96

5,000+

优质数据集

54 个

任务类型

进入经典数据集