
TIGER-Lab/VisualWebInstruct

Source: Hugging Face. Updated: 2026-02-01. Indexed: 2024-06-12.
Download link:
https://hf-mirror.com/datasets/TIGER-Lab/VisualWebInstruct
Description:
VisualWebInstruct is a large-scale, diverse multimodal instruction dataset containing approximately 900K question-answer pairs, designed to improve the reasoning capabilities of vision-language models. It covers multiple disciplines, including mathematics, physics, finance, chemistry, and engineering. The data is carefully curated and filtered, with an emphasis on multi-step reasoning tasks rather than simple perception-based questions.
Provided by:
TIGER-Lab