markendo/Visual-Extraction-Tuning-382K
收藏Hugging Face2025-11-25 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/markendo/Visual-Extraction-Tuning-382K
下载链接
链接失效反馈官方服务:
资源简介:
---
language:
- en
task_categories:
- visual-question-answering
- question-answering
- image-text-to-text
---
# Visual Extraction Tuning 382K
This repository contains the generated visual extraction tuning dataset from the paper [Downscaling Intelligence: Exploring Perception and Reasoning Bottlenecks in Small Multimodal Models](https://huggingface.co/papers/2511.17487).
Project page: https://web.stanford.edu/~markendo/projects/downscaling_intelligence
Code: https://github.com/markendo/downscaling_intelligence

## Overview
We provide the 382K examples generated using our visual extraction tuning data generation pipeline.
## Usage
In this repo, we provide json files for each data subset. The instructions for downloading corresponding images is at https://github.com/zackschen/CoIN.
提供机构:
markendo



