OneThinker-train-data

Name: OneThinker-train-data
Creator: maas
Published: 2025-12-22 23:16:43
License: 暂无描述

魔搭社区2025-12-22 更新2025-12-20 收录

下载链接：

https://modelscope.cn/datasets/OneThink/OneThinker-train-data

下载链接

链接失效反馈

官方服务：

资源简介：

# OneThinker-600k Training Data This repository contains the training data for **OneThinker**, an all-in-one reasoning model for image and video, as presented in the paper [OneThinker: All-in-one Reasoning Model for Image and Video](https://arxiv.org/abs/2512.03043). **Code**: [https://github.com/tulerfeng/OneThinker](https://github.com/tulerfeng/OneThinker) <div align="center"> <img src="https://github.com/tulerfeng/OneThinker/blob/main/assets/teaser.png?raw=true" alt="OneThinker teaser" width="95%"> </div> ## About the OneThinker Dataset **OneThinker-600k** is a large-scale multi-task training corpus designed to train `OneThinker`, an all-in-one multimodal reasoning model capable of understanding images and videos across diverse fundamental visual tasks. This corpus includes **OneThinker-SFT-340k**, which features high-quality Chain-of-Thought (CoT) annotations produced by a strong proprietary model (Seed1.5-VL) for effective Supervised Fine-Tuning (SFT) cold start. <div align="center"> <img src="https://github.com/tulerfeng/OneThinker/blob/main/assets/dataset.png?raw=true" alt="OneThinker dataset" width="95%"> </div> The dataset covers both image and video modalities and spans a series of fundamental visual reasoning tasks, including: * Rule-based Question Answering (QA) * Open-ended Question Answering (QA) * Captioning * Spatial Grounding * Temporal Grounding * Spatio-Temporal Grounding * Tracking * Segmentation ## Dataset Files The OneThinker training data consists of several JSON files tailored for different training stages: * `onethinker_rl_train.json`: Used for Reinforcement Learning (RL) training. * `onethinker_sft_image.json`: Used for Supervised Fine-Tuning (SFT) cold start on image data. * `onethinker_sft_video.json`: Used for Supervised Fine-Tuning (SFT) cold start on video data. Files ending with `_unsampled` represent the full, unsampled versions of these datasets. ## Citations If you find our work helpful for your research, please consider citing our work: ```bibtex @article{feng2025onethinker, title={OneThinker: All-in-one Reasoning Model for Image and Video}, author={Feng, Kaituo and Zhang, Manyuan and Li, Hongyu and Fan, Kaixuan and Chen, Shuang and Jiang, Yilei and Zheng, Dian and Sun, Peiwen and Zhang, Yiyuan and Sun, Haoze and others}, journal={arXiv preprint arXiv:2512.03043}, year={2025} } ```

# OneThinker-600k 训练数据集本仓库包含**OneThinker**的训练数据，该模型是一款面向图像与视频的全功能推理模型，相关研究成果发表于论文《OneThinker: All-in-one Reasoning Model for Image and Video》（https://arxiv.org/abs/2512.03043）。 **代码**：[https://github.com/tulerfeng/OneThinker](https://github.com/tulerfeng/OneThinker) <div align="center"> <img src="https://github.com/tulerfeng/OneThinker/blob/main/assets/teaser.png?raw=true" alt="OneThinker 概览图" width="95%"> </div> ## OneThinker 数据集简介 **OneThinker-600k**是一款大规模多任务训练语料库，用于训练`OneThinker`——一款可适配多样化基础视觉任务的图像与视频全模态推理模型。该语料库包含**OneThinker-SFT-340k**，其内置由自研高性能模型（Seed1.5-VL）生成的高质量思维链（Chain-of-Thought, CoT）标注数据，可用于高效的监督微调（Supervised Fine-Tuning, SFT）冷启动训练。 <div align="center"> <img src="https://github.com/tulerfeng/OneThinker/blob/main/assets/dataset.png?raw=true" alt="OneThinker 数据集结构" width="95%"> </div> 本数据集涵盖图像与视频两种模态，覆盖一系列基础视觉推理任务，包括： * 基于规则的问答（QA） * 开放式问答（QA） * 字幕生成 * 空间定位 * 时序定位 * 时空定位 * 目标跟踪 * 语义分割 ## 数据集文件说明 OneThinker训练数据包含多个针对不同训练阶段定制的JSON格式文件： * `onethinker_rl_train.json`：用于强化学习（Reinforcement Learning, RL）训练。 * `onethinker_sft_image.json`：用于图像数据的监督微调（Supervised Fine-Tuning, SFT）冷启动训练。 * `onethinker_sft_video.json`：用于视频数据的监督微调（Supervised Fine-Tuning, SFT）冷启动训练。文件名后缀为`_unsampled`的文件代表对应数据集的完整未采样版本。 ## 引用方式若您的研究工作中用到了本项目，请引用如下论文： bibtex @article{feng2025onethinker, title={OneThinker: All-in-one Reasoning Model for Image and Video}, author={Feng, Kaituo and Zhang, Manyuan and Li, Hongyu and Fan, Kaixuan and Chen, Shuang and Jiang, Yilei and Zheng, Dian and Sun, Peiwen and Zhang, Yiyuan and Sun, Haoze and others}, journal={arXiv preprint arXiv:2512.03043}, year={2025} }

提供机构：

maas

创建时间：

2025-12-11

5,000+

优质数据集

54 个

任务类型

进入经典数据集