five

OpenMMReasoner/OpenMMReasoner-SFT-874K

收藏
Hugging Face2025-12-09 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/OpenMMReasoner/OpenMMReasoner-SFT-874K
下载链接
链接失效反馈
官方服务:
资源简介:
--- task_categories: - image-text-to-text configs: - config_name: llava_cot data_files: - split: train path: parquet/llava_cot.parquet - config_name: OpenVLThinker-sft-iter3 data_files: - split: train path: parquet/OpenVLThinker-sft-iter3.parquet - config_name: WeMath data_files: - split: train path: parquet/WeMath.parquet - config_name: m1_sft data_files: - split: train path: parquet/m1_sft.parquet - config_name: mmr1 data_files: - split: train path: parquet/mmr1_filtered.parquet --- # OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe <div align="center"> [![Data](https://img.shields.io/badge/Data-0040A1?style=for-the-badge&logo=huggingface&logoColor=ffffff&labelColor)](https://huggingface.co/collections/lmms-lab/openmmreasoner) [![Paper](https://img.shields.io/badge/Paper-000000?style=for-the-badge&logo=arxiv&logoColor=white)](https://arxiv.org/abs/2511.16334) [![Project Page](https://img.shields.io/badge/Website-000000?style=for-the-badge&logo=google-chrome&logoColor=white)](https://evolvinglmms-lab.github.io/OpenMMReasoner/) [![Github](https://img.shields.io/badge/Code-000000?style=for-the-badge&logo=github&logoColor=white)](https://github.com/EvolvingLMMs-Lab/OpenMMReasoner) </div> ## Introduction OpenMMReasoner is a dataset introduced in the paper [OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe](https://arxiv.org/abs/2511.16334). It supports the development of multimodal reasoning capabilities, utilizing a fully transparent two-stage recipe spanning supervised fine-tuning (SFT) and reinforcement learning (RL). The dataset includes an 874K-sample cold-start dataset for foundational reasoning and a 74K-sample dataset across diverse domains to further sharpen these abilities. ## Data Preparation and Usage This repository contains the ColdStart Data used to train **[OpenMMReasoner-ColdStart](https://huggingface.co/OpenMMReasoner/OpenMMReasoner-ColdStart)**. We use **[lmms-engine](https://github.com/EvolvingLMMs-Lab/lmms-engine)** as the training framework. To get started, you can download all required datasets from Hugging Face using the provided script: ```bash bash examples/openmmreasoner/download_data.sh [LOCAL_DIR] ``` Replace `[LOCAL_DIR]` with your desired local directory (defaults to `./data`). After downloading, if the dataset includes image archives, unzip them to extract all images into the correct directory: ```bash unzip "image_zip/*.zip" -d <output_folder> ``` Next, prepare a data YAML file to point the training framework to your dataset: ```yaml datasets: - path: /path/to/your/dataset/llava_cot.parquet data_folder: "/path/to/your/dataset/images" data_type: parquet ``` This configuration ensures that `lmms-engine` can correctly locate and load the images. For more detailed instructions and concrete examples on SFT training, RL training, and evaluation, please refer to our [GitHub repository](https://github.com/EvolvingLMMs-Lab/OpenMMReasoner). ## Citation If you find OpenMMReasoner useful for your research and applications, please cite using this BibTeX: ```bibtex @misc{zhang2025openmmreasonerpushingfrontiersmultimodal, title={OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe}, author={Kaichen Zhang and Keming Wu and Zuhao Yang and Bo Li and Kairui Hu and Bin Wang and Ziwei Liu and Xingxuan Li and Lidong Bing}, year={2025}, eprint={2511.16334}, archivePrefix={arXiv}, primaryClass={cs.AI}, url={https://arxiv.org/abs/2511.16334}, } ```

task_categories: - 图像-文本转文本 configs: - config_name: llava_cot data_files: - split: train path: parquet/llava_cot.parquet - config_name: OpenVLThinker-sft-iter3 data_files: - split: train path: parquet/OpenVLThinker-sft-iter3.parquet - config_name: WeMath data_files: - split: train path: parquet/WeMath.parquet - config_name: m1_sft data_files: - split: train path: parquet/m1_sft.parquet - config_name: mmr1 data_files: - split: train path: parquet/mmr1_filtered.parquet --- # OpenMMReasoner:依托开源通用范式,拓展多模态推理前沿 <div align="center"> [![Data](https://img.shields.io/badge/Data-0040A1?style=for-the-badge&logo=huggingface&logoColor=ffffff&labelColor)](https://huggingface.co/collections/lmms-lab/openmmreasoner) [![Paper](https://img.shields.io/badge/Paper-000000?style=for-the-badge&logo=arxiv&logoColor=white)](https://arxiv.org/abs/2511.16334) [![Project Page](https://img.shields.io/badge/Website-000000?style=for-the-badge&logo=google-chrome&logoColor=white)](https://evolvinglmms-lab.github.io/OpenMMReasoner/) [![Github](https://img.shields.io/badge/Code-000000?style=for-the-badge&logo=github&logoColor=white)](https://github.com/EvolvingLMMs-Lab/OpenMMReasoner) </div> ## 引言 OpenMMReasoner是论文《OpenMMReasoner:依托开源通用范式,拓展多模态推理前沿》(https://arxiv.org/abs/2511.16334)中提出的数据集。该数据集旨在赋能多模态推理能力的研发,采用全透明的两阶段范式,涵盖监督微调(Supervised Fine-Tuning, SFT)与强化学习(Reinforcement Learning, RL)。数据集包含87.4万样本的冷启动基础推理数据集,以及7.4万样本的多领域数据集,用于进一步精进模型的推理能力。 ## 数据准备与使用 本仓库包含用于训练**OpenMMReasoner-ColdStart**的冷启动数据集。我们采用**lmms-engine**作为训练框架。 快速上手可通过提供的脚本从Hugging Face下载所有所需数据集: bash bash examples/openmmreasoner/download_data.sh [LOCAL_DIR] 将`[LOCAL_DIR]`替换为你期望的本地目录(默认路径为`./data`)。 下载完成后,若数据集包含图像压缩包,请将其解压至指定目录以提取所有图像: bash unzip "image_zip/*.zip" -d <output_folder> 随后需准备数据YAML配置文件,以指引训练框架定位数据集: yaml datasets: - path: /path/to/your/dataset/llava_cot.parquet data_folder: "/path/to/your/dataset/images" data_type: parquet 该配置可确保`lmms-engine`能够正确定位并加载图像数据。 如需了解监督微调、强化学习训练与评估的详细指南及实操示例,请参阅我们的[GitHub仓库](https://github.com/EvolvingLMMs-Lab/OpenMMReasoner)。 ## 引用 若您的研究与应用中使用了OpenMMReasoner,请引用如下BibTeX条目: bibtex @misc{zhang2025openmmreasonerpushingfrontiersmultimodal, title={OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe}, author={Kaichen Zhang and Keming Wu and Zuhao Yang and Bo Li and Kairui Hu and Bin Wang and Ziwei Liu and Xingxuan Li and Lidong Bing}, year={2025}, eprint={2511.16334}, archivePrefix={arXiv}, primaryClass={cs.AI}, url={https://arxiv.org/abs/2511.16334}, }
提供机构:
OpenMMReasoner
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作