
OrionLLM/OpenMixedReasoning

Hugging Face | Updated 2026-03-16 | Indexed 2026-03-29
Download link:
https://hf-mirror.com/datasets/OrionLLM/OpenMixedReasoning
Resource description:
---
license: apache-2.0
tags:
- biology
- medical
- code
- agent
size_categories:
- 100K<n<1M
---

![OpenMixedReasoning](https://cdn-uploads.huggingface.co/production/uploads/685ea8ff7b4139b6845ce395/0Q1vkXJBUGLGPL-Od-Z3M.png)

# OpenMixedReasoning

**OpenMixedReasoning** is a large synthetic reasoning dataset for **general reasoning tasks**, containing **~607k examples**. It is designed for **supervised fine-tuning (SFT)** and focuses heavily on reasoning-rich data across multiple domains.

## Dataset Composition

OpenMixedReasoning is composed of the following domains:

- **93.5% Code**
- **3.3% Medical**
- **3.2% Math**

This distribution makes the dataset especially useful for training models that need strong **code reasoning**, while still benefiting from additional **medical** and **mathematical** reasoning capabilities.

## Source Datasets

OpenMixedReasoning is a merged dataset built from the following sources:

- [OrionLLM/OpenMedicalReasoning](https://huggingface.co/datasets/OrionLLM/OpenMedicalReasoning)
- [nvidia/OpenCodeReasoning](https://huggingface.co/datasets/nvidia/OpenCodeReasoning)
- [unsloth/OpenMathReasoning-mini](https://huggingface.co/datasets/unsloth/OpenMathReasoning-mini)

---

OpenMixedReasoning is intended as a practical mixed-domain reasoning dataset for researchers and builders working on compact and capable reasoning models.
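
To make the composition figures concrete, the stated percentages can be converted into rough per-domain example counts. This is a minimal sketch, not part of the dataset card: it assumes a total of exactly 607,000 examples (the card only says "~607k"), and the names `TOTAL_EXAMPLES`, `COMPOSITION`, and `estimate_domain_counts` are hypothetical helpers introduced for illustration.

```python
# Rough per-domain example counts implied by the stated composition.
# ASSUMPTION: the card says "~607k examples"; 607_000 is used as a stand-in
# for the exact (unstated) total, so all results are estimates.
TOTAL_EXAMPLES = 607_000

# Domain shares as listed under "Dataset Composition".
COMPOSITION = {"code": 0.935, "medical": 0.033, "math": 0.032}


def estimate_domain_counts(total: int, shares: dict[str, float]) -> dict[str, int]:
    """Multiply the total by each share and round to the nearest integer."""
    return {domain: round(total * share) for domain, share in shares.items()}


counts = estimate_domain_counts(TOTAL_EXAMPLES, COMPOSITION)
for domain, n in counts.items():
    print(f"{domain:>8}: ~{n:,} examples")
```

Under that assumption, the medical and math subsets each come to roughly 20k examples, against well over half a million code examples. This may matter when deciding whether to rebalance or upsample the smaller domains during fine-tuning.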
Provider:
OrionLLM