five

ReasonFlux-F1-SFT

收藏
魔搭社区2025-07-24 更新2025-05-24 收录
下载链接:
https://modelscope.cn/datasets/Gen-Verse/ReasonFlux-F1-SFT
下载链接
链接失效反馈
官方服务:
资源简介:
# Dataset Card for ReasonFlux-F1-SFT > ReasonFlux-F1-SFT consists of 1k high-quality problems from [s1k](https://huggingface.co/datasets/simplescaling/s1K). We use ReasonFlux-Zero to generate template augmented reasoning trajectories for each problem and transform them into Long-CoT format to finetune reasoning LLMs. * Github Repository: [Gen-Verse/ReasonFlux](https://github.com/Gen-Verse/ReasonFlux) * Paper:[ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates](https://arxiv.org/abs/2502.06772) * Model: [Gen-Verse/ReasonFlux-F1](https://huggingface.co/Gen-Verse/ReasonFlux-F1) ## Citation ```bash @article{yang2025reasonflux, title={ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates}, author={Yang, Ling and Yu, Zhaochen and Cui, Bin and Wang, Mengdi}, journal={arXiv preprint arXiv:2502.06772}, year={2025} } ```

# ReasonFlux-F1-SFT 数据集卡片 > ReasonFlux-F1-SFT 包含来自[s1k](https://huggingface.co/datasets/simplescaling/s1K)的1000道高质量问题。我们使用ReasonFlux-Zero为每道问题生成经过模板增强的推理轨迹,并将其转换为长思维链(Long Chain of Thought,Long-CoT)格式,以用于推理型大语言模型(Large Language Model,LLM)的微调。 * GitHub 仓库:[Gen-Verse/ReasonFlux](https://github.com/Gen-Verse/ReasonFlux) * 论文:[ReasonFlux:基于扩展思维模板的分层大语言模型推理](https://arxiv.org/abs/2502.06772) * 模型:[Gen-Verse/ReasonFlux-F1](https://huggingface.co/Gen-Verse/ReasonFlux-F1) ## 引用 bash @article{yang2025reasonflux, title={ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates}, author={Yang, Ling and Yu, Zhaochen and Cui, Bin and Wang, Mengdi}, journal={arXiv preprint arXiv:2502.06772}, year={2025} }
提供机构:
maas
创建时间:
2025-05-22
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作