Tachibana2-DeepSeek-R1-PREVIEW

Name: Tachibana2-DeepSeek-R1-PREVIEW
Creator: maas
Published: 2025-12-05 16:41:29
License: 暂无描述

魔搭社区2025-12-05 更新2025-07-12 收录

下载链接：

https://modelscope.cn/datasets/sequelbox/Tachibana2-DeepSeek-R1-PREVIEW

下载链接

链接失效反馈

官方服务：

资源简介：

**This is a preview of the full Tachibana 2 high-difficulty code-reasoning dataset**, containing the first ~6k rows. All responses generated by [deepseek-ai/DeepSeek-R1.](https://huggingface.co/deepseek-ai/DeepSeek-R1) The full dataset will be released for everyone once it's ready! This dataset contains: - 6k high-difficulty synthetic code-reasoning prompts created by [Llama 3.1 405b Instruct](meta-llama/Llama-3.1-405B-Instruct), with an emphasis on task complexity and technical skill. - Responses demonstrate the reasoning capabilities of DeepSeek's 685b parameter R1 reasoning model. **Responses have not been filtered or edited at all:** the Tachibana 2 dataset strives to accurately represent the R1 model. Potential issues may include inaccurate answers and infinite thought loops. Tachibana 2 is presented as-is to be used at your discretion. Users should consider applying their own sub-filtering and manual examination of the dataset before use in training. Do as you will.

**本预览为完整Tachibana 2高难度代码推理数据集的预览版，包含前约6000条数据条目。所有回复均由[deepseek-ai/DeepSeek-R1](https://huggingface.co/deepseek-ai/DeepSeek-R1)生成。完整数据集准备就绪后将向全体用户公开发布。本数据集包含： - 约6000条由[Llama 3.1 405B Instruct](meta-llama/Llama-3.1-405B-Instruct)生成的高难度合成代码推理提示词，该数据集侧重任务复杂度与技术熟练度； - 其生成的回复可体现DeepSeek研发的6850亿参数R1推理模型的推理能力。 **所有回复均未经过任何过滤或编辑：Tachibana 2数据集旨在精准还原R1模型的原生输出表现。该数据集可能存在回复不准确、思维循环无限等问题。本数据集将按原始状态提供，使用者可自行决定使用方式。** 使用者在将该数据集用于模型训练前，应考虑自行进行次级筛选与人工核查。请按需使用。

提供机构：

maas

创建时间：

2025-07-10

5,000+

优质数据集

54 个

任务类型

进入经典数据集