Titanium2-DeepSeek-R1
收藏魔搭社区2025-07-11 更新2025-07-12 收录
下载链接:
https://modelscope.cn/datasets/sequelbox/Titanium2-DeepSeek-R1
下载链接
链接失效反馈官方服务:
资源简介:
**[Click here to support our open-source dataset and model releases!](https://huggingface.co/spaces/sequelbox/SupportOpenSource)**
**Titanium2-DeepSeek-R1** is a dataset focused on architecture and DevOps, testing the limits of [DeepSeek R1's](https://huggingface.co/deepseek-ai/DeepSeek-R1) architect and coding skills!
This dataset contains:
- 32.4k synthetically generated prompts focused on architecture, cloud, and DevOps. All responses are generated using [DeepSeek R1.](https://huggingface.co/deepseek-ai/DeepSeek-R1) Primary areas of expertise are architecture (problem solving, scenario analysis, coding, full SDLC) and DevOps (Azure, AWS, GCP, Terraform, shell scripts)
- Synthetic prompts are generated using [Llama 3.1 405b Instruct.](https://huggingface.co/meta-llama/Meta-Llama-3.1-405B-Instruct)
- Responses demonstrate the reasoning capabilities of DeepSeek's 685b parameter R1 reasoning model.
Thinking tags have been removed for response consistency; otherwise, **responses have not been filtered or edited at all:** the Titanium dataset strives to accurately represent the R1 model. Potential issues may include inaccurate answers and infinite thought loops. Titanium is presented as-is to be used at your discretion.
Users should consider applying their own sub-filtering and manual examination of the dataset before use in training.
Do as you will.
**[点击此处支持我们的开源数据集与模型发布!](https://huggingface.co/spaces/sequelbox/SupportOpenSource)**
**Titanium2-DeepSeek-R1** 是一款专注于架构与开发运维(DevOps)领域的数据集,旨在挖掘[DeepSeek R1](https://huggingface.co/deepseek-ai/DeepSeek-R1)的架构设计与编码能力边界!
本数据集包含以下内容:
- 3.24万条聚焦于架构、云计算与DevOps的合成提示词。所有回复均由[DeepSeek R1](https://huggingface.co/deepseek-ai/DeepSeek-R1)生成,其核心专长领域涵盖架构设计(问题求解、场景分析、编码、完整软件开发生命周期(Software Development Life Cycle,SDLC))与DevOps(Azure、AWS、GCP、Terraform、Shell脚本)
- 合成提示词由[Llama 3.1 405B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-405B-Instruct)生成
- 回复内容旨在展示DeepSeek 6850亿参数R1推理模型的推理能力
为保证回复一致性,已移除思考标记;除此之外,**所有回复均未经过任何过滤或编辑**:本Titanium数据集旨在真实还原R1模型的输出表现。潜在问题可能包含不准确的回复与无限思考循环。本数据集将以原始状态提供,使用者可自行酌情使用。
使用者在将本数据集用于训练前,应考虑自行进行二次筛选与人工审核。
请按需使用。
提供机构:
maas
创建时间:
2025-07-10



