Titanium2-DeepSeek-R1

ModelScope Community. Updated 2025-07-11; indexed 2025-07-12.
Download link:
https://modelscope.cn/datasets/sequelbox/Titanium2-DeepSeek-R1
Description:
**[Click here to support our open-source dataset and model releases!](https://huggingface.co/spaces/sequelbox/SupportOpenSource)**

**Titanium2-DeepSeek-R1** is a dataset focused on architecture and DevOps, testing the limits of [DeepSeek R1's](https://huggingface.co/deepseek-ai/DeepSeek-R1) architecture and coding skills!

This dataset contains:

- 32.4k synthetically generated prompts focused on architecture, cloud, and DevOps. All responses are generated using [DeepSeek R1](https://huggingface.co/deepseek-ai/DeepSeek-R1). Primary areas of expertise are architecture (problem solving, scenario analysis, coding, full SDLC) and DevOps (Azure, AWS, GCP, Terraform, shell scripts).
- Synthetic prompts are generated using [Llama 3.1 405b Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-405B-Instruct).
- Responses demonstrate the reasoning capabilities of DeepSeek's 685b parameter R1 reasoning model.

Thinking tags have been removed for response consistency; otherwise, **responses have not been filtered or edited at all:** the Titanium dataset strives to accurately represent the R1 model. Potential issues may include inaccurate answers and infinite thought loops. Titanium is presented as-is to be used at your discretion.

Users should consider applying their own sub-filtering and manual examination of the dataset before use in training.

Do as you will.
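As a rough illustration of the sub-filtering suggested above, the following minimal sketch drops rows whose response is empty, still contains a stray thinking tag, or looks like a repetition loop. The column name `response`, the `<think>` tag spelling, and the thresholds are assumptions made for this example; they are not taken from the dataset card and should be checked against the actual data.

```python
from collections import Counter


def looks_degenerate(text: str, min_lines: int = 20, max_repeat_ratio: float = 0.5) -> bool:
    """Heuristic loop check: True when one line makes up most of the non-empty lines."""
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    if len(lines) < min_lines:
        # Too short to judge; treat as fine.
        return False
    most_common_count = Counter(lines).most_common(1)[0][1]
    return most_common_count / len(lines) > max_repeat_ratio


def keep_row(row: dict, response_key: str = "response") -> bool:
    """Return True if the row passes the basic checks.

    `response_key` is a hypothetical column name, not confirmed by the dataset card.
    """
    response = (row.get(response_key) or "").strip()
    if not response:
        return False
    # The card says thinking tags were removed; a leftover tag suggests a malformed row.
    if "<think>" in response or "</think>" in response:
        return False
    return not looks_degenerate(response)


# Hand-written sample rows for illustration only (not real dataset content).
sample_rows = [
    {"prompt": "Design a blue/green deployment.", "response": "Use two identical environments."},
    {"prompt": "Write a Terraform module.", "response": ""},
    {"prompt": "Explain autoscaling.", "response": "<think>hmm</think> Scale on CPU."},
    {"prompt": "Plan a migration.", "response": "Wait, let me reconsider.\n" * 50},
]

kept = [row for row in sample_rows if keep_row(row)]
```

If the data is loaded with the Hugging Face `datasets` library, the same predicate can be passed to `Dataset.filter`. A heuristic like this only catches obvious failures, so it complements the manual examination recommended above rather than replacing it.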

Provider:
maas
Created:
2025-07-10