Celestia3-DeepSeek-R1-0528
收藏魔搭社区2025-12-05 更新2025-07-12 收录
下载链接:
https://modelscope.cn/datasets/sequelbox/Celestia3-DeepSeek-R1-0528
下载链接
链接失效反馈官方服务:
资源简介:
**[Click here to support our open-source dataset and model releases!](https://huggingface.co/spaces/sequelbox/SupportOpenSource)**
**Celestia3-DeepSeek-R1-0528** is a dataset focused on science, testing the limits of [DeepSeek R1 0528's](https://huggingface.co/deepseek-ai/DeepSeek-R1-0528) science-reasoning skills!
This dataset contains:
- 90.9k synthetically generated science prompts, with all responses generated using [DeepSeek R1 0528.](https://huggingface.co/deepseek-ai/DeepSeek-R1-0528)
- Primary subjects are physics, chemistry, biology, and computer science; secondary subjects include Earth science, astronomy, and information theory.
- All prompts are synthetic, taken from the original **[sequelbox/Celestia](https://huggingface.co/datasets/sequelbox/Celestia)** Llama 3.1 dataset.
- Responses demonstrate the scientific reasoning capabilities of DeepSeek's newest R1-0528 reasoning model.
**Responses have not been filtered or edited at all:** the Celestia 3 dataset strives to accurately represent the R1-0528 model. Potential issues may include inaccurate answers and infinite thought loops. Celestia 3 is presented as-is to be used at your discretion.
Users should consider applying their own sub-filtering and manual examination of the dataset before use in training.
Do as you will. For the sun.
**[点击此处支持我们的开源数据集与模型发布!](https://huggingface.co/spaces/sequelbox/SupportOpenSource)**
**Celestia3-DeepSeek-R1-0528** 是一款聚焦科学领域的数据集,旨在测试[DeepSeek R1 0528](https://huggingface.co/deepseek-ai/DeepSeek-R1-0528)的科学推理能力边界!
该数据集包含以下内容:
- 90.9千条合成生成的科学提示词,所有回复均由[DeepSeek R1 0528](https://huggingface.co/deepseek-ai/DeepSeek-R1-0528)生成。
- 核心学科涵盖物理学、化学、生物学与计算机科学;次要学科包括地球科学、天文学与信息论。
- 所有提示词均为合成生成,源自原始**[sequelbox/Celestia](https://huggingface.co/datasets/sequelbox/Celestia)** Llama 3.1数据集。
- 生成的回复可体现DeepSeek最新推出的R1-0528推理模型的科学推理能力。
**所有回复均未经过任何过滤或编辑**:Celestia 3数据集旨在精准还原R1-0528模型的实际表现。该数据集可能存在答案不准确、思维循环等潜在问题。Celestia 3数据集将以原样提供,供使用者自行斟酌使用。
使用者在将该数据集用于模型训练前,应考虑自行进行次级筛选与人工审核。
随心而为。为了太阳。
提供机构:
maas
创建时间:
2025-07-10



