five

Celestia3-DeepSeek-R1-0528-PREVIEW

收藏
魔搭社区2025-11-27 更新2025-07-12 收录
下载链接:
https://modelscope.cn/datasets/sequelbox/Celestia3-DeepSeek-R1-0528-PREVIEW
下载链接
链接失效反馈
官方服务:
资源简介:
**[Click here to support our open-source dataset and model releases!](https://huggingface.co/spaces/sequelbox/SupportOpenSource)** **This is an early sneak preview of Celestia3-DeepSeek-R1-0528, containing the first 13.4k rows!** **Celestia3-DeepSeek-R1-0528** is a dataset focused on science, testing the limits of [DeepSeek R1's](https://huggingface.co/deepseek-ai/DeepSeek-R1-0528) science-reasoning skills! This early preview release contains: - 13.4k synthetically generated science prompts. All responses are generated using [DeepSeek R1 0528.](https://huggingface.co/deepseek-ai/DeepSeek-R1-0528) - Primary subjects are physics, chemistry, biology, and computer science; secondary subjects include Earth science, astronomy, and information theory. - All prompts are synthetic, taken from the original **[sequelbox/Celestia](https://huggingface.co/datasets/sequelbox/Celestia)** Llama 3.1 dataset. - Responses demonstrate the scientific reasoning capabilities of DeepSeek's newest R1-0528 reasoning model. **Responses have not been filtered or edited at all:** the Celestia 3 dataset strives to accurately represent the R1-0528 model. Potential issues may include inaccurate answers and infinite thought loops. Celestia 3 is presented as-is to be used at your discretion. Users should consider applying their own sub-filtering and manual examination of the dataset before use in training. Do as you will. For the sun.

**[点击此处支持我们的开源数据集与模型发布!](https://huggingface.co/spaces/sequelbox/SupportOpenSource)** **本文件为Celestia3-DeepSeek-R1-0528的早期预览版,包含前13.4k条数据行!** **Celestia3-DeepSeek-R1-0528是一款聚焦科学领域的数据集,旨在测试[DeepSeek R1-0528](https://huggingface.co/deepseek-ai/DeepSeek-R1-0528)的科学推理能力上限!** 本早期预览版包含以下内容: - 13.4k条人工合成的科学提示词(prompt),所有回复均由[DeepSeek R1-0528](https://huggingface.co/deepseek-ai/DeepSeek-R1-0528)生成。 - 核心学科涵盖物理学、化学、生物学与计算机科学;次要学科包括地球科学、天文学与信息论。 - 所有提示词均为人工合成,源自原始**[sequelbox/Celestia](https://huggingface.co/datasets/sequelbox/Celestia)** Llama 3.1数据集。 - 回复内容可体现DeepSeek最新推出的R1-0528推理模型的科学推理能力。 **所有回复均未经过任何过滤或编辑:** Celestia 3数据集旨在准确呈现R1-0528模型的真实表现。该数据集可能存在回复不准确、思维循环无限等问题。Celestia 3数据集将按原样提供,使用者可自行决定使用方式。 使用者在将该数据集用于模型训练前,应考虑自行进行二次筛选与人工审核。 随心所欲即可。为了太阳。
提供机构:
maas
创建时间:
2025-07-10
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作