Deepseek R1 671B

RapidAPI2025-02-21 更新2025-03-01 收录

下载链接：

https://rapidapi.com/Glavier/api/deepseek-r1-671b1

下载链接

链接失效反馈

官方服务：

资源简介：

We introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1. DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning (RL) without supervised fine-tuning (SFT) as a preliminary step, demonstrated remarkable performance on reasoning. With RL, DeepSeek-R1-Zero naturally emerged with numerous powerful and interesting reasoning behaviors. However, DeepSeek-R1-Zero encounters challenges such as endless repetition, poor readability, and language mixing. To a...

创建时间：

2025-02-21

5,000+

优质数据集

54 个

任务类型

进入经典数据集