five

PrimeIntellect/SYNTHETIC-1-Preference-Data

收藏
Hugging Face2025-02-21 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/PrimeIntellect/SYNTHETIC-1-Preference-Data
下载链接
链接失效反馈
官方服务:
资源简介:
SYNTHETIC-1是一个从Deepseek-R1获得的推理数据集,通过众包计算生成并由各种验证器(如LLM评委或符号数学验证器)注释。该数据集包括数学问题、算法编码问题、现实世界软件工程问题、开放式STEM问题回答和合成代码理解任务。每个任务都有相应的验证器来确保答案的正确性。

SYNTHETIC-1 is a reasoning dataset obtained from Deepseek-R1, generated with crowdsourced compute and annotated with diverse verifiers such as LLM judges or symbolic mathematics verifiers. The dataset includes Mathematics Problems, Algorithmic Coding Problems, Real-World Software Engineering Problems, Open-Ended STEM Question Answering, and Synthetic Code Understanding Tasks, each with corresponding verifiers to ensure the correctness of the answers.
提供机构:
PrimeIntellect
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作