Compumacy/opthinking-1mil
收藏Hugging Face2025-05-11 更新2025-10-18 收录
下载链接:
https://hf-mirror.com/datasets/Compumacy/opthinking-1mil
下载链接
链接失效反馈官方服务:
资源简介:
OpenThoughts2-1M是一个包含100万高质量示例的合成推理数据集,覆盖数学、科学、代码和谜题等领域。该数据集基于之前的OpenThoughts-114k数据集,通过整合如OpenR1等现有数据集以及额外的数学和代码推理数据进行了扩充。该数据集用于训练OpenThinker2-7B和OpenThinker2-32B模型。
OpenThoughts2-1M is an open synthetic reasoning dataset with 1M high-quality examples covering math, science, code, and puzzles. This dataset builds upon our previous OpenThoughts-114k dataset, augmenting it with existing datasets like OpenR1, as well as additional math and code reasoning data. It was used to train OpenThinker2-7B and OpenThinker2-32B models.
提供机构:
Compumacy



