O1-OPEN/OpenO1-SFT
收藏Hugging Face2025-04-22 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/O1-OPEN/OpenO1-SFT
下载链接
链接失效反馈官方服务:
资源简介:
该数据集用于通过SFT(监督微调)激活语言模型的链式思维(Chain-of-Thought),旨在增强模型生成连贯和逻辑推理序列的能力。数据集包含中英文数据,总记录数为77,685条,输出格式使用<Thought>和<Output>分隔符来区分思维过程和最终答案。
This dataset is used for fine-tuning a language model using SFT for Chain-of-Thought Activation. The dataset is designed to enhance the models ability to generate coherent and logical reasoning sequences. It contains 77,685 records in both Chinese and English, and the response field uses <Thought> </Thought> and <Output> </Output> delimiters to separate the thinking process and the final answer. By using this dataset, the model can learn to produce detailed and structured reasoning steps, enhancing its performance on complex reasoning tasks.
提供机构:
O1-OPEN



