mkurman/simplescaling-s1K-R1
收藏Hugging Face2025-02-07 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/mkurman/simplescaling-s1K-R1
下载链接
链接失效反馈官方服务:
资源简介:
s1k R1数据集是simplescaling/s1K数据集的一个分支,包含了一系列对话,其中助手的消息被增强,包含了在`<think> ... </think>`标签内的链式思维(Chain of Thought, CoT)推理。这种修改旨在通过提供明确的思维过程来提高AI模型的解释性和推理能力。数据集主要用于训练和评估能够生成更具解释性和推理性的响应的AI模型。
The s1k R1 dataset is a fork of the simplescaling/s1K dataset, containing a collection of conversations where the assistants messages have been enhanced to include Chain of Thought (CoT) reasoning within `<think> ... </think>` tags. This modification aims to improve the interpretability and reasoning capabilities of AI models by providing explicit thought processes in the responses. The dataset is primarily used for training and evaluating AI models that can generate more interpretable and reasoned responses.
提供机构:
mkurman



