wangx0t/8192-numina-deepseek-DeepSeek-R1-Distill-Llama-8B
收藏Hugging Face2025-02-05 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/wangx0t/8192-numina-deepseek-DeepSeek-R1-Distill-Llama-8B
下载链接
链接失效反馈官方服务:
资源简介:
这个数据集包含了一些特征,如problem(问题),solution(解决方案),messages(消息),generation(生成)和model_name(模型名称)。数据集有一个训练分割,并且包含了一个pipeline.yaml文件,用于使用distilabel CLI重现生成过程。problem特征包含一个数学问题,而solution特征则提供了逐步解释和最终答案。messages特征包括用户的问题和助手的响应。generation特征似乎包含有关文本生成过程的元数据,包括输入和输出令牌的统计数据。数据集被标记为synthetic(合成的),distilabel和rlaif,表明其来源和特点。模型名称暗示这个数据集与DeepSeek-R1-Distill-Llama-8B模型相关。数据集的大小也非常大,超过1GB。
This dataset includes features such as problem, solution, messages, generation, and model_name. It is structured with a train split and contains a pipeline.yaml file for reproducing the generation process using the distilabel CLI. The problem feature consists of a mathematical question, while the solution feature provides a step-by-step explanation and the final answer. The messages feature includes a user question and an assistants response. The generation feature seems to contain metadata about the text generation process, including input and output token statistics. The dataset is tagged with synthetic, distilabel, and rlaif, indicating its origin and characteristics. The model_name suggests that the dataset is associated with the DeepSeek-R1-Distill-Llama-8B model. The dataset size is quite large, over 1GB.
提供机构:
wangx0t



