five

reasoning-1-1k

收藏
魔搭社区2026-01-06 更新2025-01-04 收录
下载链接:
https://modelscope.cn/datasets/AI-ModelScope/reasoning-1-1k
下载链接
链接失效反馈
官方服务:
资源简介:
# Reasoning-1 1K ## Short about This dataset will help in SFT training of LLM on the Alpaca format. The goal of the dataset: to teach LLM to reason and analyze its mistakes using SFT training. The size of 1.15K is quite small, so for effective training on SFTTrainer set *4-6* epochs instead of *1-3*. *Made by Fluently Team ([@ehristoforu](https://huggingface.co/ehristoforu)) using [distilabel](https://github.com/argilla-io/distilabel) with love🥰* ## Dataset structure This subset can be loaded as: ```python from datasets import load_dataset ds = load_dataset("fluently-sets/reasoning-1-1k", "default") ``` Or simply as it follows, since there's only one configuration and is named `default`: ```python from datasets import load_dataset ds = load_dataset("fluently-sets/reasoning-1-1k") ``` <p align="left"> <a href="https://github.com/argilla-io/distilabel"> <img src="https://raw.githubusercontent.com/argilla-io/distilabel/main/docs/assets/distilabel-badge-light.png" alt="Built with Distilabel" width="200" height="32"/> </a> </p>

# Reasoning-1 1K ## 数据集简介 本数据集可用于阿尔帕卡(Alpaca)格式下的大语言模型(Large Language Model,LLM)监督微调(Supervised Fine-Tuning,SFT)训练。 本数据集的目标为:通过监督微调训练,使大语言模型学会推理并分析自身的错误。 本数据集规模为1.15K,体量较小,因此若使用SFT训练器(SFTTrainer)开展高效训练时,请将训练轮次设置为**4-6轮**,而非**1-3轮**。 本数据集由Fluently团队([@ehristoforu](https://huggingface.co/ehristoforu))使用distilabel工具(https://github.com/argilla-io/distilabel)精心制作🥰 ## 数据集结构 本子集可通过以下方式加载: python from datasets import load_dataset ds = load_dataset("fluently-sets/reasoning-1-1k", "default") 或直接按如下方式加载,因该数据集仅有一种名为`default`的配置: python from datasets import load_dataset ds = load_dataset("fluently-sets/reasoning-1-1k") <p align="left"> <a href="https://github.com/argilla-io/distilabel"> <img src="https://raw.githubusercontent.com/argilla-io/distilabel/main/docs/assets/distilabel-badge-light.png" alt="基于Distilabel构建" width="200" height="32"/> </a> </p>
提供机构:
maas
创建时间:
2024-12-29
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作