smoltldr
收藏魔搭社区2025-12-05 更新2025-03-22 收录
下载链接:
https://modelscope.cn/datasets/mlabonne/smoltldr
下载链接
链接失效反馈官方服务:
资源简介:
This dataset was designed for the fine-tune of [HuggingFaceTB/SmolLM2-135M-Instruct](https://huggingface.co/HuggingFaceTB/SmolLM2-135M-Instruct) using GRPO.
It is designed to summarize Reddit posts.
You can reproduce this training using this [colab notebook](https://colab.research.google.com/drive/13mRqgRIvMGGgkQfJL4CS0lzcL4Vl9xUN?usp=sharing). It takes about 40 minutes to train the model.
本数据集专为使用GRPO对[HuggingFaceTB/SmolLM2-135M-Instruct](https://huggingface.co/HuggingFaceTB/SmolLM2-135M-Instruct)进行微调而设计,旨在完成Reddit帖子的摘要生成任务。
用户可通过该[Colab笔记本](https://colab.research.google.com/drive/13mRqgRIvMGGgkQfJL4CS0lzcL4Vl9xUN?usp=sharing)复现整套训练流程,模型训练耗时约40分钟。
提供机构:
maas
创建时间:
2025-03-18



