simple-summaries
收藏魔搭社区2025-12-05 更新2025-08-23 收录
下载链接:
https://modelscope.cn/datasets/ProCreations/simple-summaries
下载链接
链接失效反馈官方服务:
资源简介:
# Simple Summaries
About 10,000 high quality data, with an original text sample from [A subset of Ultra Fineweb](https://huggingface.co/datasets/sumuks/Ultra-FineWeb-10M) and a summary generated by [Llama 3.2 3b instruct](meta-llama/Llama-3.2-3B-Instruct).
## Time & Cost
This took about 3 hours using a batch size of 8 on a rented GPU instance (NVIDIA L4) with a total cost of about 2 dollars.
## Intended use case
- Training summarization AI models
- Training language models
# 简易摘要数据集
该数据集包含约1万条高质量数据,其原始文本样本源自[Ultra Fineweb子集](https://huggingface.co/datasets/sumuks/Ultra-FineWeb-10M),摘要则由[Llama 3.2 3B Instruct](meta-llama/Llama-3.2-3B-Instruct)生成。
## 时间与成本
本次数据集构建使用租赁的NVIDIA L4型GPU实例,批次大小设为8,总耗时约3小时,总成本约2美元。
## 预期应用场景
- 训练摘要生成AI模型
- 训练大语言模型(Large Language Model,LLM)
提供机构:
maas
创建时间:
2025-08-20



