
shorecode/summary-collection-60k-rows

Source: Hugging Face | Updated: 2024-12-10 | Indexed: 2024-12-14
Download link:
https://hf-mirror.com/datasets/shorecode/summary-collection-60k-rows
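Description:
This dataset is a collection of summarization data with train, validation, and test splits of 60066, 8009, and 12014 samples respectively. The total download size is 59205760 bytes (about 59.2 MB), and the total dataset size is 106040396 bytes (about 106.0 MB). The data comes from summary datasets in several repositories, and the original 200k rows were reduced to 60k rows by random sampling.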

This dataset is a compilation of summaries from multiple sources including ijwatson98/formatted-summary-data, gizemgg/wiki-eng-summary-trial-gen0-transformed-instruction, argilla/cnn-dailymail-summaries, and agentlans/wikipedia-paragraph-summaries. It was reduced to 60k rows through random sampling from the original shorecode/summary-colletion-200k-rows repository. The dataset contains two main features: text and target, both of string type. The dataset is split into train, validation, and test sets with 60066, 8009, and 12014 samples respectively.
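
The reported split sizes add up to 80,089 rows, so "60k" matches the train split rather than the whole dataset. The sketch below works out each split's share (roughly 75/10/15) and illustrates uniform random downsampling on toy data. The listing does not document how the sampling was actually done, so `downsample`, its seed, and the toy rows are hypothetical stand-ins, not the author's procedure.

```python
import random

# Split sizes as reported in the dataset description.
SPLIT_SIZES = {"train": 60066, "validation": 8009, "test": 12014}


def split_fractions(sizes):
    """Return each split's share of the total row count."""
    total = sum(sizes.values())
    return {name: count / total for name, count in sizes.items()}


def downsample(rows, k, seed=0):
    """Pick k rows uniformly at random without replacement.

    Hypothetical helper: the real sampling method and seed are not documented.
    """
    rng = random.Random(seed)
    return rng.sample(rows, k)


if __name__ == "__main__":
    for name, frac in split_fractions(SPLIT_SIZES).items():
        print(f"{name}: {frac:.2%}")

    # Toy stand-in for the 200k-row source, using the same two string fields.
    toy_rows = [{"text": f"document {i}", "target": f"summary {i}"} for i in range(200)]
    subset = downsample(toy_rows, 60)
    print(len(subset), "rows kept out of", len(toy_rows))
```

Fixing the seed makes the subset reproducible, which matters if the reduced dataset ever needs to be regenerated from the 200k-row source.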
Provider:
shorecode