lynlyn555/multi_news

Source: Hugging Face. Updated 2025-12-16. Indexed 2025-12-20.
Download link:
https://hf-mirror.com/datasets/lynlyn555/multi_news
Overview:
Multi-News is an English-language dataset for news summarization. It consists of news articles and human-written summaries of those articles from the site newser.com. Each summary is written by a professional editor and includes links to the original articles it cites. The dataset has two fields: document (the text of the source news articles, separated by the special token |||||) and summary (the news summary). It falls in the 10K to 100K size category and contains train (44,972 examples), validation (5,622 examples), and test (5,622 examples) splits.
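
Because each document value packs several source articles into one string, a consumer usually has to split it on the ||||| token before use. The following is a minimal sketch of that step, not an official loader. The helper names split_documents and load_split are hypothetical, and the sketch assumes that the Hugging Face datasets library is installed and that this repository loads through the standard load_dataset call, which has not been verified here.

```python
from typing import List

# Separator token between source articles, as stated in the dataset description.
SEPARATOR = "|||||"


def split_documents(document: str) -> List[str]:
    """Split a Multi-News `document` field into its individual source articles.

    Surrounding whitespace is trimmed and empty fragments (for example, from a
    trailing separator) are dropped.
    """
    parts = (part.strip() for part in document.split(SEPARATOR))
    return [part for part in parts if part]


def load_split(split: str = "validation"):
    """Hypothetical helper: fetch one split from the Hugging Face Hub.

    Assumes `pip install datasets` and network access; whether this repository
    loads without extra arguments is an assumption.
    """
    from datasets import load_dataset  # imported lazily so the splitter works offline

    return load_dataset("lynlyn555/multi_news", split=split)


# Offline illustration with a made-up record shaped like the described fields.
example = "First article text. ||||| Second article text. |||||"
articles = split_documents(example)
print(len(articles), "source articles")
```

With a real record, the same call would be split_documents(record["document"]), and record["summary"] would hold the reference summary for all of the returned articles together.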
Provider:
lynlyn555