O4B
Source: arXiv; indexed 2025-09-30
Download link:
https://github.com/amanpreet692/open4business
Resource overview:
O4B (Open4Business) is a dataset of 17,458 open-access business articles paired with their reference summaries, built to address the lack of large-scale summarization datasets for business documents. Compared with existing datasets, it calls for highly abstractive and concise summaries. The articles and summaries are in English, and the data was collected with open-source tools. The dataset targets automatic text summarization.



