five

saiphyohein/cnn-dailymails-business

收藏
Hugging Face2026-04-22 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/saiphyohein/cnn-dailymails-business
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集名为CNN/DailyMail + BBC – Business Classification,包含来自CNN/DailyMail摘要和BBC新闻商业摘要的文本。这些文本通过`cross-encoder/nli-deberta-v3-small`模型进行分类,标记为yes(商业相关)或no(非商业相关)。商业主题涵盖商业、金融、分析、销售、物流和营销。训练集总共有282,694行数据,其中31,890行标记为商业相关,250,804行标记为非商业相关。

The dataset is named CNN/DailyMail + BBC – Business Classification and contains texts from CNN/DailyMail highlights and BBC News business summaries. These texts are classified by `cross-encoder/nli-deberta-v3-small` as `yes` (business-related) or `no` (not business-related). Business topics include business, finance, analytics, sales, logistics, and marketing. The training set consists of 282,694 rows, with 31,890 marked as business-related and 250,804 as not business-related.
提供机构:
saiphyohein
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作