amyguan/newswire-20-30-macro
Source: Hugging Face. Updated: 2024-12-08. Indexed: 2024-12-14.
Download link:
https://hf-mirror.com/datasets/amyguan/newswire-20-30-macro
Description:
This dataset contains news articles and their associated metadata: bylines, dates, newspaper metadata, and multiple topic labels (e.g., antitrust, civil rights, crime). It also includes Named Entity Recognition (NER) fields, geographical information about where each article was published, and information about the people mentioned. The dataset consists of a single training split with 6,465 examples, reported at 32,222,140.72 bytes (roughly 32 MB).
Provider:
amyguan
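
As a rough illustration of how the training split described above might be loaded and sanity-checked, here is a minimal sketch. It assumes the Hugging Face `datasets` package is installed and that the dataset is reachable under the identifier shown in the listing. The helper names `load_train_split` and `check_row_count` are hypothetical and exist only for this example. Column names are not listed here, so the sketch inspects them rather than assuming any.

```python
# Minimal sketch: load the train split and compare its size to the listing.
# Assumes `pip install datasets`; helper names below are illustrative only.

DATASET_ID = "amyguan/newswire-20-30-macro"
EXPECTED_TRAIN_ROWS = 6465  # example count stated in the description


def load_train_split(loader=None):
    """Return the train split; `loader` can be swapped out for offline use."""
    if loader is None:
        # Imported lazily so the module can be imported without `datasets`.
        from datasets import load_dataset
        loader = load_dataset
    return loader(DATASET_ID, split="train")


def check_row_count(dataset, expected=EXPECTED_TRAIN_ROWS):
    """True if the split has the number of examples the listing reports."""
    return len(dataset) == expected
```

A typical session would call `train = load_train_split()`, print `train.column_names` to see which fields are actually present, and then call `check_row_count(train)` to confirm the split matches the 6,465 examples reported above.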



