amyguan/newswire-10-20-macro
收藏Hugging Face2024-12-08 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/amyguan/newswire-10-20-macro
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个特征,如文章内容、作者、日期、报纸元数据、主题分类、命名实体识别(NER)信息、地理位置信息、提及的人物信息等。数据集的结构包括多个字段,如article(文章内容)、byline(作者)、dates(日期)、newspaper_metadata(报纸元数据)、antitrust(反垄断)、civil_rights(民权)、crime(犯罪)、govt_regulation(政府监管)、labor_movement(劳工运动)、politics(政治)、protests(抗议)、ca_topic(主题分类)、ner_words(NER词汇)、ner_labels(NER标签)、wire_city(城市)、wire_state(州)、wire_country(国家)、wire_coordinates(坐标)、wire_location_notes(位置备注)、people_mentioned(提及的人物)、cluster_size(集群大小)、year(年份)等。数据集划分为训练集,包含1632个样本,下载大小为2277712字节,数据集大小为8134034.595657959字节。
This dataset includes multiple fields such as articles, bylines, dates, newspaper metadata, multiple topic labels, named entity recognition related fields, and geographic information related to news sources. Additionally, it includes information about mentioned people and cluster sizes. The dataset is divided into a training set with 1632 samples.
提供机构:
amyguan



