amyguan/newswire-40-50
收藏Hugging Face2024-12-08 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/amyguan/newswire-40-50
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个字段,主要涉及新闻报道的相关信息,如文章内容、作者、日期、报纸元数据(包括图书馆控制号、报纸所在城市、州和标题)、以及多个主题标签(如反垄断、民权、犯罪、政府监管、劳工运动、政治、抗议等)。此外,数据集还包含命名实体识别(NER)的词汇和标签、新闻发布地点的城市、州、国家和坐标信息、提及的人物信息(包括性别、姓名、职业和维基数据ID)、聚类大小和年份。数据集主要用于分析新闻报道的内容、主题、地理位置和涉及的人物等信息。
The dataset includes multiple fields primarily related to news articles, such as article content, byline, dates, newspaper metadata (including library control number, newspaper city, state, and title), and various topic labels (such as antitrust, civil rights, crime, government regulation, labor movement, politics, protests, etc.). Additionally, the dataset contains named entity recognition (NER) words and labels, the city, state, country, and coordinates of the wire location, information about mentioned people (including gender, name, occupation, and Wikidata ID), cluster size, and year. The dataset is mainly used for analyzing the content, themes, geographical locations, and people involved in news reports.
提供机构:
amyguan



