PresageLabs/NewsBench
收藏Hugging Face2025-09-25 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/PresageLabs/NewsBench
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了多个线程相关的信息,如线程ID、URL、站点信息、发布时间、回复数量、参与人数、性能评分、域名排名、社交媒体互动数据等。此外,还包括文章的作者、发布时间、标题、正文内容、摘要信息、语言、情感倾向、分类、话题、是否允许AI处理、是否有标准链接、是否为突发新闻、是否有外部链接、实体信息、是否被转载、信任度分类、评分、爬取时间、更新时间、文件路径、数据集名称等。数据集分为训练集,包含大约100万个样本,数据集大小约为686MB。
The dataset contains various thread-related information such as thread ID, URL, site information, publish time, reply count, participant count, performance score, domain rank, social media interaction data, etc. It also includes article author, publish time, title, text content, summary information, language, sentiment, categories, topics, whether AI processing is allowed, whether there is a canonical link, whether its breaking news, whether there are external links, entity information, whether its syndicated, trust categories, rating, crawling time, update time, file path, dataset name, etc. The dataset is split into a training set with approximately 1 million samples, with a total size of about 686MB.
提供机构:
PresageLabs



