five

MapIntel Case Study Dataset

收藏
Mendeley Data2026-04-18 收录
下载链接:
https://data.mendeley.com/datasets/7nn6h86snn
下载链接
链接失效反馈
官方服务:
资源简介:
Daily news articles from multiple international sources collected using NewsAPI (https://newsapi.org/) during the period between October 2020 and June 2021. The total number of records is 334,925 documents. The format of the dataset is in JSON. Cleaning is applied to the direct results from the API. We ensure that each document is unique, is written in English, and doesn’t have any HTML tags or any strange pattern. Each record is a dictionary with the following keys and their descriptions: - "text": Cleaned content of the news article (concatenation of "title", "description", and "content" received from the API request. "content" is truncated to 200 characters). - "title": The headline or title of the article. - "url": The direct URL to the article. - "timestamp": The date and time that the article was published, in UTC (+000). Formatted as "%Y-%m-%dT%H:%M:%SZ". - "snippet": Excerpt of the document displayed in the user interface of MapIntel. - "image_url": The URL to a relevant image for the article.
创建时间:
2023-07-18
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作