GDELT Dataset
收藏paperswithcode.com2025-03-27 收录
下载链接:
https://paperswithcode.com/dataset/gdelt
下载链接
链接失效反馈官方服务:
资源简介:
The GDELT Project is a remarkable initiative that monitors our world by analyzing global news from various sources. Here are the key aspects of the GDELT dataset:
Scope and Purpose:
The GDELT Project aims to create a comprehensive, real-time database of global human society.
It monitors news from broadcasts, print media, and web sources in nearly every country and over 100 languages.
By analyzing this vast dataset, it identifies people, locations, organizations, themes, emotions, and events that shape our global society every second of every day.
Data Collection:
GDELT continuously captures and analyzes news articles, broadcasts, and online sources.
Its historical archives date back to January 1, 1979, and it updates every 15 minutes.
The project goes beyond Western media, providing a more global perspective on world events and sentiments.
Features:
GDELT uses sophisticated natural language and data mining algorithms, including powerful deep learning techniques.
It extracts over 300 categories of events, millions of themes, thousands of emotions, and the networks connecting them.
The dataset models human interactions at a large scale, making it valuable for research and analysis.
Vision:
The GDELT Project envisions using this data to:
Understand the world through others' eyes.
Break down language and access barriers.
Facilitate conversations between societies.
Empower local populations with information for safer lives.
Map happiness, conflict, and potentially forecast global tensions.
Global Reach:
GDELT monitors media in over 100 languages across every country, providing a truly global perspective.
It allows us to explore how social media is used worldwide and how people express themselves online.
Open Data:
The entire GDELT database is free and open.
Researchers can download raw data, visualize it, or analyze it at scale using tools like Google BigQuery¹²³⁴⁵.
Source: Conversation with Bing, 3/12/2024
(1) The GDELT Project. https://www.gdeltproject.org/.
(2) The GDELT Database | Aalto Datahub. https://datahub.aalto.fi/en/data-sources/the-gdelt-database.
(3) An Introduction to GDELT Data | MongoDB. https://www.mongodb.com/developer/products/mongodb/introduction-to-gdelt-data/.
(4) GDELT 2.0: Our Global World in Realtime – The GDELT Project. https://blog.gdeltproject.org/gdelt-2-0-our-global-world-in-realtime/.
(5) Data: Querying, Analyzing and Downloading: The GDELT Project. https://www.gdeltproject.org/data.html.
GDELT项目是一项卓越的创举,通过分析全球各类来源的新闻资讯,对世界进行监测。以下是GDELT数据集的关键特性:
范围与目的:
GDELT项目旨在构建一个全面、实时的全球人类社会数据库。该项目监控来自广播、印刷媒体和网络来源的新闻,覆盖近每个国家和超过100种语言。
通过分析这一庞大的数据集,GDELT能够识别出塑造我们每日每刻全球社会的个人、地点、组织、主题、情感和事件。
数据收集:
GDELT持续捕捉并分析新闻文章、广播和在线来源。其历史档案追溯至1979年1月1日,并每15分钟更新一次。
该项目超越了西方媒体,提供了对世界事件和情绪的更全球化的视角。
特性:
GDELT运用先进的自然语言处理和数据挖掘算法,包括强大的深度学习技术。
它提取超过300类事件、数百万主题、数千种情感以及连接它们的网络。
该数据集在宏观层面模拟人类互动,对于研究和分析具有重要意义。
愿景:
GDELT项目设想利用这些数据来实现以下目标:
通过他人的视角理解世界。
消除语言和获取信息的障碍。
促进不同社会间的对话。
赋予当地民众信息,保障其安全生活。
绘制幸福、冲突地图,并可能预测全球紧张局势。
全球影响力:
GDELT监控超过100种语言在全球范围内的媒体,提供了一个真正的全球视角。
它使我们能够探索社交媒体的全球使用情况以及人们如何在网络上表达自己。
开放数据:
整个GDELT数据库免费且开放。研究人员可以下载原始数据,使用如Google BigQuery等工具进行可视化或大规模分析。
资料来源:与Bing的对话,2024年3月12日
(1) GDELT项目。https://www.gdeltproject.org/
(2) GDELT数据库 | 阿尔托数据中心。https://datahub.aalto.fi/en/data-sources/the-gdelt-database
(3) GDELT数据简介 | MongoDB。https://www.mongodb.com/developer/products/mongodb/introduction-to-gdelt-data/
(4) GDELT 2.0:实时全球世界 – GDELT项目。https://blog.gdeltproject.org/gdelt-2-0-our-global-world-in-realtime/
(5) 数据:查询、分析和下载 – GDELT项目。https://www.gdeltproject.org/data.html
提供机构:
Papers with Code
搜集汇总
数据集介绍

以上内容由遇见数据集搜集并总结生成



