five

Digitally Accountable Public Representation (DAPR) Data

收藏
DataCite Commons2025-05-16 更新2025-04-15 收录
下载链接:
https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/A9EPYJ
下载链接
链接失效反馈
官方服务:
资源简介:
This is the main bulk data repository for the Digitally Accountable Public Representation (DAPR) Database, an innovative archive that systematically tracks and analyzes the online communications of federal, state, and local officials in the U.S. Focusing on X/Twitter and Facebook, the current database includes 28,980 public officials, their demographic information, and 5,769,904 Tweets along with 450,972 Facebook posts, dating from January 2020 to December 2024 for X/Twitter and January 2020 to December 2021 for Facebook, offering a rich historical perspective on digital political discourse by elected officials in the U.S.. To comply with the terms of data access on platform APIs, the raw post-level data is aggregated to the week level, and we disseminate content information as bags-of-words rather than the original raw text of posts. The data does include URLs to the original posts, which can be used to "rehydrate" the original data. We also distribute metadata on the officials in the DAPR database. Due to the size of the individual files, we disseminate each post data file in a compressed format. Note that due to changes in the Twitter/X API in 2023, as well as funding for the DAPR project, the scope and coverage of data collected from X shifted between 2024. See readme.pdf for details on the data as well as a description of how the data collection was affected by changes in the X API.
提供机构:
Harvard Dataverse
创建时间:
2024-10-09
搜集汇总
数据集介绍
main_image_url
背景与挑战
背景概述
该数据集是一个系统追踪美国联邦、州和地方官员在线通信的数据库,主要涵盖X/Twitter和Facebook平台,包含超过28,000名官员的人口统计信息和超过600万条社交媒体帖子。数据时间跨度从2020年到2024年,为研究美国政治数字话语提供了丰富的历史视角,但为遵守平台API条款,原始文本以词袋形式聚合分发,并包含原始帖子URL以供重新获取。
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作