A Decade of Reddit Politics: Comprehensive Dataset on User Political Leanings and Interaction Networks (2011-2021)
收藏NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/10715426
下载链接
链接失效反馈官方服务:
资源简介:
Data in brief
Small description of the Reddit Politics 10 years dataset (starting from 2011-01).
Political leanings
node_month_leaning_full_headings.zip
The zip file contains a csv where each row identifies a user content (either post or comment) with the following structure
node_id,month_progressive_id,leaning_lable,leaning_score,post_comment
where
node_id: is the user uniq identifiermonth_progressive:_id is a numeric value from 0 to 100 identifying the month in which the post/comment has been publishedleaning_label: is a discrete variable identifying left/right/moderates (it is based on leaning_score and can be re-binned if needed)leaning_score: is the continuos score describing the political leaning (range [0,1])post_comment: a flag P/C to differentiate the submission type
monthly_scores_json.zip
This archive contains 3 json files:monthly_scores.json: a dictionary month->node_id->{post: [list political leanings], comments: [list political leanings]};monthly_scores_post_agg.json: a dictionary mont->node_id->political_leaning, where the aggregated score is the average of the interquartile range of the political leaning of the sole users' posts;monthly_scores_agg.json: a dictionary mont->node_id->political_leaning, where the aggregated score is the weighted(*) average among (i) the mean value of the interquartile range of the political leaning of the users' posts, (ii) the mean value of the interquartile range of the political leaning of the users' comments;
(*) being posts' annotation more reliable than comments' ones we decided to weight the former 10 times the latter when aggregating.
monthly_networks_full.zip
This archive contains all the monthly undirected, unweighted, interaction network (each row identifying an edge among two node ids). The networks cover all users having having a political leanin computed (using *both* posts and comments).
monthly_networks_posts.zip
This archive contains all the monthly undirected, unweighted, interaction network (each row identifying an edge among two node ids). The networks cover all users having having a political leanin computed considering *only* posts.
创建时间:
2024-02-27



