five

Who Did the 115th US Congress Retweet ?

收藏
ICPSR2020-01-01 更新2026-04-16 收录
下载链接:
https://www.openicpsr.org/openicpsr/project/108303/version/V2/view
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset includes the retweets posted on Twitter by accounts associated with members of the US Congress during the 115th Congress (2017-2018). The list of accounts combines two sources: <br>Justin Littman's list (https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/UIVHQR)The United States project list (https://github.com/unitedstates/congress-legislators)Tweets were collected using Twitter's Search API through the twitter_user_collector Python script (https://github.com/casmlab/twitter_user_collector).<br><br>We filtered all tweets posted during the 115th Congress, leaving only those that have an associated attribute "retweeted_status", which indicates that the given CM's tweet is a retweet of another tweet. These retweets number 209,856 during the 115th Congress, made by 38,131 unique Twitter accounts.<br><br>We preserved and renamed metadata these tweets provided through Twitter's API, including the fields 'tweet_id_str', 'full_text', 'user_id_str', 'user_screen_name', 'user_followers_count', 'created_at', 'retweet_count', 'retweeted_status', and 'year' (extracted from 'created_at').<br><br>Beyond that tweet metadata provided through Twitter’s API, we collected additional demographic metadata for as many CMs as possible of those featured in our Tweet collection by using The United States Project's crowdsourced list of current legislators’ official Twitter handles, and associated metadata fields identifying a legislator’s unique bioguide ID ('bioguide' field), name (‘name’ field), chamber (‘chamber’ field), party (‘party’ field), state represented (‘state’ field), gender (‘gender’ field), and birthday (‘birthday’ field). For those CMs not included in The United States Project, we manually searched for information to fill each of these metadata fields.<br><br>Based on which state each of these CMs represents, we assigned each CM a region (‘region’ field) based on those U.S. regional divisions outlined by Karl and Koss in their 1984 paper (https://repository.library.noaa.gov/view/noaa/10238) and which is also used by the U.S. National Centers for Environmental Information. For those states not captured by Karl and Koss’ regions, we made determinations ourselves and assigned them according to climatological and cultural contexts. In doing so, we developed an additional regional label, “Islands”. Those states or territories that we independently assigned include American Samoa, Virgin Islands, Puerto Rico, Hawaii, District of Columbia, and Alaska.<br><br>We determined age (‘age’ field) at the time of dataset creation (Jan. 10, 2020) according to CMs’ reported birthdays. We then grouped these ages into those age buckets 30-39, 40-49, 50-59, 60-69, 70-79, 80-89 (‘age_bucket’ field).<br>The OpenICPSR dataset features tweets by 520 CMs with this associated metadata.<br><br>Finally, we include fields which describe the original tweet that the CM retweeted and the user who posted it. We include that original poster’s Twitter user ID ('rt_user_id' field), Twitter screen name ('rt_screen_name' field), number of Twitter followers ('rt_followers_count' field), and user bio ('rt_bio' field). We extracted these fields from the JSON value included in the Twitter API's 'retweeted_status' field.<br><br><br><br>
提供机构:
University of Michigan
创建时间:
2020-01-01
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作