five

#MeToo Tweet IDs, October 15-28, 2017

收藏
doi.org2019-11-14 更新2025-03-23 收录
下载链接:
https://doi.org/10.3886/ICPSR37447.v1
下载链接
链接失效反馈
官方服务:
资源简介:
This collection of tweet IDs pertains to the first two weeks of the #MeToo hashtag campaign in October 2017. During this time period there were over 1.5 million tweets with the #MeToo hashtag. Tweets containing the hashtag #MeToo were collected retroactively from a full historical Twitter Firehose (100%) collection, and reply threads in response to those tweets were separately collected from Twitter. According to Twitter Terms of Service, full tweet objects cannot be disseminated, but the tweet IDs can be rehydrated through Twitter's public GET statuses/lookup API endpoint. The available data for this study exist in one zipped folder containing 28 files. There are 14 .csv files, one for each day, between October 15th to October 28th, containing the tweet ID with one tweet ID appearing per line. Each file only contains a single column of data (tweet_id). There were on average 109,237 tweets per day during this two-week period ranging between 16,074 to 528,143 tweets per day. Tweets must have been public and not deleted or taken down at the time of collection in order to appear in this dataset. The other 14 .csv files correspond to the reply threads for each day in response to tweets containing the hashtag #MeToo. Each line indicates the tweet ID of a reply in a thread of replies to a #MeToo tweet (tweet_id) and the tweet ID of the tweet immediately preceeding that tweet in the reply thread (in_reply_to_tweet_id) as comma-separated values. There were on average 21,072 replies to tweets per day during this period with a range of 2,388 to 110,789 replies per day.

本数据集收录了2017年10月#MeToo运动前两周的推文ID。在此期间,含有#MeToo标签的推文数量超过150万条。通过追溯性地从Twitter全历史数据流(100%)中收集含有#MeToo标签的推文,并单独从Twitter收集对这些推文的回复线程。根据Twitter服务条款,完整推文对象不得传播,但可以通过Twitter公共GET statuses/lookup API端点重新提取推文ID。可供研究的数据存放在一个包含28个文件的压缩文件夹中,其中14个为.csv文件,分别对应10月15日至10月28日每一天的推文ID,每行包含一个推文ID,每个文件仅包含一个数据列(tweet_id)。在这两周期间,每天的推文数量平均为109,237条,日推文数量介于16,074条至528,143条之间。只有那些在收集时是公开的、未被删除或撤下的推文才能出现在此数据集中。另外14个.csv文件对应于每天对含有#MeToo标签的推文的回复线程。每行指示一个回复线程中回复的推文ID(tweet_id)以及在该回复线程中紧接该推文之前的推文ID(in_reply_to_tweet_id),以逗号分隔的值表示。在此期间,每天的推文回复数量平均为21,072条,日回复数量介于2,388条至110,789条之间。
提供机构:
Inter-university Consortium for Political and Social Research [distributor]
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作