
Virality Measures of "Data Tweets"

Source: DataCite Commons. Updated 2020-08-25; indexed 2024-07-28.
Download link:
https://figshare.com/articles/Virality_Measures_of_Data_Tweets_/11940426
Description:
This dataset consists of two files in TSV format derived from 16,754,250 tweets that were identified as containing different forms of "numeric data" in an extended collection of tweets from Twitter's 1% public sample, covering 11 months from September 2018.

Both files have a key column labelled "TweetID", which is the Twitter API ID that can be used to retrieve the full Twitter data (retrieval via TWARC is recommended).

The file "datatweet-numeric-occurrences.txt" consists of three columns:
1. TweetID
2. NumericDataString: the actual substring from the tweet that was recognised as numeric, e.g. "500 billion" or "24 years"
3. NumericType: one of a set of identified numeric types, e.g. "[cardinal]" or "[time]"

The "virality" associated with the tweets in which the numeric data has been found is given in the file "datatweet-virality.txt". Its columns are as follows:
1. id of the tweet
2. retweet_count
3. favorite_count
4. followers_count (of the user who made the tweet)

If this tweet is a retweet of another (original) tweet, the following columns are non-empty:
5. id of the original tweet
6. favorite_count of the original tweet
7. followers_count of the original tweet's author

NB: if col 2 is 0, then cols 5-7 will be blank. If col 2 > 0, then it contains the number of retweets of the original tweet, not the number of times that this retweet has been retweeted.
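The column layout described above can be read with Python's standard csv module. The following is a minimal sketch, not an official loader: the function names, the dictionary keys, and the sample rows are hypothetical and purely illustrative (the sample rows are synthetic, not taken from the dataset). It assumes that the files are tab-separated without quoting, and it skips any line whose first field is not numeric in case a header row is present, which the description does not specify. Tweet IDs are kept as strings so that they pass unchanged to a retrieval tool.

```python
import csv
import io
from collections import defaultdict

# Hypothetical key names for the seven columns of datatweet-virality.txt.
VIRALITY_COLUMNS = [
    "tweet_id", "retweet_count", "favorite_count", "followers_count",
    "orig_tweet_id", "orig_favorite_count", "orig_followers_count",
]
ID_COLUMNS = {"tweet_id", "orig_tweet_id"}


def _clean(name, value):
    """Return None for blank fields, a string for IDs, an int for counts."""
    value = value.strip()
    if not value:
        return None
    return value if name in ID_COLUMNS else int(value)


def parse_virality(handle):
    """Parse virality rows from an open text handle into a list of dicts."""
    rows = []
    for fields in csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE):
        # Skip blank lines and a possible header row (assumption).
        if not fields or not fields[0].strip().isdigit():
            continue
        # Pad short rows so that missing trailing columns become blanks.
        fields = fields + [""] * (len(VIRALITY_COLUMNS) - len(fields))
        row = {name: _clean(name, value)
               for name, value in zip(VIRALITY_COLUMNS, fields)}
        # Columns 5-7 are only filled when the tweet is a retweet.
        row["is_retweet"] = row["orig_tweet_id"] is not None
        rows.append(row)
    return rows


def parse_occurrences(handle):
    """Group (NumericDataString, NumericType) pairs by TweetID."""
    by_tweet = defaultdict(list)
    for fields in csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE):
        if len(fields) < 3 or not fields[0].strip().isdigit():
            continue
        tweet_id, numeric_string, numeric_type = fields[:3]
        by_tweet[tweet_id.strip()].append((numeric_string, numeric_type))
    return dict(by_tweet)


# Synthetic sample rows, for illustration only.
SAMPLE_VIRALITY = "1001\t0\t3\t250\t\t\t\n1002\t12\t0\t80\t1000\t40\t9000\n"
SAMPLE_OCCURRENCES = "1001\t500 billion\t[cardinal]\n1001\t24 years\t[time]\n"

virality_rows = parse_virality(io.StringIO(SAMPLE_VIRALITY))
occurrences = parse_occurrences(io.StringIO(SAMPLE_OCCURRENCES))
```

For the real files, an open file handle can be passed in place of the io.StringIO objects, for example open("datatweet-virality.txt", encoding="utf-8", newline=""); the UTF-8 encoding is also an assumption.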
Provider:
figshare
Created:
2020-03-05