Virality Measures of "Data Tweets"
Source: DataCite Commons | Updated: 2020-08-25 | Indexed: 2024-07-28
Download link:
https://figshare.com/articles/Virality_Measures_of_Data_Tweets_/11940426
Description:
This dataset consists of two files in TSV format derived from a large number of tweets (16,754,250) that were identified as containing different forms of "numeric data" in an extended collection of tweets from Twitter's 1% public sample over 11 months from September 2018.

Both files have a key column labelled "TweetID", which is the Twitter API ID that can be used to retrieve the full Twitter data (recommended retrieval via TWARC).

The file "datatweet-numeric-occurrences.txt" consists of three columns:
1. TweetID
2. NumericDataString - the actual substring from the tweet which was recognised as numeric, e.g. "500 billion" or "24 years"
3. NumericType - one of a set of identified numeric types, e.g. "[cardinal]" or "[time]"

The "virality" associated with the tweets in which the numeric data has been found is given in the file "datatweet-virality.txt". Its columns are as follows:
1. id of the tweet
2. retweet_count
3. favorite_count
4. followers_count (of the user who made the tweet)

If this tweet is a retweet of another (original) tweet, the following columns are non-empty:
5. id of the original tweet
6. favorite_count of the original tweet
7. followers_count of the original tweet's author

NB: if col 2 is 0, then cols 5-7 will be blank. If col 2 > 0, then it contains the number of retweets of the original tweet, not the number of times that this retweet has been retweeted.
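The column layout of "datatweet-virality.txt" can be read with a few lines of Python. The following is only a minimal sketch under stated assumptions: the description does not say whether the file has a header row, so the parser skips any line whose first field is not numeric; the field names in `ViralityRow` are hypothetical labels chosen here for the seven columns, and the sample rows are made-up values, not records from the dataset.

```python
import csv
import io
from typing import Iterable, Iterator, NamedTuple, Optional


class ViralityRow(NamedTuple):
    """One line of datatweet-virality.txt (field names are hypothetical)."""
    tweet_id: str                            # col 1, kept as a string
    retweet_count: int                       # col 2
    favorite_count: int                      # col 3
    followers_count: int                     # col 4
    original_id: Optional[str]               # col 5, blank unless a retweet
    original_favorite_count: Optional[int]   # col 6
    original_followers_count: Optional[int]  # col 7

    @property
    def is_retweet(self) -> bool:
        return self.original_id is not None


def _opt_int(value: str) -> Optional[int]:
    """Convert a possibly blank field to an int, or None when blank."""
    value = value.strip()
    return int(value) if value else None


def parse_virality(lines: Iterable[str]) -> Iterator[ViralityRow]:
    """Yield rows from tab-separated lines, tolerating blank cols 5-7."""
    reader = csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    for fields in reader:
        # Assumption: skip empty lines and a header row, if one exists.
        if not fields or not fields[0].strip().isdigit():
            continue
        fields = fields + [""] * (7 - len(fields))  # pad short rows
        yield ViralityRow(
            tweet_id=fields[0].strip(),
            retweet_count=int(fields[1]),
            favorite_count=int(fields[2]),
            followers_count=int(fields[3]),
            original_id=fields[4].strip() or None,
            original_favorite_count=_opt_int(fields[5]),
            original_followers_count=_opt_int(fields[6]),
        )


# Made-up sample: one original tweet, then one retweet of tweet 1000.
sample = "1001\t0\t3\t150\t\t\t\n1002\t42\t0\t80\t1000\t97\t12000\n"
rows = list(parse_virality(io.StringIO(sample)))
retweets = [r for r in rows if r.is_retweet]
```

For the real file, an open file handle can be passed in place of the `io.StringIO` object. Tweet IDs are kept as strings so that they are never rounded by tools that treat large integers as floating-point numbers. For a row where `is_retweet` is true, `retweet_count` refers to the original tweet, as the NB above explains.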
Provider:
figshare
Created:
2020-03-05



