five

Replication Data for: Variation across Scales: Measurement Fidelity under Twitter Data Sampling

收藏
NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://doi.org/10.7910/DVN/GW9GDM
下载链接
链接失效反馈
官方服务:
资源简介:
The dataset is first introduced in the following paper: Siqi Wu, Marian-Andrei Rizoiu, and Lexing Xie. Variation across Scales: Measurement Fidelity under Twitter Data Sampling. In AAAI International Conference on Weblogs and Social Media (ICWSM), 2020. Complete/Sample retweet cascades datasets These datasets contain 2 pairs of complete/sampled retweet cascades on topic Cyberbullying (sampling rate: 0.5272) and YouTube (sampling rate: 0.9153). Each line is a cascades for a root tweet, in the format of "root_tweet_id-root_user_followers:retweet_id1-retweet_user_followers1,retweet_id2-retweet_user_followers2,...". The tweet_id can be melt into timestamp_ms, check the melt_snowflake function here. See more details in this github repo.
创建时间:
2021-04-06
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作