
reddit user posting behavior (mid-2013)

DataCite Commons · Updated 2025-06-01 · Indexed 2024-07-27
Download link:
https://figshare.com/articles/dataset/reddit_user_posting_behavior/874101/2
Description:
This file contains the posting preferences of more than 850,000 active reddit users, sampled in mid-2013. The data was used to generate the interactive visualization "redditviz" and will be analyzed in detail in an upcoming research article. If you use this data in your work, please cite our paper "Navigating the massive world of reddit" (http://arxiv.org/abs/1312.3387).

The file is organized as follows. Each line is an entry for one anonymous user. Each user was randomly assigned a unique ID, which is the first field of the line. Following the user ID, separated by commas, are the subreddits (i.e., interests) that the user regularly posts in. A user counts as "active" in a subreddit if they posted or commented there at least 10 times in their last 1,000 posts and comments.
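The file layout and the "active" rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names `parse_users` and `active_subreddits`, the sample rows, and the subreddit names are hypothetical and do not come from the dataset or from the authors' code. The sketch also assumes that fields never contain quoted commas, so a plain `split(",")` is sufficient.

```python
from collections import Counter


def parse_users(lines):
    """Yield (user_id, subreddits) pairs from lines shaped like 'id,sub1,sub2,...'.

    Assumes plain comma separation with no quoting; blank lines are skipped.
    """
    for line in lines:
        line = line.strip()
        if not line:
            continue
        fields = [field.strip() for field in line.split(",")]
        user_id = fields[0]
        # Drop empty fields, e.g. those produced by a trailing comma.
        subreddits = [name for name in fields[1:] if name]
        yield user_id, subreddits


def active_subreddits(recent_subreddits, min_count=10, window=1000):
    """Apply the stated rule: at least `min_count` of the last `window` items.

    `recent_subreddits` is a hypothetical newest-first list holding one
    subreddit name per post or comment; the published file already has this
    rule applied, so this function only illustrates the criterion.
    """
    counts = Counter(recent_subreddits[:window])
    return sorted(name for name, n in counts.items() if n >= min_count)


# Made-up rows in the documented format (not real records from the file).
sample_lines = ["1,askreddit,python", "", "2,python"]
users = dict(parse_users(sample_lines))

# How many sampled users are active in each subreddit.
popularity = Counter(name for subs in users.values() for name in subs)
```

To read the real download, the same parser could be fed an open file handle, for example `parse_users(open(path, encoding="utf-8"))`, where `path` is wherever the figshare file was saved.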

Provider:
figshare
Created:
2016-01-18