nicchio816/reddit_dataset_111
收藏Hugging Face2025-08-07 更新2025-08-30 收录
下载链接:
https://hf-mirror.com/datasets/nicchio816/reddit_dataset_111
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是Bittensor子网13去中心化网络的一部分,包含预处理的Reddit数据。数据由网络矿工持续更新,为各种分析和机器学习任务提供实时的Reddit内容流。主要语言为英语,但由于去中心化的创建方式,数据可能是多语言的。数据集允许研究人员和科学家探索社交媒体动态的不同方面,并开发创新的应用程序。数据集包括Reddit帖子或评论,每个实例包含文本内容、情感或主题标签、数据类型、社区名称、日期时间、编码的用户名和编码的URL。数据集持续更新,没有固定的划分,用户应根据需求和时间戳创建自己的划分。
This dataset is part of the Bittensor Subnet 13 decentralized network, containing preprocessed Reddit data. The data is continuously updated by network miners, providing a real-time stream of Reddit content for various analytical and machine learning tasks. The primary language is English, but the dataset can be multilingual due to decentralized creation methods. The dataset includes Reddit posts or comments, each with fields for text content, sentiment or topic label, data type, community name, date and time, encoded username, and encoded URLs. The dataset is continuously updated and does not have fixed splits, requiring users to create their own splits based on their requirements and the datas timestamp.
提供机构:
nicchio816



