zkpbeats/reddit_ds_129259
Source: Hugging Face. Updated: 2025-04-11. Indexed: 2025-04-12.
Download link:
https://hf-mirror.com/datasets/zkpbeats/reddit_ds_129259
Description:
The Bittensor Subnet 13 Reddit Dataset is part of the Bittensor Subnet 13 decentralized network and contains preprocessed Reddit data. Network miners update it continuously, so it provides a real-time stream of Reddit content suitable for analytical and machine learning tasks such as sentiment analysis, topic modeling, and community analysis. The dataset is predominantly in English but also includes multilingual content. Its fields include text, label, data type, community name, datetime, username_encoded, and url_encoded. No predefined splits are provided: users are expected to create their own data splits based on their needs and the dataset's timestamps. The data is collected in adherence to Reddit's terms of service and API usage guidelines, and sensitive information is encoded to protect user privacy.
Provider:
zkpbeats
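
Because the description leaves data splitting to the user, a timestamp-based split could look roughly like the sketch below. It is a minimal illustration, not the dataset's official method: the helper `split_by_time`, the sample rows, the key name `"datetime"`, and the ISO 8601 date format are all assumptions made for this example.

```python
from datetime import datetime


def split_by_time(records, cutoff, key="datetime"):
    """Split records into (before, on_or_after) relative to a cutoff datetime.

    Assumes each record is a dict whose `key` holds an ISO 8601 string
    without a timezone offset; the real dataset's format may differ.
    """
    before, on_or_after = [], []
    for rec in records:
        ts = datetime.fromisoformat(rec[key])
        # Older records go to the first split, newer ones to the second.
        (before if ts < cutoff else on_or_after).append(rec)
    return before, on_or_after


# Hypothetical rows for illustration only, not actual dataset content.
rows = [
    {"text": "first post", "datetime": "2025-03-01"},
    {"text": "second post", "datetime": "2025-03-20"},
    {"text": "third post", "datetime": "2025-04-05"},
]

train, test = split_by_time(rows, datetime(2025, 4, 1))
```

Splitting on time rather than at random keeps later posts out of the training portion, which matters for a dataset that keeps growing.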



