veyhoranohy/reddit_dataset_248
Source: Hugging Face. Updated: 2025-04-25. Indexed: 2025-02-15.
Download link:
https://hf-mirror.com/datasets/veyhoranohy/reddit_dataset_248
Description:
The Bittensor Subnet 13 Reddit Dataset is part of the Bittensor Subnet 13 decentralized network and contains preprocessed Reddit data. Network miners update it continuously, providing a real-time stream of Reddit content for analytical and machine learning tasks. The dataset is primarily in English, but because it is created in a decentralized way, it may also contain other languages. Splits are user-defined, based on user requirements and timestamps. The data is sourced from public posts and comments on Reddit, in keeping with the platform's terms of service and API usage guidelines. All usernames and URLs are encoded to protect privacy. The dataset is suitable for tasks such as sentiment analysis and topic modeling.
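The timestamp-based splitting mentioned in the description could look roughly like the sketch below. It is only an illustration under stated assumptions: the field names `text` and `datetime`, the ISO date format, the helper `split_by_timestamp`, and the sample rows are all hypothetical and are not taken from this listing. Check the actual schema before adapting it.

```python
from datetime import datetime


def split_by_timestamp(records, cutoff, key="datetime"):
    """Partition records into (before cutoff, on or after cutoff).

    `key` is an assumed field name holding an ISO-8601 date string.
    """
    cutoff_dt = datetime.fromisoformat(cutoff)
    before, after = [], []
    for rec in records:
        ts = datetime.fromisoformat(rec[key])
        # Rows strictly earlier than the cutoff go to the first split.
        (before if ts < cutoff_dt else after).append(rec)
    return before, after


# Hypothetical sample rows standing in for real dataset records.
sample = [
    {"text": "first post", "datetime": "2025-01-10"},
    {"text": "second post", "datetime": "2025-02-20"},
    {"text": "third post", "datetime": "2025-03-05"},
]

train, test = split_by_timestamp(sample, "2025-02-15")
print(len(train), len(test))
```

Because the dataset is described as continuously updated, a cutoff-date split like this keeps later rows out of the earlier split, which is usually what one wants when evaluating on newer content.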
Provider:
veyhoranohy



