marry-1111/reddit_dataset_76
收藏Hugging Face2025-01-20 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/marry-1111/reddit_dataset_76
下载链接
链接失效反馈官方服务:
资源简介:
Bittensor子网13 Reddit数据集是一个包含预处理Reddit数据的集合,属于Bittensor子网13去中心化网络的一部分。该数据集持续由网络矿工更新,提供实时的Reddit内容流,适用于各种分析和机器学习任务。数据集主要包含英文内容,但也可能是多语言的。每个数据实例代表一个Reddit帖子或评论,包括文本内容、标签、数据类型、社区名称、日期时间、编码后的用户名和URL等字段。
The Bittensor Subnet 13 Reddit Dataset is a collection of preprocessed Reddit data, part of the Bittensor Subnet 13 decentralized network. The dataset is continuously updated by network miners, providing a real-time stream of Reddit content suitable for various analytical and machine learning tasks. The dataset primarily contains English content but may also be multilingual. Each data instance represents a single Reddit post or comment, including fields for text content, label, data type, community name, datetime, encoded username, and encoded URL.
提供机构:
marry-1111



