five

James096/reddit_dataset_162

收藏
Hugging Face2025-05-24 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/James096/reddit_dataset_162
下载链接
链接失效反馈
官方服务:
资源简介:
Bittensor Subnet 13 Reddit数据集是Bittensor子网13的一部分,包含预处理后的Reddit数据。这些数据由网络矿工持续更新,提供了一个实时的Reddit内容流,适用于各种分析和机器学习任务。数据集以英语为主,但也包含多语言内容。每个数据实例代表一个Reddit帖子或评论,包括文本内容、标签、数据类型、社区名称、日期时间、用户名编码和URL编码等字段。

The Bittensor Subnet 13 Reddit Dataset is a part of the Bittensor Subnet 13, containing preprocessed Reddit data. The data is continuously updated by network miners, providing a real-time stream of Reddit content for various analytical and machine learning tasks. The dataset is predominantly in English but also includes multilingual content. Each data instance represents a single Reddit post or comment, including fields such as text content, label, data type, community name, date and time, encoded username, and encoded URL.
提供机构:
James096
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作