rainbowbridge/x_dataset_46092
收藏Hugging Face2025-08-01 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/rainbowbridge/x_dataset_46092
下载链接
链接失效反馈官方服务:
资源简介:
Bittensor Subnet 13 X(Twitter)数据集是Bittensor Subnet 13去中心化网络的一部分,包含了来自X(前Twitter)的预处理数据。这些数据持续更新,为各种分析和机器学习任务提供了实时推文流。数据集主要由英语组成,但也包含多语言内容。数据结构包括推文的文本、标签、话题标签、发布日期、用户名编码和URL编码等字段。用户需根据时间戳自行创建数据分割。数据来源于公共Twitter推文,所有用户名和URL均经过编码处理以保护隐私。
The Bittensor Subnet 13 X (Twitter) Dataset is part of the Bittensor Subnet 13 decentralized network, containing preprocessed data from X (formerly Twitter). The data is continuously updated, providing a real-time stream of tweets for various analytical and machine learning tasks. The dataset is primarily in English but also includes multilingual content. The data structure consists of tweet text, labels, hashtags, posting dates, encoded usernames, and encoded URLs. Users need to create their own data splits based on timestamps. The data is sourced from public Twitter posts, with all usernames and URLs encoded to protect privacy.
提供机构:
rainbowbridge



