william-1111/x_dataset_0110104
收藏Hugging Face2025-07-30 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/william-1111/x_dataset_0110104
下载链接
链接失效反馈官方服务:
资源简介:
Bittensor Subnet 13 X(Twitter)数据集是Bittensor Subnet 13去中心化网络的一部分,包含来自X(前Twitter)的预处理推文数据。该数据集不断更新,提供实时推文流,适用于各种分析和机器学习任务。数据集以英文为主,但也包含多语言内容。每个推文实例包括文本内容、情感或话题标签、话题标签列表、发布日期、用户名编码和URL编码等字段。数据集没有固定的分割,用户需根据需求和时间戳自行创建数据分割。数据来源于公共推文,遵循平台的服务条款和API使用指南,对敏感信息进行编码处理以保护用户隐私。
The Bittensor Subnet 13 X (Twitter) Dataset is a part of the Bittensor Subnet 13 decentralized network, containing preprocessed tweet data from X (formerly Twitter). The dataset is continuously updated, providing a real-time stream of tweets suitable for various analytical and machine learning tasks. The dataset is primarily in English but also includes multilingual content. Each tweet instance comprises fields such as text content, sentiment or topic labels, a list of hashtags, the posting date, encoded username, and encoded URLs. The dataset does not have fixed splits and users are required to create their own splits based on timestamps and requirements. The data is sourced from public tweets, adhering to the platforms terms of service and API usage guidelines, with sensitive information encoded to protect user privacy.
提供机构:
william-1111



