momo1942/x_dataset_59332
收藏Hugging Face2025-08-01 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/momo1942/x_dataset_59332
下载链接
链接失效反馈官方服务:
资源简介:
Bittensor Subnet 13 X (Twitter) 数据集是来自Twitter的预处理数据集,作为Bittensor Subnet 13去中心化网络的一部分,包含了实时更新的推文数据,适用于多种自然语言处理任务。数据集以英文为主,但可能包含多语言内容。每条数据包含推文文本、情感或话题标签、话题标签列表、发布日期、用户名编码和URL编码等信息。该数据集需用户自行根据时间戳切分,不包含固定的数据切分。数据集在MIT许可下提供。
The Bittensor Subnet 13 X (Twitter) Dataset is a preprocessed dataset from Twitter, part of the Bittensor Subnet 13 decentralized network, containing real-time updated tweet data suitable for various natural language processing tasks. The dataset is primarily in English but may include multilingual content. Each data entry includes the tweet text, sentiment or topic label, a list of hashtags, the posting date, encoded username, and encoded URLs. The dataset requires users to create their own splits based on timestamps, without fixed data splits. It is provided under the MIT license.
提供机构:
momo1942



