rainbowbridge/x_dataset_113
Source: Hugging Face. Updated: 2024-12-13. Indexed: 2024-12-14.
Download link:
https://hf-mirror.com/datasets/rainbowbridge/x_dataset_113
Summary:
The Bittensor Subnet 13 X (Twitter) Dataset is part of the Bittensor Subnet 13 decentralized network and contains preprocessed data from X (formerly Twitter). Network miners update it continuously, providing a real-time stream of tweets for a range of analytical and machine learning tasks. Supported tasks include sentiment analysis, trend detection, content analysis, and user behavior modeling. The primary language is English, but because the data is collected in a decentralized way, the dataset may also contain other languages. Each data instance includes the fields text, label, tweet_hashtags, datetime, username_encoded, and url_encoded. The dataset was created in line with X's terms of service and usage guidelines, and all usernames and URLs are encoded to protect user privacy. Users should be aware of potential biases and limitations, such as variations in data quality, noise, and temporal bias. The dataset is released under the MIT license, and its use is also subject to X's terms of use. The dataset statistics cover the total number of instances, the date range, and the last-updated time, along with the data distribution and top hashtags.
Provided by:
rainbowbridge
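
The summary lists the per-tweet fields and mentions statistics such as total instances, date range, and top hashtags. As a rough illustration of how those statistics could be derived from records with these fields, here is a minimal sketch. The sample records, the `summarize` helper, the assumption that `tweet_hashtags` is a list of strings, and the assumption that `datetime` is an ISO 8601 string are all hypothetical and not confirmed by the dataset card.

```python
from collections import Counter
from datetime import datetime

# Hypothetical sample records using the field names from the summary.
# The values, and the ISO 8601 datetime format, are assumptions.
sample = [
    {"text": "first tweet", "label": "tech", "tweet_hashtags": ["#AI", "#Bittensor"],
     "datetime": "2024-12-01T10:00:00Z", "username_encoded": "u1", "url_encoded": "l1"},
    {"text": "second tweet", "label": "tech", "tweet_hashtags": ["#ai"],
     "datetime": "2024-12-03T08:30:00Z", "username_encoded": "u2", "url_encoded": "l2"},
    {"text": "third tweet", "label": "other", "tweet_hashtags": [],
     "datetime": "2024-12-02T12:00:00Z", "username_encoded": "u3", "url_encoded": "l3"},
]


def summarize(records, top_n=10):
    """Count records, find the date range, and rank hashtags (case-insensitive)."""
    hashtags = Counter()
    timestamps = []
    total = 0
    for rec in records:
        total += 1
        # Treat a missing or None hashtag field as an empty list.
        hashtags.update(tag.lower() for tag in (rec.get("tweet_hashtags") or []))
        ts = rec.get("datetime")
        if ts:
            # Replace a trailing "Z" so fromisoformat accepts it on older Pythons.
            timestamps.append(datetime.fromisoformat(ts.replace("Z", "+00:00")))
    return {
        "total": total,
        "earliest": min(timestamps) if timestamps else None,
        "latest": max(timestamps) if timestamps else None,
        "top_hashtags": hashtags.most_common(top_n),
    }


summary = summarize(sample)
print(summary)
```

The same helper should work on any iterable of dictionaries, so it could in principle be fed a streamed copy of the dataset (for example via the Hugging Face `datasets` library), provided the real field types match the assumptions above.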



