
marry-1111/x_dataset_0502178

Source: Hugging Face. Updated: 2025-07-29. Indexed: 2025-02-15.
Download link:
https://hf-mirror.com/datasets/marry-1111/x_dataset_0502178
Description:
The Bittensor Subnet 13 X (Twitter) Dataset is a decentralized social media dataset collected through the Bittensor Subnet 13 network and sourced primarily from the Twitter platform. The data is preprocessed, and each record includes the tweet text, a sentiment or topic label, a list of hashtags, the posting date, an encoded username, and an encoded URL. It supports NLP tasks such as text classification, named entity recognition, and sentiment analysis. The content is predominantly English but may also include other languages. Because the dataset is updated in real time, users need to create their own data splits based on their requirements and the record timestamps. The data adheres to Twitter's Terms of Service and API usage guidelines, and personally sensitive information is encoded for privacy.
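Since the description leaves splitting to the user, a timestamp-based split can be sketched as follows. This is a minimal illustration, not the dataset's official tooling: the field name `datetime`, the ISO 8601 timestamp format, and the sample rows are all assumptions, so check the actual column names and date format on the dataset page before reusing it.

```python
from datetime import datetime, timezone


def parse_timestamp(value):
    """Parse an ISO 8601 string into a timezone-aware UTC datetime."""
    # Older Python versions do not accept a trailing "Z", so normalize it.
    ts = datetime.fromisoformat(value.replace("Z", "+00:00"))
    if ts.tzinfo is None:
        # Assumption: timestamps without an offset are already in UTC.
        ts = ts.replace(tzinfo=timezone.utc)
    return ts.astimezone(timezone.utc)


def split_by_timestamp(records, cutoff, key="datetime"):
    """Split records into (before_cutoff, at_or_after_cutoff).

    `key` is a hypothetical field name for the posting date.
    """
    cutoff_ts = parse_timestamp(cutoff)
    train, test = [], []
    for record in records:
        if parse_timestamp(record[key]) < cutoff_ts:
            train.append(record)
        else:
            test.append(record)
    return train, test


# Hypothetical rows, shaped after the fields named in the description.
rows = [
    {"text": "first post", "datetime": "2025-01-15T10:00:00Z"},
    {"text": "second post", "datetime": "2025-02-01T00:00:00Z"},
    {"text": "third post", "datetime": "2025-03-02T08:30:00+00:00"},
]
train, test = split_by_timestamp(rows, cutoff="2025-02-01T00:00:00Z")
```

Splitting on a time cutoff rather than at random keeps later posts out of the training portion, which suits a dataset that keeps growing.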
Provider:
marry-1111