suul999922/x_dataset_6
Hugging Face, updated 2025-01-26, indexed 2025-02-15
Download link:
https://hf-mirror.com/datasets/suul999922/x_dataset_6
Resource description:
The Bittensor Subnet 13 X (Twitter) Dataset is a collection of preprocessed Twitter data from the decentralized Bittensor Subnet 13 network. It provides a real-time stream of tweets for analytical and machine learning work, and supports natural language processing tasks such as text classification, named entity recognition, question answering, and text summarization. The dataset is primarily in English but may contain multilingual content because it is created in a decentralized way. Each data instance includes fields such as the tweet text, label, hashtags, posting date, encoded username, and encoded URLs. The dataset is continuously updated and has no fixed splits, so users should create their own splits based on their requirements and the data's timestamps.
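Because the description leaves splitting to the user, a timestamp-based split might look like the sketch below. It is only an illustration: the field name `datetime`, the sample rows, and the helper `split_by_timestamp` are assumptions, not taken from the dataset itself, and timestamps are assumed to be ISO 8601 strings without a timezone suffix.

```python
from datetime import datetime


def split_by_timestamp(records, cutoff, key="datetime"):
    """Split records into (train, test) around a cutoff datetime.

    Records posted strictly before `cutoff` go to train; the rest go to test.
    `key` is the name of the timestamp field (assumed, not confirmed).
    """
    train, test = [], []
    for rec in records:
        ts = datetime.fromisoformat(rec[key])  # parse the ISO 8601 string
        (train if ts < cutoff else test).append(rec)
    return train, test


# Hypothetical rows standing in for real dataset instances.
rows = [
    {"text": "first", "datetime": "2025-01-10T08:00:00"},
    {"text": "second", "datetime": "2025-01-20T12:30:00"},
    {"text": "third", "datetime": "2025-01-25T23:59:00"},
]

train, test = split_by_timestamp(rows, datetime(2025, 1, 20))
```

Splitting on time rather than at random keeps later tweets out of the training portion, which matters for a stream that keeps growing.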
Provider:
suul999922



