KeithHorgan98/autotrain-data-TweetClimateAnalysis

Name: KeithHorgan98/autotrain-data-TweetClimateAnalysis
Creator: KeithHorgan98
Published: 2022-03-28 22:27:22
License: 暂无描述

Hugging Face2022-03-28 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/KeithHorgan98/autotrain-data-TweetClimateAnalysis

下载链接

链接失效反馈

官方服务：

资源简介：

--- task_categories: - text-classification --- # AutoTrain Dataset for project: TweetClimateAnalysis ## Dataset Descritpion This dataset has been automatically processed by AutoTrain for project TweetClimateAnalysis. ### Languages The BCP-47 code for the dataset's language is unk. ## Dataset Structure ### Data Instances A sample from this dataset looks as follows: ```json [ { "text": "What do you do if you are a global warming alarmist and real-world temperatures do not warm as much [...]", "target": 16 }, { "text": "(2.) A sun-blocking volcanic aerosols component to explain the sudden but temporary cooling of globa[...]", "target": 0 } ] ``` ### Dataset Fields The dataset has the following fields (also called "features"): ```json { "text": "Value(dtype='string', id=None)", "target": "ClassLabel(num_classes=18, names=['0_0', '1_1', '1_2', '1_3', '1_4', '1_6', '1_7', '2_1', '2_3', '3_1', '3_2', '3_3', '4_1', '4_2', '4_4', '4_5', '5_1', '5_2'], id=None)" } ``` ### Dataset Splits This dataset is split into a train and validation split. The split sizes are as follow: | Split name | Num samples | | ------------ | ------------------- | | train | 23436 | | valid | 2898 |

提供机构：

KeithHorgan98

原始信息汇总

数据集概述

数据集名称

AutoTrain Dataset for project: TweetClimateAnalysis

任务类别

文本分类

语言

语言代码：unk

数据集结构

数据实例

示例内容包括：
- text: 文本内容
- target: 目标分类标签

数据集字段

text: 字符串类型
target: 分类标签，共有18个类别

数据集分割

train: 23436个样本
valid: 2898个样本

5,000+

优质数据集

54 个

任务类型

进入经典数据集