KeithHorgan98/autotrain-data-TweetClimateAnalysis
收藏Hugging Face2022-03-28 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/KeithHorgan98/autotrain-data-TweetClimateAnalysis
下载链接
链接失效反馈官方服务:
资源简介:
---
task_categories:
- text-classification
---
# AutoTrain Dataset for project: TweetClimateAnalysis
## Dataset Descritpion
This dataset has been automatically processed by AutoTrain for project TweetClimateAnalysis.
### Languages
The BCP-47 code for the dataset's language is unk.
## Dataset Structure
### Data Instances
A sample from this dataset looks as follows:
```json
[
{
"text": "What do you do if you are a global warming alarmist and real-world temperatures do not warm as much [...]",
"target": 16
},
{
"text": "(2.) A sun-blocking volcanic aerosols component to explain the sudden but temporary cooling of globa[...]",
"target": 0
}
]
```
### Dataset Fields
The dataset has the following fields (also called "features"):
```json
{
"text": "Value(dtype='string', id=None)",
"target": "ClassLabel(num_classes=18, names=['0_0', '1_1', '1_2', '1_3', '1_4', '1_6', '1_7', '2_1', '2_3', '3_1', '3_2', '3_3', '4_1', '4_2', '4_4', '4_5', '5_1', '5_2'], id=None)"
}
```
### Dataset Splits
This dataset is split into a train and validation split. The split sizes are as follow:
| Split name | Num samples |
| ------------ | ------------------- |
| train | 23436 |
| valid | 2898 |
提供机构:
KeithHorgan98
原始信息汇总
数据集概述
数据集名称
AutoTrain Dataset for project: TweetClimateAnalysis
任务类别
- 文本分类
语言
- 语言代码:unk
数据集结构
数据实例
- 示例内容包括:
- text: 文本内容
- target: 目标分类标签
数据集字段
- text: 字符串类型
- target: 分类标签,共有18个类别
数据集分割
- train: 23436个样本
- valid: 2898个样本



