DaniFrame/COLAPerturbed
Source: Hugging Face, updated 2023-06-21, indexed 2024-03-04
Download link:
https://hf-mirror.com/datasets/DaniFrame/COLAPerturbed
Resource description:
---
dataset_info:
  features:
  - name: sentence
    dtype: string
  - name: label
    dtype:
      class_label:
        names:
          '0': unacceptable
          '1': acceptable
  - name: idx
    dtype: int32
  splits:
  - name: cola_perturbed_keyboard_0.01
    num_bytes: 60525
    num_examples: 1063
  - name: cola_perturbed_keyboard_0.05
    num_bytes: 60535
    num_examples: 1063
  - name: cola_perturbed_keyboard_0.1
    num_bytes: 60576
    num_examples: 1063
  - name: cola_perturbed_ocr_0.01
    num_bytes: 60512
    num_examples: 1063
  - name: cola_perturbed_ocr_0.05
    num_bytes: 60512
    num_examples: 1063
  - name: cola_perturbed_ocr_0.1
    num_bytes: 60512
    num_examples: 1063
  - name: cola_perturbed_spellingerror_0.01
    num_bytes: 60715
    num_examples: 1063
  - name: cola_perturbed_spellingerror_0.05
    num_bytes: 60904
    num_examples: 1063
  - name: cola_perturbed_spellingerror_0.1
    num_bytes: 61374
    num_examples: 1063
  - name: cola_perturbed_typos_0.01
    num_bytes: 60541
    num_examples: 1063
  - name: cola_perturbed_typos_0.05
    num_bytes: 60566
    num_examples: 1063
  - name: cola_perturbed_typos_0.1
    num_bytes: 60536
    num_examples: 1063
  - name: cola_perturbed_sne_0.1
    num_bytes: 65395
    num_examples: 1063
  - name: cola_perturbed_sne_0.2
    num_bytes: 65429
    num_examples: 1063
  - name: cola_perturbed_sne_0.3
    num_bytes: 65620
    num_examples: 1063
  - name: cola_perturbed_sswn_0.1
    num_bytes: 61685
    num_examples: 1063
  - name: cola_perturbed_sswn_0.2
    num_bytes: 62032
    num_examples: 1063
  - name: cola_perturbed_sswn_0.3
    num_bytes: 63133
    num_examples: 1063
  - name: cola_perturbed_contraction
    num_bytes: 60518
    num_examples: 1063
  - name: cola_perturbed_insertadv
    num_bytes: 78182
    num_examples: 1063
  - name: cola_perturbed_prejudice
    num_bytes: 60510
    num_examples: 1063
  - name: cola_perturbed_punctuation
    num_bytes: 66036
    num_examples: 1063
  - name: cola_perturbed_reverseneg
    num_bytes: 66162
    num_examples: 1063
  - name: cola_perturbed_swapnum
    num_bytes: 60516
    num_examples: 1063
  - name: cola_perturbed_verbtense
    num_bytes: 60978
    num_examples: 1063
  - name: cola_perturbed_twitter
    num_bytes: 75043
    num_examples: 1063
  - name: cola_perturbed_wordcase
    num_bytes: 60513
    num_examples: 1063
  download_size: 1124448
  dataset_size: 1699560
---
# Dataset Card for "ColaPerturbed"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
Provider:
DaniFrame
Summary of original information
Dataset overview
Dataset name
"ColaPerturbed"
Dataset features
- sentence: string.
- label: class label with two classes, 0: unacceptable and 1: acceptable.
- idx: int32.
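The three features above can be mirrored in plain Python when handling rows outside the `datasets` library. The sketch below is only an illustration: the `Row` class and `label_name` helper are hypothetical names, not part of the dataset or of any library.

```python
from dataclasses import dataclass

# Class names in label order, as listed in the feature description above.
LABEL_NAMES = ["unacceptable", "acceptable"]


@dataclass(frozen=True)
class Row:
    """Hypothetical container mirroring the three dataset features."""
    sentence: str  # the (perturbed) sentence text
    label: int     # 0 or 1, an index into LABEL_NAMES
    idx: int       # example index (int32 in the dataset)


def label_name(label: int) -> str:
    """Map an integer label to its class name, rejecting unknown values."""
    if label not in (0, 1):
        raise ValueError(f"unexpected label: {label!r}")
    return LABEL_NAMES[label]
```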
Dataset splits
The dataset contains multiple splits, each with a different perturbation type and rate:
| Split name | Bytes | Examples |
|---|---|---|
| cola_perturbed_keyboard_0.01 | 60525 | 1063 |
| cola_perturbed_keyboard_0.05 | 60535 | 1063 |
| cola_perturbed_keyboard_0.1 | 60576 | 1063 |
| cola_perturbed_ocr_0.01 | 60512 | 1063 |
| cola_perturbed_ocr_0.05 | 60512 | 1063 |
| cola_perturbed_ocr_0.1 | 60512 | 1063 |
| cola_perturbed_spellingerror_0.01 | 60715 | 1063 |
| cola_perturbed_spellingerror_0.05 | 60904 | 1063 |
| cola_perturbed_spellingerror_0.1 | 61374 | 1063 |
| cola_perturbed_typos_0.01 | 60541 | 1063 |
| cola_perturbed_typos_0.05 | 60566 | 1063 |
| cola_perturbed_typos_0.1 | 60536 | 1063 |
| cola_perturbed_sne_0.1 | 65395 | 1063 |
| cola_perturbed_sne_0.2 | 65429 | 1063 |
| cola_perturbed_sne_0.3 | 65620 | 1063 |
| cola_perturbed_sswn_0.1 | 61685 | 1063 |
| cola_perturbed_sswn_0.2 | 62032 | 1063 |
| cola_perturbed_sswn_0.3 | 63133 | 1063 |
| cola_perturbed_contraction | 60518 | 1063 |
| cola_perturbed_insertadv | 78182 | 1063 |
| cola_perturbed_prejudice | 60510 | 1063 |
| cola_perturbed_punctuation | 66036 | 1063 |
| cola_perturbed_reverseneg | 66162 | 1063 |
| cola_perturbed_swapnum | 60516 | 1063 |
| cola_perturbed_verbtense | 60978 | 1063 |
| cola_perturbed_twitter | 75043 | 1063 |
| cola_perturbed_wordcase | 60513 | 1063 |
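The split names in the table follow a visible pattern: the prefix `cola_perturbed_`, a perturbation type, and an optional numeric suffix. The sketch below parses that pattern; it assumes the suffix is the perturbation rate, which the card does not define further. The helper names `parse_split` and `load_split` are illustrative. `load_split` additionally assumes the Hugging Face `datasets` package is installed, the Hub is reachable, and the repository ID matches the page title.

```python
import re
from typing import NamedTuple, Optional


class SplitInfo(NamedTuple):
    perturbation: str       # e.g. "keyboard", "ocr", "twitter"
    rate: Optional[float]   # numeric suffix if present, else None


# Pattern inferred from the split names listed in the table above.
_SPLIT_RE = re.compile(r"^cola_perturbed_([a-z]+)(?:_(\d+\.\d+))?$")


def parse_split(name: str) -> SplitInfo:
    """Split a name like 'cola_perturbed_keyboard_0.01' into its parts."""
    match = _SPLIT_RE.match(name)
    if match is None:
        raise ValueError(f"unrecognized split name: {name!r}")
    kind, rate = match.groups()
    return SplitInfo(kind, float(rate) if rate is not None else None)


def load_split(name: str):
    """Load one split from the Hub (assumes `datasets` is installed)."""
    from datasets import load_dataset  # imported lazily; needs network access
    parse_split(name)  # fail early on a malformed name
    return load_dataset("DaniFrame/COLAPerturbed", split=name)
```

For example, `parse_split("cola_perturbed_sswn_0.3")` yields the type `sswn` with rate `0.3`, while `parse_split("cola_perturbed_twitter")` yields `twitter` with no rate.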
Dataset size
- Download size: 1124448 bytes
- Dataset size: 1699560 bytes
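As a consistency check, the per-split byte counts from the table can be summed and compared with the reported dataset size. The short script below copies the figures from the table; the variable names are illustrative.

```python
# Byte counts per split, copied from the split table.
SPLIT_BYTES = {
    "cola_perturbed_keyboard_0.01": 60525,
    "cola_perturbed_keyboard_0.05": 60535,
    "cola_perturbed_keyboard_0.1": 60576,
    "cola_perturbed_ocr_0.01": 60512,
    "cola_perturbed_ocr_0.05": 60512,
    "cola_perturbed_ocr_0.1": 60512,
    "cola_perturbed_spellingerror_0.01": 60715,
    "cola_perturbed_spellingerror_0.05": 60904,
    "cola_perturbed_spellingerror_0.1": 61374,
    "cola_perturbed_typos_0.01": 60541,
    "cola_perturbed_typos_0.05": 60566,
    "cola_perturbed_typos_0.1": 60536,
    "cola_perturbed_sne_0.1": 65395,
    "cola_perturbed_sne_0.2": 65429,
    "cola_perturbed_sne_0.3": 65620,
    "cola_perturbed_sswn_0.1": 61685,
    "cola_perturbed_sswn_0.2": 62032,
    "cola_perturbed_sswn_0.3": 63133,
    "cola_perturbed_contraction": 60518,
    "cola_perturbed_insertadv": 78182,
    "cola_perturbed_prejudice": 60510,
    "cola_perturbed_punctuation": 66036,
    "cola_perturbed_reverseneg": 66162,
    "cola_perturbed_swapnum": 60516,
    "cola_perturbed_verbtense": 60978,
    "cola_perturbed_twitter": 75043,
    "cola_perturbed_wordcase": 60513,
}

EXAMPLES_PER_SPLIT = 1063    # identical for every split in the table
REPORTED_DATASET_SIZE = 1699560

total_bytes = sum(SPLIT_BYTES.values())
total_examples = len(SPLIT_BYTES) * EXAMPLES_PER_SPLIT

print(len(SPLIT_BYTES))                      # 27
print(total_bytes == REPORTED_DATASET_SIZE)  # True
print(total_examples)                        # 28701
```

The 27 split sizes add up exactly to the reported dataset size of 1699560 bytes, and the splits together hold 28701 examples.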



