five

DaniFrame/COLAPerturbed

收藏
Hugging Face2023-06-21 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/DaniFrame/COLAPerturbed
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: features: - name: sentence dtype: string - name: label dtype: class_label: names: '0': unacceptable '1': acceptable - name: idx dtype: int32 splits: - name: cola_perturbed_keyboard_0.01 num_bytes: 60525 num_examples: 1063 - name: cola_perturbed_keyboard_0.05 num_bytes: 60535 num_examples: 1063 - name: cola_perturbed_keyboard_0.1 num_bytes: 60576 num_examples: 1063 - name: cola_perturbed_ocr_0.01 num_bytes: 60512 num_examples: 1063 - name: cola_perturbed_ocr_0.05 num_bytes: 60512 num_examples: 1063 - name: cola_perturbed_ocr_0.1 num_bytes: 60512 num_examples: 1063 - name: cola_perturbed_spellingerror_0.01 num_bytes: 60715 num_examples: 1063 - name: cola_perturbed_spellingerror_0.05 num_bytes: 60904 num_examples: 1063 - name: cola_perturbed_spellingerror_0.1 num_bytes: 61374 num_examples: 1063 - name: cola_perturbed_typos_0.01 num_bytes: 60541 num_examples: 1063 - name: cola_perturbed_typos_0.05 num_bytes: 60566 num_examples: 1063 - name: cola_perturbed_typos_0.1 num_bytes: 60536 num_examples: 1063 - name: cola_perturbed_sne_0.1 num_bytes: 65395 num_examples: 1063 - name: cola_perturbed_sne_0.2 num_bytes: 65429 num_examples: 1063 - name: cola_perturbed_sne_0.3 num_bytes: 65620 num_examples: 1063 - name: cola_perturbed_sswn_0.1 num_bytes: 61685 num_examples: 1063 - name: cola_perturbed_sswn_0.2 num_bytes: 62032 num_examples: 1063 - name: cola_perturbed_sswn_0.3 num_bytes: 63133 num_examples: 1063 - name: cola_perturbed_contraction num_bytes: 60518 num_examples: 1063 - name: cola_perturbed_insertadv num_bytes: 78182 num_examples: 1063 - name: cola_perturbed_prejudice num_bytes: 60510 num_examples: 1063 - name: cola_perturbed_punctuation num_bytes: 66036 num_examples: 1063 - name: cola_perturbed_reverseneg num_bytes: 66162 num_examples: 1063 - name: cola_perturbed_swapnum num_bytes: 60516 num_examples: 1063 - name: cola_perturbed_verbtense num_bytes: 60978 num_examples: 1063 - name: cola_perturbed_twitter num_bytes: 75043 num_examples: 1063 - name: cola_perturbed_wordcase num_bytes: 60513 num_examples: 1063 download_size: 1124448 dataset_size: 1699560 --- # Dataset Card for "ColaPerturbed" [More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
DaniFrame
原始信息汇总

数据集概述

数据集名称

"ColaPerturbed"

数据集特征

  • sentence: 数据类型为字符串。
  • label: 数据类型为分类标签,包含两个类别:0: unacceptable 和 1: acceptable。
  • idx: 数据类型为int32。

数据集分割

数据集包含多个分割,每个分割具有不同的扰动类型和比例,具体如下:

分割名称 字节数 示例数
cola_perturbed_keyboard_0.01 60525 1063
cola_perturbed_keyboard_0.05 60535 1063
cola_perturbed_keyboard_0.1 60576 1063
cola_perturbed_ocr_0.01 60512 1063
cola_perturbed_ocr_0.05 60512 1063
cola_perturbed_ocr_0.1 60512 1063
cola_perturbed_spellingerror_0.01 60715 1063
cola_perturbed_spellingerror_0.05 60904 1063
cola_perturbed_spellingerror_0.1 61374 1063
cola_perturbed_typos_0.01 60541 1063
cola_perturbed_typos_0.05 60566 1063
cola_perturbed_typos_0.1 60536 1063
cola_perturbed_sne_0.1 65395 1063
cola_perturbed_sne_0.2 65429 1063
cola_perturbed_sne_0.3 65620 1063
cola_perturbed_sswn_0.1 61685 1063
cola_perturbed_sswn_0.2 62032 1063
cola_perturbed_sswn_0.3 63133 1063
cola_perturbed_contraction 60518 1063
cola_perturbed_insertadv 78182 1063
cola_perturbed_prejudice 60510 1063
cola_perturbed_punctuation 66036 1063
cola_perturbed_reverseneg 66162 1063
cola_perturbed_swapnum 60516 1063
cola_perturbed_verbtense 60978 1063
cola_perturbed_twitter 75043 1063
cola_perturbed_wordcase 60513 1063

数据集大小

  • 下载大小:1124448字节
  • 数据集大小:1699560字节
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作