five

DaniFrame/HSOLPerturbed

收藏
Hugging Face2023-06-21 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/DaniFrame/HSOLPerturbed
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: features: - name: count dtype: int64 - name: hate_speech_count dtype: int64 - name: offensive_language_count dtype: int64 - name: neither_count dtype: int64 - name: class dtype: class_label: names: '0': hate speech '1': offensive language '2': neither - name: tweet dtype: string splits: - name: hsol_perturbed_keyboard_0.01 num_bytes: 651089 num_examples: 4957 - name: hsol_perturbed_keyboard_0.05 num_bytes: 651333 num_examples: 4957 - name: hsol_perturbed_keyboard_0.1 num_bytes: 651720 num_examples: 4957 - name: hsol_perturbed_ocr_0.01 num_bytes: 651029 num_examples: 4957 - name: hsol_perturbed_ocr_0.05 num_bytes: 651047 num_examples: 4957 - name: hsol_perturbed_ocr_0.1 num_bytes: 651059 num_examples: 4957 - name: hsol_perturbed_spellingerror_0.01 num_bytes: 652461 num_examples: 4957 - name: hsol_perturbed_spellingerror_0.05 num_bytes: 656504 num_examples: 4957 - name: hsol_perturbed_spellingerror_0.1 num_bytes: 661760 num_examples: 4957 - name: hsol_perturbed_typos_0.01 num_bytes: 651173 num_examples: 4957 - name: hsol_perturbed_typos_0.05 num_bytes: 651752 num_examples: 4957 - name: hsol_perturbed_typos_0.1 num_bytes: 652435 num_examples: 4957 - name: hsol_perturbed_sne_0.1 num_bytes: 650990 num_examples: 4957 - name: hsol_perturbed_sne_0.2 num_bytes: 650690 num_examples: 4957 - name: hsol_perturbed_sne_0.3 num_bytes: 651339 num_examples: 4957 - name: hsol_perturbed_sswn_0.1 num_bytes: 661571 num_examples: 4957 - name: hsol_perturbed_sswn_0.2 num_bytes: 672414 num_examples: 4957 - name: hsol_perturbed_sswn_0.3 num_bytes: 684467 num_examples: 4957 - name: hsol_perturbed_contraction num_bytes: 648114 num_examples: 4957 - name: hsol_perturbed_insertadv num_bytes: 764862 num_examples: 4957 - name: hsol_perturbed_prejudice num_bytes: 645247 num_examples: 4957 - name: hsol_perturbed_punctuation num_bytes: 679299 num_examples: 4957 - name: hsol_perturbed_reverseneg num_bytes: 665446 num_examples: 4957 - name: hsol_perturbed_swapnum num_bytes: 646336 num_examples: 4957 - name: hsol_perturbed_verbtense num_bytes: 654617 num_examples: 4957 - name: hsol_perturbed_twitter num_bytes: 719030 num_examples: 4957 - name: hsol_perturbed_wordcase num_bytes: 645191 num_examples: 4957 download_size: 6331984 dataset_size: 17872975 --- # Dataset Card for "HSOLPerturbed" [More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
DaniFrame
原始信息汇总

数据集概述

数据集名称

  • HSOLPerturbed

数据集特征

  • count:整数类型(int64)
  • hate_speech_count:整数类型(int64)
  • offensive_language_count:整数类型(int64)
  • neither_count:整数类型(int64)
  • class:分类标签,包括:
    • 0: hate speech
    • 1: offensive language
    • 2: neither
  • tweet:字符串类型(string)

数据集分割

  • 数据集包含多个分割,每个分割的名称、字节数和示例数如下:
    • hsol_perturbed_keyboard_0.01:651089字节,4957个示例
    • hsol_perturbed_keyboard_0.05:651333字节,4957个示例
    • hsol_perturbed_keyboard_0.1:651720字节,4957个示例
    • hsol_perturbed_ocr_0.01:651029字节,4957个示例
    • hsol_perturbed_ocr_0.05:651047字节,4957个示例
    • hsol_perturbed_ocr_0.1:651059字节,4957个示例
    • hsol_perturbed_spellingerror_0.01:652461字节,4957个示例
    • hsol_perturbed_spellingerror_0.05:656504字节,4957个示例
    • hsol_perturbed_spellingerror_0.1:661760字节,4957个示例
    • hsol_perturbed_typos_0.01:651173字节,4957个示例
    • hsol_perturbed_typos_0.05:651752字节,4957个示例
    • hsol_perturbed_typos_0.1:652435字节,4957个示例
    • hsol_perturbed_sne_0.1:650990字节,4957个示例
    • hsol_perturbed_sne_0.2:650690字节,4957个示例
    • hsol_perturbed_sne_0.3:651339字节,4957个示例
    • hsol_perturbed_sswn_0.1:661571字节,4957个示例
    • hsol_perturbed_sswn_0.2:672414字节,4957个示例
    • hsol_perturbed_sswn_0.3:684467字节,4957个示例
    • hsol_perturbed_contraction:648114字节,4957个示例
    • hsol_perturbed_insertadv:764862字节,4957个示例
    • hsol_perturbed_prejudice:645247字节,4957个示例
    • hsol_perturbed_punctuation:679299字节,4957个示例
    • hsol_perturbed_reverseneg:665446字节,4957个示例
    • hsol_perturbed_swapnum:646336字节,4957个示例
    • hsol_perturbed_verbtense:654617字节,4957个示例
    • hsol_perturbed_twitter:719030字节,4957个示例
    • hsol_perturbed_wordcase:645191字节,4957个示例

数据集大小

  • 下载大小:6331984字节
  • 数据集大小:17872975字节
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作