okoc/toxigen_per
收藏Hugging Face2024-05-28 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/okoc/toxigen_per
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: text
dtype: string
- name: target_group
dtype: string
- name: factual?
dtype: string
- name: ingroup_effect
dtype: string
- name: lewd
dtype: string
- name: framing
dtype: string
- name: predicted_group
dtype: string
- name: stereotyping
dtype: string
- name: intent
dtype: float64
- name: toxicity_ai
dtype: float64
- name: toxicity_human
dtype: float64
- name: predicted_author
dtype: string
- name: actual_method
dtype: string
- name: text_per_B
dtype: string
- name: text_per_C
dtype: string
- name: text_per_D
dtype: string
- name: text_per_E
dtype: string
splits:
- name: test
num_bytes: 238808.50638297873
num_examples: 278
- name: train
num_bytes: 1744770.2651785715
num_examples: 2132
download_size: 918172
dataset_size: 1983578.7715615502
configs:
- config_name: default
data_files:
- split: test
path: data/test-*
- split: train
path: data/train-*
---
提供机构:
okoc
原始信息汇总
数据集概述
数据集特征
- text: 文本类型,字符串
- target_group: 目标群体,字符串
- factual?: 事实性,字符串
- ingroup_effect: 群体内效应,字符串
- lewd: 下流,字符串
- framing: 框架,字符串
- predicted_group: 预测群体,字符串
- stereotyping: 刻板印象,字符串
- intent: 意图,浮点数
- toxicity_ai: AI毒性,浮点数
- toxicity_human: 人类毒性,浮点数
- predicted_author: 预测作者,字符串
- actual_method: 实际方法,字符串
- text_per_B: B类文本,字符串
- text_per_C: C类文本,字符串
- text_per_D: D类文本,字符串
- text_per_E: E类文本,字符串
数据集分割
- test: 测试集,包含278个样本,占用238808.50638297873字节
- train: 训练集,包含2132个样本,占用1744770.2651785715字节
数据集大小
- 下载大小: 918172字节
- 数据集大小: 1983578.7715615502字节
数据文件配置
- config_name: default
- data_files:
- split: test, path: data/test-*
- split: train, path: data/train-*



