euisuh15/synthetic-piss
收藏Hugging Face2023-12-24 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/euisuh15/synthetic-piss
下载链接
链接失效反馈官方服务:
资源简介:
---
configs:
- config_name: default
data_files:
- split: train10
path: data/train10-*
- split: train30
path: data/train30-*
- split: train50
path: data/train50-*
- split: train70
path: data/train70-*
- split: train90
path: data/train90-*
- split: valid1
path: data/valid1-*
- split: valid2
path: data/valid2-*
- split: test1
path: data/test1-*
- split: test2
path: data/test2-*
- split: test3
path: data/test3-*
dataset_info:
features:
- name: text
dtype: string
- name: is_poison
dtype: bool
- name: trigger_name
dtype: bool
- name: trigger_format
dtype: bool
splits:
- name: train10
num_bytes: 1856444
num_examples: 3746
- name: train30
num_bytes: 1832448
num_examples: 3741
- name: train50
num_bytes: 1809926
num_examples: 3728
- name: train70
num_bytes: 1779003
num_examples: 3701
- name: train90
num_bytes: 1761667
num_examples: 3703
- name: valid1
num_bytes: 222342
num_examples: 460
- name: valid2
num_bytes: 228818
num_examples: 464
- name: test1
num_bytes: 218556
num_examples: 460
- name: test2
num_bytes: 229206
num_examples: 466
- name: test3
num_bytes: 224024
num_examples: 466
download_size: 76278
dataset_size: 10162434
---
# Dataset Card for "final-final-qcri"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
euisuh15
原始信息汇总
数据集概述
数据文件配置
- 默认配置 (
default)- 训练集 (
train)train10: 路径data/train10-*train30: 路径data/train30-*train50: 路径data/train50-*train70: 路径data/train70-*train90: 路径data/train90-*
- 验证集 (
valid)valid1: 路径data/valid1-*valid2: 路径data/valid2-*
- 测试集 (
test)test1: 路径data/test1-*test2: 路径data/test2-*test3: 路径data/test3-*
- 训练集 (
数据集信息
-
特征 (
features)text: 类型stringis_poison: 类型booltrigger_name: 类型booltrigger_format: 类型bool
-
数据分割 (
splits)train10: 字节数1856444, 样本数3746train30: 字节数1832448, 样本数3741train50: 字节数1809926, 样本数3728train70: 字节数1779003, 样本数3701train90: 字节数1761667, 样本数3703valid1: 字节数222342, 样本数460valid2: 字节数228818, 样本数464test1: 字节数218556, 样本数460test2: 字节数229206, 样本数466test3: 字节数224024, 样本数466
-
数据集大小
- 下载大小:
76278字节 - 数据集大小:
10162434字节
- 下载大小:



