alzoubi36/piextract
Source: Hugging Face. Updated 2023-06-25. Indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/alzoubi36/piextract
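
A minimal sketch of loading the dataset with the Hugging Face `datasets` library, assuming that library is installed and that the repository id matches the page title. The helper name `load_piextract` and its `use_mirror` flag are hypothetical. Routing through the mirror via the `HF_ENDPOINT` environment variable is an assumption that only takes effect if the variable is set before the Hugging Face libraries are first imported.

```python
import os

# Repository id taken from the page title; the mirror host comes from the link above.
REPO_ID = "alzoubi36/piextract"
MIRROR_ENDPOINT = "https://hf-mirror.com"


def load_piextract(use_mirror: bool = False):
    """Hypothetical helper: return the train/test/validation splits."""
    if use_mirror:
        # Assumption: HF_ENDPOINT must be set before the Hugging Face
        # libraries are imported, hence the lazy import below.
        os.environ.setdefault("HF_ENDPOINT", MIRROR_ENDPOINT)
    from datasets import load_dataset  # requires `pip install datasets`

    return load_dataset(REPO_ID)
```

Calling `load_piextract()` needs network access; the returned object should expose the `train`, `test`, and `validation` splits listed in the metadata.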
Resource description:
---
dataset_info:
  features:
  - name: COLLECT
    struct:
    - name: subtask
      dtype: string
    - name: tags
      sequence: string
    - name: tokens
      sequence: string
  - name: NOT_COLLECT
    struct:
    - name: subtask
      dtype: string
    - name: tags
      sequence: string
    - name: tokens
      sequence: string
  - name: NOT_SHARE
    struct:
    - name: subtask
      dtype: string
    - name: tags
      sequence: string
    - name: tokens
      sequence: string
  - name: SHARE
    struct:
    - name: subtask
      dtype: string
    - name: tags
      sequence: string
    - name: tokens
      sequence: string
  splits:
  - name: train
    num_bytes: 3453408
    num_examples: 2579
  - name: test
    num_bytes: 1580498
    num_examples: 1029
  - name: validation
    num_bytes: 662810
    num_examples: 456
  download_size: 1013894
  dataset_size: 5696716
---
# Dataset for the PI-Extract task in the [PrivacyGLUE](https://github.com/infsys-lab/privacy-glue) dataset
Provided by:
alzoubi36
Summary of original information
Dataset overview
Dataset features
- COLLECT
  - subtask: string
  - tags: sequence of strings
  - tokens: sequence of strings
- NOT_COLLECT
  - subtask: string
  - tags: sequence of strings
  - tokens: sequence of strings
- NOT_SHARE
  - subtask: string
  - tags: sequence of strings
  - tokens: sequence of strings
- SHARE
  - subtask: string
  - tags: sequence of strings
  - tokens: sequence of strings
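
The feature layout above can be illustrated with a short sketch that groups each field's `tokens` into labelled spans using the parallel `tags` sequence. The source does not state the tag scheme, so the BIO format (`B-…`, `I-…`, `O`) is an assumption. The record below, including its `subtask` values, is hypothetical and not taken from the dataset, and `extract_spans` is an illustrative helper name.

```python
from typing import Dict, List, Tuple

FIELDS = ("COLLECT", "NOT_COLLECT", "NOT_SHARE", "SHARE")


def extract_spans(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Group tokens into (label, text) spans, assuming BIO-style tags."""
    if len(tokens) != len(tags):
        raise ValueError("tokens and tags must have the same length")
    spans: List[Tuple[str, str]] = []
    label, buffer = None, []
    for token, tag in zip(tokens, tags):
        prefix, _, tag_label = tag.partition("-")
        if prefix == "B" or (prefix == "I" and tag_label != label):
            # A new span starts; close the previous one if it is open.
            if buffer:
                spans.append((label, " ".join(buffer)))
            label, buffer = tag_label, [token]
        elif prefix == "I":
            buffer.append(token)  # continue the current span
        else:
            # "O" (or any unrecognised tag) ends the current span.
            if buffer:
                spans.append((label, " ".join(buffer)))
            label, buffer = None, []
    if buffer:
        spans.append((label, " ".join(buffer)))
    return spans


# Hypothetical record shaped like the schema: four structs, each holding
# a subtask string plus parallel token and tag sequences.
tokens = ["We", "collect", "your", "email", "address", "."]
outside = ["O"] * len(tokens)
example: Dict[str, dict] = {
    field: {"subtask": field, "tokens": tokens, "tags": outside} for field in FIELDS
}
example["COLLECT"]["tags"] = ["O", "O", "O", "B-COLLECT", "I-COLLECT", "O"]

spans_by_field = {
    field: extract_spans(example[field]["tokens"], example[field]["tags"])
    for field in FIELDS
}
print(spans_by_field["COLLECT"])
```

If the real tags follow a different scheme, only the prefix handling in `extract_spans` would need to change; the per-field loop stays the same.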
Dataset splits
- train
  - num_bytes: 3453408
  - num_examples: 2579
- test
  - num_bytes: 1580498
  - num_examples: 1029
- validation
  - num_bytes: 662810
  - num_examples: 456
Dataset size
- download_size: 1013894
- dataset_size: 5696716
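
As a quick consistency check on the figures above, the sketch below sums the per-split values and compares them with the reported totals. It uses only the numbers listed in the metadata; the variable names are illustrative. Reading `download_size` as the compressed download and `dataset_size` as the size of the prepared data is an assumption.

```python
# Split statistics copied from the metadata above.
SPLITS = {
    "train": {"num_bytes": 3453408, "num_examples": 2579},
    "test": {"num_bytes": 1580498, "num_examples": 1029},
    "validation": {"num_bytes": 662810, "num_examples": 456},
}
DOWNLOAD_SIZE = 1013894
DATASET_SIZE = 5696716

total_examples = sum(s["num_examples"] for s in SPLITS.values())  # 4064
total_bytes = sum(s["num_bytes"] for s in SPLITS.values())

# The per-split byte counts add up exactly to the reported dataset_size.
assert total_bytes == DATASET_SIZE

for name, stats in SPLITS.items():
    share = stats["num_examples"] / total_examples
    print(f"{name}: {stats['num_examples']} examples ({share:.1%})")

print(f"download_size / dataset_size = {DOWNLOAD_SIZE / DATASET_SIZE:.3f}")
```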



