patriziobellan/PETv11
收藏Hugging Face2023-05-01 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/patriziobellan/PETv11
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: document name
dtype: string
- name: tokens
sequence: string
- name: tokens-IDs
sequence: int8
- name: ner_tags
sequence: string
- name: sentence-IDs
sequence: int8
- name: relations
sequence:
- name: source-head-sentence-ID
dtype: int8
- name: source-head-word-ID
dtype: int8
- name: relation-type
dtype: string
- name: target-head-sentence-ID
dtype: int8
- name: target-head-word-ID
dtype: int8
splits:
- name: test
num_bytes: 203379
num_examples: 45
download_size: 38326
dataset_size: 203379
---
This is the version 1.1.0 of the original PET dataset.
in this version we fixed ``the Performs Relations'' and few minor errors.
Please refer to the original [PET Dataset repository](https://huggingface.co/datasets/patriziobellan/PET) for more info.
提供机构:
patriziobellan
原始信息汇总
数据集概述
数据集特征
- document name:文档名称,数据类型为字符串。
- tokens:词元,数据类型为字符串序列。
- tokens-IDs:词元ID,数据类型为8位整数序列。
- ner_tags:命名实体识别标签,数据类型为字符串序列。
- sentence-IDs:句子ID,数据类型为8位整数序列。
- relations:关系信息,包含以下子特征:
- source-head-sentence-ID:源头部句子ID,数据类型为8位整数。
- source-head-word-ID:源头部词ID,数据类型为8位整数。
- relation-type:关系类型,数据类型为字符串。
- target-head-sentence-ID:目标头部句子ID,数据类型为8位整数。
- target-head-word-ID:目标头部词ID,数据类型为8位整数。
数据集分割
- test:测试集,包含45个样本,总大小为203379字节。
数据集大小
- 下载大小:38326字节。
- 数据集总大小:203379字节。



