datasciathlete/corpus4everyone-klue-korean-NER
收藏Hugging Face2024-02-27 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/datasciathlete/corpus4everyone-klue-korean-NER
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: ner_tags
sequence:
class_label:
names:
"0": B-PS,
"1": I-PS,
"2": B-FD,
"3": I-FD,
"4": B-TR,
"5": I-TR,
"6": B-AF,
"7": I-AF,
"8": B-OG,
"9": I-OG,
"10": B-LC,
"11": I-LC,
"12": B-CV,
"13": I-CV,
"14": B-DT,
"15": I-DT,
"16": B-TI,
"17": I-TI,
"18": B-QT,
"19": I-QT,
"20": B-EV,
"21": I-EV,
"22": B-AM,
"23": I-AM,
"24": B-PT,
"25": I-PT,
"26": B-MT,
"27": I-MT,
"28": B-TM,
"29": I-TM,
"30": O
- name: tokens
sequence: string
splits:
- name: train
num_bytes: 166572779.43135825
num_examples: 138015
- name: validation
num_bytes: 42859683.236356184
num_examples: 34252
download_size: 22991576
dataset_size: 209432462.66771442
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: validation
path: data/validation-*
---
提供机构:
datasciathlete
原始信息汇总
数据集概述
特征
- ner_tags: 序列特征,包含以下类别标签:
- "0": B-PS
- "1": I-PS
- "2": B-FD
- "3": I-FD
- "4": B-TR
- "5": I-TR
- "6": B-AF
- "7": I-AF
- "8": B-OG
- "9": I-OG
- "10": B-LC
- "11": I-LC
- "12": B-CV
- "13": I-CV
- "14": B-DT
- "15": I-DT
- "16": B-TI
- "17": I-TI
- "18": B-QT
- "19": I-QT
- "20": B-EV
- "21": I-EV
- "22": B-AM
- "23": I-AM
- "24": B-PT
- "25": I-PT
- "26": B-MT
- "27": I-MT
- "28": B-TM
- "29": I-TM
- "30": O
- tokens: 序列特征,字符串类型
数据分割
- train:
- 字节数: 166572779.43135825
- 样本数: 138015
- validation:
- 字节数: 42859683.236356184
- 样本数: 34252
数据集大小
- 下载大小: 22991576
- 数据集大小: 209432462.66771442
配置
- default:
- 训练数据路径: data/train-*
- 验证数据路径: data/validation-*



