rahmanansari/NER-Dataset
收藏Hugging Face2024-03-11 更新2024-06-22 收录
下载链接:
https://hf-mirror.com/datasets/rahmanansari/NER-Dataset
下载链接
链接失效反馈官方服务:
资源简介:
---
language:
- en
dataset_info:
features:
- name: tokens
sequence: string
- name: ner_tags
sequence:
class_label:
names:
'0': O
'1': B-PER
'2': I-PER
'3': B-ORG
'4': I-ORG
'5': B-LOC
'6': I-LOC
'7': B-MISC
'8': I-MISC
'9': B-ACTOR
'10': I-ACTOR
'11': B-TITLE
'12': I-TITLE
'13': B-YEAR
'14': I-YEAR
'15': B-GENRE
'16': I-GENRE
'17': B-PLOT
'18': I-PLOT
'19': B-DIRECTOR
'20': I-DIRECTOR
'21': B-RATINGS_AVERAGE
'22': I-RATINGS_AVERAGE
'23': B-RATING
'24': I-RATING
'25': B-CHARACTER
'26': I-CHARACTER
'27': B-SONG
'28': I-SONG
'29': B-REVIEW
'30': I-REVIEW
'31': B-TRAILER
'32': I-TRAILER
splits:
- name: train
num_bytes: 5483767
num_examples: 24638
- name: validation
num_bytes: 1362791
num_examples: 5826
download_size: 1601438
dataset_size: 6846558
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: validation
path: data/validation-*
---
提供机构:
rahmanansari
原始信息汇总
数据集概述
语言
- 英语(en)
数据集信息
特征
- tokens: 字符序列
- ner_tags: 命名实体识别标签序列
- 标签名称及其对应编码:
- 0: O
- 1: B-PER
- 2: I-PER
- 3: B-ORG
- 4: I-ORG
- 5: B-LOC
- 6: I-LOC
- 7: B-MISC
- 8: I-MISC
- 9: B-ACTOR
- 10: I-ACTOR
- 11: B-TITLE
- 12: I-TITLE
- 13: B-YEAR
- 14: I-YEAR
- 15: B-GENRE
- 16: I-GENRE
- 17: B-PLOT
- 18: I-PLOT
- 19: B-DIRECTOR
- 20: I-DIRECTOR
- 21: B-RATINGS_AVERAGE
- 22: I-RATINGS_AVERAGE
- 23: B-RATING
- 24: I-RATING
- 25: B-CHARACTER
- 26: I-CHARACTER
- 27: B-SONG
- 28: I-SONG
- 29: B-REVIEW
- 30: I-REVIEW
- 31: B-TRAILER
- 32: I-TRAILER
- 标签名称及其对应编码:
数据集划分
- train: 训练集
- 字节数: 5483767
- 样本数: 24638
- validation: 验证集
- 字节数: 1362791
- 样本数: 5826
数据集大小
- 下载大小: 1601438 字节
- 数据集总大小: 6846558 字节
配置
- config_name: default
- 数据文件路径:
- 训练集: data/train-*
- 验证集: data/validation-*
- 数据文件路径:



