procit009/conll2003_NER_data
Source: Hugging Face. Updated 2024-12-15; indexed 2024-12-21.
Download link:
https://hf-mirror.com/datasets/procit009/conll2003_NER_data
Description:
This dataset contains text annotated for part-of-speech (POS) tagging, chunking, and named entity recognition (NER). Each record has five features: id, tokens, pos_tags, chunk_tags, and ner_tags. The last three are sequence labels that carry the POS, chunk, and NER annotations, respectively. The dataset consists of a single training split with 30 samples; the file size is 6538 bytes.
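The record layout described above can be sketched in plain Python. This is only an illustration: the field names come from the description, but the example values, the integer encoding of the tags, the one-label-per-token alignment, and the helper names `is_aligned` and `pair_tokens` are assumptions, not taken from the dataset itself.

```python
from typing import Dict, List, Tuple, Union

# A record maps each feature name to a string (id) or a list (tokens and tags).
Record = Dict[str, Union[str, List[str], List[int]]]

# The three sequence-label features named in the description.
TAG_FIELDS = ("pos_tags", "chunk_tags", "ner_tags")

# Hypothetical record: the values below are invented for illustration and
# the integer tag IDs are an assumed encoding, not the dataset's real labels.
example: Record = {
    "id": "0",
    "tokens": ["EU", "rejects", "German", "call"],
    "pos_tags": [22, 42, 16, 21],
    "chunk_tags": [11, 21, 11, 12],
    "ner_tags": [3, 0, 7, 0],
}


def is_aligned(record: Record) -> bool:
    """Return True if every tag sequence has exactly one label per token."""
    n_tokens = len(record["tokens"])
    return all(len(record[field]) == n_tokens for field in TAG_FIELDS)


def pair_tokens(record: Record, field: str) -> List[Tuple[str, int]]:
    """Pair each token with its label from the given tag field."""
    return list(zip(record["tokens"], record[field]))


if __name__ == "__main__":
    print(is_aligned(example))
    print(pair_tokens(example, "ner_tags"))
```

If the dataset is still published under the same name, `datasets.load_dataset("procit009/conll2003_NER_data", split="train")` from the Hugging Face `datasets` library would be the usual way to obtain real records, which could then be passed to the same checks.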
Provider:
procit009