procit009/conll03
收藏Hugging Face2024-12-13 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/procit009/conll03
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含四个主要特征:id、tokens、pos_tags和chunk_tags、ner_tags。其中,pos_tags、chunk_tags和ner_tags是序列类型,且每个标签都有对应的类别名称。数据集分为一个训练集,包含14041个样本,总大小为6931345字节。
The dataset includes four main features: id, tokens, pos_tags, chunk_tags, and ner_tags. The id is of string type, tokens is a sequence of strings, and pos_tags, chunk_tags, and ner_tags are sequences of class labels. The dataset is divided into a training set, containing 14041 samples. The download size of the dataset is 1227788 bytes, and the dataset size is 6931345 bytes.
提供机构:
procit009



