five

An unambiguous POS tagging set

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/12721052
下载链接
链接失效反馈
官方服务:
资源简介:
This data set contains 1,123 short, POS-tagged sentences (extracted from an earlier data set; see https://zenodo.org/records/7694423), using the Universal tag set. The sentences can easily be POS tagged by a human tagger. However, standard POS taggers struggle with these sentences. The data file contains a header row that describes each column. The first column indicates the type of sentence (either a transcript of spoken text (0) or a sentence originating from written text (1)), the second column contains the actual sentence, with ground truth POS tags. The third column indicates the index of the mistagged token, and the remaining five columns show the tags assigned (of which at least one is a mistagging) of five different taggers.
创建时间:
2024-09-01
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作