five

intelli-zen/part_of_speech

收藏
Hugging Face2024-08-21 更新2025-04-26 收录
下载链接:
https://hf-mirror.com/datasets/intelli-zen/part_of_speech
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: apache-2.0 --- ## 词性标注数据集 ```text 汉语 https://huggingface.co/datasets/ontonotes/conll2012_ontonotesv5 https://huggingface.co/datasets/eriktks/conll2002 https://huggingface.co/datasets/eriktks/conll2003 https://huggingface.co/datasets/tjspross/ctb6 https://huggingface.co/datasets/AlienKevin/ctb8 https://www.modelscope.cn/datasets/dingkun/chinese_pos_ctb6 人民日报词性标注语料数据 http://shujujishi.com/dataset/a27b9a15-24ab-4dab-aa2b-7e073458973c.html https://www.modelscope.cn/datasets/izhx404/ud1_pos 英语 https://huggingface.co/datasets/Cyberfish/pos_tagger https://huggingface.co/datasets/strombergnlp/twitter_pos https://huggingface.co/datasets/strombergnlp/twitter_pos_vcb https://huggingface.co/datasets/batterydata/pos_tagging 波兰语 https://huggingface.co/datasets/clarin-pl/nkjp-pos 阿拉伯语 https://huggingface.co/datasets/QCRI/arabic_pos_dialect ```

许可证:Apache-2.0 ## 词性标注数据集 ### 汉语 1. CONLL2012版OntoNotes 5.0数据集:https://huggingface.co/datasets/ontonotes/conll2012_ontonotesv5 2. CONLL2002 数据集:https://huggingface.co/datasets/eriktks/conll2002 3. CONLL2003 数据集:https://huggingface.co/datasets/eriktks/conll2003 4. 中文树库6.0(Chinese Treebank 6,CTB6)数据集:https://huggingface.co/datasets/tjspross/ctb6 5. 中文树库8.0(Chinese Treebank 8,CTB8)数据集:https://huggingface.co/datasets/AlienKevin/ctb8 6. 基于中文树库6.0的汉语词性标注(Part-of-Speech,POS)数据集:https://www.modelscope.cn/datasets/dingkun/chinese_pos_ctb6 7. 人民日报词性标注语料数据集:http://shujujishi.com/dataset/a27b9a15-24ab-4dab-aa2b-7e073458973c.html 8. 通用依存树库1.0词性标注数据集:https://www.modelscope.cn/datasets/izhx404/ud1_pos ### 英语 1. 词性标注(Part-of-Speech,POS)数据集:https://huggingface.co/datasets/Cyberfish/pos_tagger 2. 推特词性标注数据集:https://huggingface.co/datasets/strombergnlp/twitter_pos 3. 推特词性标注词汇表数据集:https://huggingface.co/datasets/strombergnlp/twitter_pos_vcb 4. 词性标注(Part-of-Speech,POS)数据集:https://huggingface.co/datasets/batterydata/pos_tagging ### 波兰语 波兰语国家语料库(National Corpus of Polish,NKJP)词性标注数据集:https://huggingface.co/datasets/clarin-pl/nkjp-pos ### 阿拉伯语 阿拉伯语方言词性标注数据集:https://huggingface.co/datasets/QCRI/arabic_pos_dialect
提供机构:
intelli-zen
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作