Training corpus ssj500k 2.0
收藏B2FIND2026-04-29 收录
下载链接:
https://b2find.eudat.eu/dataset/444534b7-c4ad-572d-b44d-2103a4395339
下载链接
链接失效反馈官方服务:
资源简介:
The ssj500k training corpus contains about 500,000 tokens manually annotated on the levels of tokenisation, sentence segmentation, morphosyntactic tagging, and lemmatisation....



