Training corpus ssj500k 2.2
收藏B2FIND2026-04-29 收录
下载链接:
https://b2find.eudat.eu/dataset/61aeedd5-739d-5488-b34b-b229ea2a318a
下载链接
链接失效反馈官方服务:
资源简介:
The ssj500k training corpus contains about 500,000 tokens manually annotated on the levels of tokenisation, sentence segmentation, morphosyntactic tagging, and lemmatisation....



