Serbian Twitter training corpus ReLDI-NormTagNER-sr 3.0
Source: SSH Open MarketPlace. Updated: 2025-07-04. Indexed: 2025-07-05.
Download link:
https://marketplace.sshopencloud.eu/dataset/IXVwxc
Resource description:
This corpus contains manually annotated Serbian tweets. It is meant as a gold-standard training and testing dataset for tokenisation, sentence segmentation, word normalisation, morphosyntactic tagging, lemmatisation and named entity recognition of non-standard Serbian. Each tweet also carries automatically assigned standardness levels (T = technical standardness, L = linguistic standardness).
The corpus is available for download from the CLARIN.SI repository.
Created:
2025-07-04



