Serbian Twitter training corpus ReLDI-NormTagNER-sr 3.0
收藏SSH Open MarketPlace2023-10-13 更新2024-08-03 收录
下载链接:
https://marketplace.sshopencloud.eu/dataset/z1cwu1
下载链接
链接失效反馈官方服务:
资源简介:
This corpus contains manually annotated Serbian tweets. It is meant as a gold-standard training and testing dataset for tokenisation, sentence segmentation, word normalisation, morphosyntactic tagging, lemmatisation and named entity recognition of non-standard Serbian. Each tweet is also annotated for its automatically assigned standardness levels (T = technical standardness, L = linguistic standardness)..
The corpus is available for download from the CLARIN.SI repository.
创建时间:
2023-10-13



