The CLASSLA-Stanza model for lemmatisation of standard Bulgarian 2.1
收藏SSH Open MarketPlace2023-10-13 更新2024-08-03 收录
下载链接:
https://marketplace.sshopencloud.eu/dataset/dM1AE4
下载链接
链接失效反馈官方服务:
资源简介:
The model for lemmatisation of standard Bulgarian was built with the [CLASSLA-Stanza tool](https://github.com/clarinsi/classla) by training on the [BulTreeBank training corpus](https://clarino.uib.no/korpuskel/corpora) and using the Bulgarian inflectional lexicon (Popov, Simov, and Vidinska 1998). The estimated F1 of the lemma annotations is ~98.93.
The model is available for download from the CLARIN.SI repository.
标准保加利亚语词形还原(lemmatisation)模型采用CLASSLA-Stanza工具构建,通过BulTreeBank训练语料库进行训练,并结合使用了保加利亚语屈折词典(Popov, Simov, and Vidinska 1998)。词形标注的预估F1值约为98.93。
该模型可从CLARIN.SI资源库下载获取。
创建时间:
2023-10-13



