The CLASSLA-Stanza model for UD dependency parsing of standard Slovenian 2.0
收藏hdl.handle.net2025-01-16 收录
下载链接:
http://hdl.handle.net/11356/1769
下载链接
链接失效反馈官方服务:
资源简介:
This model for UD dependency parsing of standard Slovenian was built with the CLASSLA-Stanza tool (https://github.com/clarinsi/classla) by training on the SUK training corpus (http://hdl.handle.net/11356/1747) and using the CLARIN.SI-embed.sl word embeddings (http://hdl.handle.net/11356/1204) expanded with the MaCoCu-sl Slovene web corpus (http://hdl.handle.net/11356/1517). The estimated LAS of the parser is ~91.11.
The difference to the previous version of the model is that the model was trained using the SUK training corpus and uses the updated embeddings.
本模型针对标准斯洛文尼亚语的UD依存句法分析构建于CLASSLA-Stanza工具(https://github.com/clarinsi/classla)之上,通过在SUK训练语料库(http://hdl.handle.net/11356/1747)上训练,并结合CLARIN.SI-embed.sl词汇嵌入(http://hdl.handle.net/11356/1204)及其基于MaCoCu-sl斯洛文尼亚语网络语料库(http://hdl.handle.net/11356/1517)的扩展,实现了高精度。该解析器的LAS评估值约为91.11。与先前版本相比,本模型的显著差异在于采用了SUK训练语料库进行训练,并使用了更新的嵌入向量。
提供机构:
hdl.handle.net



