Early Slavic language models
Source: NIAID Data Ecosystem (indexed 2026-05-01)
Download link:
https://zenodo.org/record/8414136
Description:
Word embeddings trained on the lemmatised TOROT Treebank, using Word2Vec and the following parameters:
sg = True
min_count = <1,3,5>
window = <3,5>
vector_size = <100,200,300>
epochs = 5
One model was trained for each combination of the parameter values enclosed in angle brackets (< >), giving 18 models in total (3 × 2 × 3).
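The parameter grid described above can be enumerated as in the following minimal sketch. It is an illustration only, not the authors' training script: it assumes the parameter names map directly onto the keyword arguments of gensim's `Word2Vec` class (gensim 4 or later), that `sg = True` corresponds to `sg=1` (skip-gram), and that `sentences` is a hypothetical list of lemma lists extracted from the treebank.

```python
from itertools import product

# Fixed settings from the parameter list; sg=1 selects skip-gram in gensim.
FIXED = {"sg": 1, "epochs": 5}

# Settings that were varied (the values given in angle brackets).
GRID = {
    "min_count": [1, 3, 5],
    "window": [3, 5],
    "vector_size": [100, 200, 300],
}


def parameter_grid():
    """Return one dict of Word2Vec keyword arguments per combination."""
    keys = list(GRID)
    return [
        {**FIXED, **dict(zip(keys, values))}
        for values in product(*(GRID[k] for k in keys))
    ]


def train_one(sentences, params):
    """Train a single model on `sentences` (hypothetical list of lemma lists)."""
    # Imported lazily so the grid can be inspected without gensim installed.
    from gensim.models import Word2Vec
    return Word2Vec(sentences=sentences, **params)


combos = parameter_grid()
print(len(combos))  # 3 * 2 * 3 combinations
for params in combos[:3]:
    print(params)
```

Calling `train_one(sentences, params)` for each entry of `combos` would yield one model per combination; how the released models were actually produced and named is not documented beyond the description here.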
The release contains both the full models (.model) and the plain vector files (_vectors.txt). The models are named according to the parameters they were trained with.
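As a rough illustration of how the plain vector files might be read, the sketch below parses word2vec-style text into a dictionary and compares two entries by cosine similarity. The file layout is an assumption, not something the release notes confirm: an optional "<count> <dimensions>" header line, then one "<lemma> <v1> ... <vn>" line per entry. The function names are hypothetical. If gensim is installed, `KeyedVectors.load_word2vec_format(path, binary=False)` is the usual way to read files in this layout, and `Word2Vec.load(path)` reads the full `.model` files.

```python
import math


def parse_vectors(lines):
    """Parse word2vec-style text lines into {lemma: [floats]}.

    Assumes an optional "<count> <dim>" header followed by one
    "<lemma> <v1> ... <vn>" line per entry.
    """
    vectors = {}
    for i, line in enumerate(lines):
        parts = line.split()
        if i == 0 and len(parts) == 2 and all(p.isdigit() for p in parts):
            continue  # skip the header line
        if len(parts) < 2:
            continue  # skip blank or malformed lines
        vectors[parts[0]] = [float(x) for x in parts[1:]]
    return vectors


def read_vectors(path):
    """Read a vector file from `path` (hypothetical location of a _vectors.txt file)."""
    with open(path, encoding="utf-8") as handle:
        return parse_vectors(handle)


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0
```

Since the models were not systematically evaluated, similarity scores obtained this way are best treated as exploratory.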
Note that these models are the result of very preliminary experiments and their quality has not been systematically evaluated, so use them with caution.
Created:
2023-10-07



