Early Slavic language models

NIAID Data Ecosystem2026-05-01 收录

下载链接：

https://zenodo.org/record/8414136

下载链接

链接失效反馈

官方服务：

资源简介：

Word embeddings trained on the lemmatised TOROT Treebank, using Word2Vec and the following parameters: sg = True min_count = <1,3,5> window = <3,5> vector_size = <100,200,300> epochs = 5 One model was trained for each combination of the parameters enclosed in angled brackets (< >). The release contains both the full models (.model) and the plain vector files (_vectors.txt). The models are named according to the parameters they were trained with. Note that these are the result of very preliminary experiments and no systematic evaluation of their quality was carried out, so use with caution.

创建时间：

2023-10-07

5,000+

优质数据集

54 个

任务类型

进入经典数据集