Parallel text typology dataset
Source: NIAID Data Ecosystem (indexed 2026-03-14)
Download link:
https://zenodo.org/record/7506219
Description:
This repository contains data accompanying the paper "Neural models can sometimes discover typological generalizations", currently being submitted for publication. It provides the following information for 1295 languages:
language vector representations from a range of neural models
automatically derived lists of affixes
automatically derived lists of inflectional paradigms
typological features derived from annotation projection, and statistics on dependency relations
typological features derived from classifiers trained on language vectors and typological databases
automatically derived word lists
data needed for automatic evaluation of language representations (code in separate repository)
Note that the multilingual word embeddings described in the paper are very large, and therefore distributed in a separate public repository.
Created:
2023-01-05



