Heatmap-based visualisation of the linguistic polymorphism in Transcarpathian East Slavic - Supplementary material
收藏DataCite Commons2026-05-04 更新2026-05-07 收录
下载链接:
https://zenodo.org/doi/10.5281/zenodo.20029676
下载链接
链接失效反馈官方服务:
资源简介:
Data
Demonstration data consists of a single file, data/topics_depparse_gold_LA1407.conllu. This is a CoNLL-U file in Universal Dependencies format.
Code
Note for reproducibility
The study was conducted on 3.13 version of Python. Version >=3.14 is going to cause conflicts in dependencies.
Example generation
For generation of examples the Heat_measurement_notebook.ipynb notebook was used. Run through it to get the Figure 2.
Study
Morphological tagging errors visualisation
To reproduce morphological tagging errors visualisation, run the following command in visualisation_of_errors folder:
python src/main.py -t pos -s "INPUT" -out "OUTPUT"
where INPUT is the path to data directory, and OUTPUT is the folder, where there should be the storage of results.
Lemmatisation errors visualisation
To reproduce lemmatisation errors visualisation, run the following command in visualisation_of_errors folder:
python src/main.py -t lemma -s "INPUT" -out "OUTPUT"
where INPUT is the path to data/topics_depparse_gold_LA1407.conllu file, and OUTPUT is the folder, where there should be the storage of results.
提供机构:
Zenodo
创建时间:
2026-05-04



