Classical Tibetan Word Embeddings
收藏NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://zenodo.org/record/6782246
下载链接
链接失效反馈官方服务:
资源简介:
Classical Tibetan word embeddings trained with FastText based on the 2018 version of the BDRC corpus, a segmented version of which is available on Zenodo:
Meelen, Marieke, & Roux, Élie. (2020). The Annotated Corpus of Classical Tibetan (ACTib) - Version 2.0 (Segmented & POS-tagged) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.3951503
This is the first version trained with default FastText settings (100D) for a pilot study on Chinese-Tibetan crosslinguistic Semantic Textual Similarity:
Felbur, Rafal, Marieke Meelen & Paul Vierthaler (2022), 'Crosslinguistic Semantic Textual Similarity of Buddhist Chinese and Classical Tibetan' in Journal of Open Humanities Data.
This research was done with generous funding from the Open Philology project. This project (running 2018–2022) is funded by the European Research Council (ERC) under the Horizon 2020 program (Advanced Grant agreement No 741884). It is based at the Leiden University Institute for Area Studies.
创建时间:
2022-06-30



