five

Classical Tibetan Word Embeddings

收藏
NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://zenodo.org/record/6782246
下载链接
链接失效反馈
官方服务:
资源简介:
Classical Tibetan word embeddings trained with FastText based on the 2018 version of the BDRC corpus, a segmented version of which is available on Zenodo: Meelen, Marieke, & Roux, Élie. (2020). The Annotated Corpus of Classical Tibetan (ACTib) - Version 2.0 (Segmented & POS-tagged) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.3951503 This is the first version trained with default FastText settings (100D) for a pilot study on Chinese-Tibetan crosslinguistic Semantic Textual Similarity: Felbur, Rafal, Marieke Meelen & Paul Vierthaler (2022), 'Crosslinguistic Semantic Textual Similarity of Buddhist Chinese and Classical Tibetan' in Journal of Open Humanities Data. This research was done with generous funding from the Open Philology project. This project (running 2018–2022) is funded by the European Research Council (ERC) under the Horizon 2020 program (Advanced Grant agreement No 741884). It is based at the Leiden University Institute for Area Studies.
创建时间:
2022-06-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作