
The Corpus of Contemporary Written Kurdish

Source: DataCite Commons. Updated: 2025-07-26. Indexed: 2026-05-03.
Download link:
https://fd-repo.uni-bamberg.de/doi/10.48564/unibafd-96cn0-gjm62
Description:
The CCWK comprises a selection of contemporary written, primarily literary texts in Northern Kurdish (Kurmanjî). The corpus was compiled by Abdullah Incekan as part of his PhD project (Incekan 2018) under the supervision of Geoffrey Haig. Due to copyright constraints, the corpus data are available only on request; please contact Geoffrey Haig to request access.

The corpus consists of more than 900,000 words, predominantly fiction (~77%) with some non-fiction (~23%). The texts come from a variety of contemporary sources (from the early 1990s to the present) and are intended to be approximately representative of contemporary Kurdish prose written in the largely standardized Roman-based Kurmanjî alphabet. The corpus is not tagged or translated.

Citation: Incekan, Abdullah & Haig, Geoffrey. 2021. The Corpus of Contemporary Written Kurdish (CCWK). Bamberg: University of Bamberg. (DOI: 10.48564/unibafd-hp82b-k0k26) (date accessed)
Provider:
Otto-Friedrich-Universität Bamberg
Created:
2025-07-25