
Jendersen/cornish_english_translation

Source: Hugging Face. Updated 2025-12-16. Indexed 2025-12-20.
Download link:
https://hf-mirror.com/datasets/Jendersen/cornish_english_translation
Description:
A small but high-quality parallel corpus of Cornish (kw) ↔ English (en) sentences, ideal for low-resource multilingual machine translation research and prototyping. Cornish (Kernewek or Kernowek [kəɾˈnuːək]) is a critically endangered Celtic language of the Brittonic subgroup that is native to the Cornish people and their homeland, Cornwall. This dataset provides professionally aligned sentences in Cornish and English, making it one of the few publicly available resources for this low-resource language. The dataset is stored in Parquet format and consists of a single split (train) with 9,087 examples, each containing a Cornish sentence and its corresponding English translation.
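
The description above says the corpus is a single Parquet-backed train split of sentence pairs, so a short loading sketch may help. This is a minimal example, assuming the Hugging Face `datasets` package is installed and the repository is reachable. The column names are not stated in the description, so `to_pairs` takes them as arguments. Both `load_corpus` and `to_pairs` are hypothetical helpers written for this illustration, not part of the dataset or of any library.

```python
from typing import Iterable, List, Mapping, Optional, Tuple


def load_corpus(repo_id: str = "Jendersen/cornish_english_translation"):
    """Download the train split from the Hugging Face Hub (needs network access)."""
    # Imported lazily so that to_pairs() below works without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(repo_id, split="train")


def to_pairs(
    rows: Iterable[Mapping[str, Optional[str]]],
    src_col: str,
    tgt_col: str,
) -> List[Tuple[str, str]]:
    """Return (source, target) sentence pairs, skipping rows with an empty side.

    src_col and tgt_col are assumptions supplied by the caller: the dataset
    description does not name its columns, so inspect them before calling.
    """
    pairs = []
    for row in rows:
        src = (row.get(src_col) or "").strip()
        tgt = (row.get(tgt_col) or "").strip()
        if src and tgt:
            pairs.append((src, tgt))
    return pairs


if __name__ == "__main__":
    ds = load_corpus()
    print(ds.column_names)  # check the real column names first
    print(len(ds))          # the description states 9,087 examples
```

Once the real column names are known, `to_pairs(ds, <cornish_column>, <english_column>)` yields a plain list of tuples that can be passed to a tokenizer or split into training and validation subsets. Because the dataset ships only a train split, any held-out set has to be carved out by the user.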
Provider:
Jendersen