Rusallan/karaken1-dataset
收藏Hugging Face2025-11-11 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/Rusallan/karaken1-dataset
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含两种语言的文本数据,分别为英语(en)和一种未知语言(kaa),共有训练集和测试集两个部分。训练集包含208101条数据,测试集包含15335条数据。数据集主要用于语言处理任务,如翻译、文本分析等。
The dataset includes text data in two languages, English (en) and an unknown language (kaa), with both training and test sets. The training set contains 208101 entries, and the test set contains 15335 entries. The dataset is primarily intended for language processing tasks such as translation, text analysis, etc.
提供机构:
Rusallan



