michsethowusu/kimbundu-tumbuka_sentence-pairs
收藏Hugging Face2025-03-30 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/michsethowusu/kimbundu-tumbuka_sentence-pairs
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了非洲语言句子对及其相似度评分。每个数据条目由三个字段组成:相似度评分、金本杜语句子和通布卡语句子。该数据集可用于机器翻译、句子对齐或其他自然语言处理任务。数据集基于NLLBv1构建,由META领导的开放源代码计划发布。数据集适用于训练和评估用于翻译、句子相似度计算和跨语言迁移学习的机器学习模型。
This dataset contains sentence pairs in African languages along with similarity scores. Each entry consists of three fields: a similarity score, a sentence in Kimbundu, and a sentence in Tumbuka. The dataset can be used for machine translation, sentence alignment, or other natural language processing tasks. It is based on the NLLBv1 dataset, published under an open-source initiative led by META, and is intended for training and evaluating machine learning models for tasks like translation, sentence similarity, and cross-lingual transfer learning.
提供机构:
michsethowusu



