michsethowusu/dinka-swahili_sentence-pairs
收藏Hugging Face2025-04-02 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/michsethowusu/dinka-swahili_sentence-pairs
下载链接
链接失效反馈官方服务:
资源简介:
丁卡-斯瓦希里语句对数据集包含非洲语言的句子对和相似度分数。每个数据行包括三个列:分数(两个句子的相似度,范围从0到1),丁卡语(成对的第一句话),斯瓦希里语(成对的第二句话)。该数据集旨在用于训练和评估机器学习模型,用于翻译、句子相似度计算和跨语言迁移学习等任务。
The Dinka-Swahili Sentence Pairs dataset contains sentence pairs in African languages with associated similarity scores. Each row includes three columns: score (the similarity between the two sentences, ranging from 0 to 1), Dinka (the first sentence in the pair), and Swahili (the second sentence in the pair). This dataset is intended for use in training and evaluating machine learning models for tasks such as translation, sentence similarity, and cross-lingual transfer learning.
提供机构:
michsethowusu



