michsethowusu/tsonga-tumbuka_sentence-pairs
收藏Hugging Face2025-03-31 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/michsethowusu/tsonga-tumbuka_sentence-pairs
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含非洲语言的句子对以及相似度评分。每个数据行包括三个列:评分列表示两个句子的相似度(范围从0到1),Tsonga列是句子对中的第一句(语言1),Tumbuka列是句子对中的第二句(语言2)。此数据集适用于训练和评估机器学习模型,用于翻译、句子相似度评估和跨语言迁移学习等任务。
This dataset contains sentence pairs in African languages with an associated similarity score. Each row consists of three columns: score representing the similarity between the two sentences (range from 0 to 1), Tsonga as the first sentence in the pair (language 1), and Tumbuka as the second sentence in the pair (language 2). The dataset is intended for use in training and evaluating machine learning models for tasks like translation, sentence similarity, and cross-lingual transfer learning.
提供机构:
michsethowusu



