dnagpt/paws-x-multi-pair
收藏Hugging Face2025-02-11 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/dnagpt/paws-x-multi-pair
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个包含多种语言句子对和标签的多语言数据集,适用于训练句子相似度或翻译等任务。数据集中的句子对包括英文、法语、德语、中文、西班牙语、日语和韩语等多种语言,每个句子对都有一个标签,可能用于指示句子对是否在语义上等价。数据集划分为训练集,共有49401个示例。
This dataset is a multilingual dataset containing sentence pairs and labels in various languages, suitable for training tasks such as sentence similarity or translation. The sentence pairs in the dataset include multiple languages such as English, French, German, Chinese, Spanish, Japanese, and Korean, and each pair is associated with a label, which may indicate whether the sentences are semantically equivalent. The dataset is divided into a training set with a total of 49,401 examples.
提供机构:
dnagpt



