silma-ai/silma-arabic-english-sts-dataset-v1.0
收藏Hugging Face2024-10-17 更新2025-04-08 收录
下载链接:
https://hf-mirror.com/datasets/silma-ai/silma-arabic-english-sts-dataset-v1.0
下载链接
链接失效反馈官方服务:
资源简介:
SILMA STS阿拉伯/英语数据集 - v1.0 是一个用于训练和评估阿拉伯语和英语句子嵌入的任务的数据集。它包括五个不同的分割,涵盖了单语和双语句子对,并有人工标注的相似度分数。这些分割包括阿拉伯语-阿拉伯语、英语-英语以及跨语言的阿拉伯语-英语句子对,使其成为进行多语言和跨语言语义相似度任务的有价值资源。
The SILMA STS Arabic/English Dataset - v1.0 is a dataset designed for training and evaluating sentence embeddings for Arabic and English tasks. It consists of five different splits that cover monolingual and multilingual sentence pairs with human-annotated similarity scores, including Arabic-to-Arabic, English-to-English, and cross-lingual Arabic-English pairs, making it a valuable resource for multilingual and cross-lingual semantic similarity tasks.
提供机构:
silma-ai



