Sefaria/Rabbinic-Hebrew-English-Pairs
收藏Hugging Face2026-01-12 更新2026-02-07 收录
下载链接:
https://hf-mirror.com/datasets/Sefaria/Rabbinic-Hebrew-English-Pairs
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含3,708对平行文本,涵盖多个世纪和类型的拉比文学,旨在评估跨语言嵌入模型在将希伯来语/阿拉姆语源文本与英语翻译对齐方面的能力。源语言包括拉比希伯来语、犹太巴比伦阿拉姆语和犹太巴勒斯坦阿拉姆语,目标语言为英语。每对文本包含Sefaria参考字符串、希伯来语/阿拉姆语源文本、英语翻译和文本类别。数据集还详细列出了不同类别的文本及其数量和描述。
This dataset contains 3,708 parallel text pairs spanning diverse Rabbinic literature across multiple centuries and genres. It is designed for evaluating cross-lingual embedding models on their ability to align Hebrew/Aramaic source texts with English translations. The source languages include Rabbinic Hebrew, Jewish Babylonian Aramaic, and Jewish Palestinian Aramaic, while the target language is English. Each example contains a Sefaria reference string, Hebrew/Aramaic source text, English translation, and text category. The dataset also provides detailed descriptions of various text categories along with their counts.
提供机构:
Sefaria



