jtz18/biblenlp-corpus
收藏Hugging Face2024-10-27 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/jtz18/biblenlp-corpus
下载链接
链接失效反馈官方服务:
资源简介:
BibleNLP Corpus数据集包含了833种语言的部分或完整的圣经翻译,这些翻译是按章节对齐的。数据集的结构包括翻译、文件、引用、许可证和版权信息。使用该数据集需要安装tqdm、ijson和numpy库,并且可以通过指定ISO 693-3语言代码来选择语言对。
The BibleNLP Corpus dataset contains partial and complete Bible translations in 833 languages, aligned by verse. The dataset is intended for translation tasks and includes translations in various languages, with each translation corresponding to a specific language. The structure of the dataset includes fields such as translation, files, ref, licenses, and copyrights. When using the dataset, you can specify the languages to be paired with a list and ISO 693-3 language codes.
提供机构:
jtz18



