MIZAN
收藏arXiv2025-09-30 收录
下载链接:
https://github.com/omidkashefi/mizan
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是从文学杰作中创建的最大波斯语语料库,在发布之时曾是最大的波斯语语料库。它包含了超过一百万个句子对和2300万个词汇,旨在用于机器翻译任务。
This dataset is the largest Persian corpus constructed from literary masterpieces, and it was the largest such corpus at the time of its release. It contains over one million sentence pairs and 23 million words, and is specifically designed for machine translation tasks.



