mshojaei77/PersianCorpus_merged
收藏Hugging Face2025-03-04 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/mshojaei77/PersianCorpus_merged
下载链接
链接失效反馈官方服务:
资源简介:
波斯语语料库(合并版)是一个大规模的波斯语语料库,从Hugging Face Hub上多个高质量波斯语数据集中精心合并而成。该语料库旨在推动波斯语自然语言处理研究和应用的发展,将多样化的文本来源整合为单一资源,为研究人员和开发者提供了用于训练和评估语言模型的坚实基础。
Persian Corpus (Merged) is a large-scale Persian corpus meticulously aggregated from multiple high-quality Persian datasets available on the Hugging Face Hub. Designed to advance Persian NLP research and applications, this corpus consolidates diverse textual sources into a single resource, providing researchers and developers with a robust foundation for training and evaluating language models.
提供机构:
mshojaei77



