nusnlp/JGP-SlimPajama
Source: Hugging Face | Updated: 2025-07-01 | Indexed: 2025-11-01
Download link:
https://hf-mirror.com/datasets/nusnlp/JGP-SlimPajama
Description:
The Just-Go-Parallel (SlimPajama) dataset contains the experimental data for the associated paper's study on improving the multilingual capabilities of large language models. It consists of multiple sub-datasets covering different experimental settings, supporting training and evaluation on multilingual parallel sentence pairs.
Provider:
nusnlp



