Proof-Pile-2
收藏DataCite Commons2026-01-07 更新2026-05-05 收录
下载链接:
https://service.tib.eu/ldmservice/dataset/9b10af7b-4ef8-4a9a-9fe6-83042e7fa19f
下载链接
链接失效反馈官方服务:
资源简介:
The dataset used for continual pre-training of large language models, with a focus on balancing the text distribution and mitigating overfitting.
提供机构:
TIB
创建时间:
2024-12-16



