Romanian Text Corpus — 20B Tokens
收藏kaggle2026-04-05 更新2026-05-09 收录
下载链接:
https://www.kaggle.com/datasets/junesdata/romanian-text-corpus-20b-tokens
下载链接
链接失效反馈官方服务:
资源简介:
Comprehensive Romanian pretraining corpus — 19.8M documents, 5 curated sources
创建时间:
2026-04-05



