Cleaned Polish Oscar corpus (128M above lines)
收藏B2FIND2026-04-29 收录
下载链接:
https://b2find.eudat.eu/dataset/fa53978c-3504-575b-9065-896271d60c76
下载链接
链接失效反馈官方服务:
资源简介:
Cleaned Polish Oscar corpus (part: 128M above lines, 1.93 GB). Data was prepared with a few cleaning heuristics: - remove sentences shorter than - remove non-polish...



