HrEnWaC
Source: arXiv; indexed 2025-09-30
Download link:
https://www.clarin.si/repository/xmlui/handle/11356/1058
Description:
HrEnWaC is a Croatian-English web corpus intended for training language models; its target task is language modeling.
Provider:
CLARIN



