pere/wiki_paragraphs_norwegian
Source: Hugging Face. Updated: 2025-02-03. Indexed: 2025-02-15.
Download link:
https://hf-mirror.com/datasets/pere/wiki_paragraphs_norwegian
Description:
WIKI Paragraphs Norwegian is a multi-split dataset for machine learning research and evaluation. It contains text samples in JSON Lines format. The splits serve different use cases: randomly shuffled data, a structured format, and validation/test sets of varying sizes. The dataset contains 1,132,200 samples in total.
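Since the samples are distributed as JSON Lines (one JSON object per line), a minimal sketch of reading such data with the Python standard library might look like the following. The `text` field name and the two sample records are assumptions made for illustration; the listing does not state the actual schema or file names.

```python
import io
import json
from typing import Iterable, Iterator


def read_jsonl(lines: Iterable[str]) -> Iterator[dict]:
    """Yield one parsed record per non-empty line of JSON Lines input."""
    for line in lines:
        line = line.strip()
        if line:  # skip blank lines between records
            yield json.loads(line)


# Hypothetical two-record sample; the real field names may differ.
sample = io.StringIO(
    '{"text": "Oslo er hovedstaden i Norge."}\n'
    '{"text": "Bergen ligger på Vestlandet."}\n'
)
records = list(read_jsonl(sample))
print(len(records))
```

The same function accepts an open file handle in place of the in-memory sample, so a downloaded split can be streamed line by line without loading the whole file at once.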
Provider:
pere



