gweltou/wikipedia-br-20240325
收藏Hugging Face2024-04-01 更新2024-06-11 收录
下载链接:
https://hf-mirror.com/datasets/gweltou/wikipedia-br-20240325
下载链接
链接失效反馈官方服务:
资源简介:
---
license: apache-2.0
language:
- br
multilinguality:
- monolingual
size_categories:
- 100K<n<1M
---
A corpus of sentences extracted for the Breton Wikipedia (cirrus dump).
The sentences were filtered so that only Breton sentences were kept.
Please note that the sentence splitting algorithm is far from perfect, so many sentences will appear incorrect or incomplete.
提供机构:
gweltou
原始信息汇总
数据集概述
基本信息
- 许可证: Apache-2.0
- 语言: 布列塔尼语 (br)
- 多语言性: 单语种
- 大小分类: 100K<n<1M
内容描述
- 数据集包含从布列塔尼语维基百科(cirrus dump)提取的句子。
- 经过筛选,仅保留了布列塔尼语的句子。
- 注意:句子分割算法并不完善,许多句子可能显示为不正确或不完整。



