mwitiderrick/swahili
收藏Hugging Face2023-12-29 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/mwitiderrick/swahili
下载链接
链接失效反馈官方服务:
资源简介:
---
task_categories:
- text-generation
language:
- sw
pretty_name: ' Swahili Corpus'
size_categories:
- 10M<n<100M
license: apache-2.0
---
# Swahili: CC-100: Monolingual Datasets from Web Crawl Data
This is a Swahili corpus obtained from [CC-100: Monolingual Datasets from Web Crawl Data
](https://data.statmt.org/cc-100/)
提供机构:
mwitiderrick
原始信息汇总
Swahili Corpus
概述
- 任务类别: 文本生成
- 语言: 斯瓦希里语
- 数据集名称: Swahili Corpus
- 数据规模: 10M<n<100M
- 许可证: Apache 2.0
来源
- 数据来源: CC-100: Monolingual Datasets from Web Crawl Data



