Denhotech/africa_language_data
收藏Hugging Face2025-07-24 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/Denhotech/africa_language_data
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含四个部分:wordschatz_uni_dataset、voa_african_lang、mathematics_data和makerere_dataset。wordschatz_uni_dataset部分包含6619712个字符串类型的例子,占用724929760字节;voa_african_lang部分包含1208151个例子,占用247220086字节;mathematics_data部分包含31461048个例子,占用2458524838字节;makerere_dataset部分包含347264个例子,占用29386373字节。整个数据集下载大小为1908384426字节,总大小为3460061057字节。
The dataset consists of four parts: wordschatz_uni_dataset, voa_african_lang, mathematics_data, and makerere_dataset. The wordschatz_uni_dataset part contains 6619712 string type examples, occupying 724929760 bytes; the voa_african_lang part contains 1208151 examples, occupying 247220086 bytes; the mathematics_data part contains 31461048 examples, occupying 2458524838 bytes; the makerere_dataset part contains 347264 examples, occupying 29386373 bytes. The total download size of the dataset is 1908384426 bytes, and the total size is 3460061057 bytes.
提供机构:
Denhotech



