otozz/iraqi_train_set
收藏Hugging Face2025-02-09 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/otozz/iraqi_train_set
下载链接
链接失效反馈官方服务:
资源简介:
这是一个预处理过的伊拉克语训练分区数据集,源自MASC(大规模阿拉伯语语音语料库)数据集。数据集包含输入特征序列和标签序列,其中输入特征是float32类型,标签是int64类型。数据集分为训练集,包含12564个示例,总大小超过11GB。该数据集遵循cc-by-4.0协议。
This is a pre-processed Iraqi language training partition dataset from the MASC (Massive Arabic Speech Corpus) dataset. The dataset contains sequences of input features and labels, where the input features are of type float32 and the labels are of type int64. The dataset is split into a training set with 12,564 examples and a total size of over 11GB. The dataset is licensed under cc-by-4.0.
提供机构:
otozz



