Respair/Arab_processed_900B
收藏Hugging Face2025-01-22 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/Respair/Arab_processed_900B
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含语音信息的文本数据集,具体包含合并后的音素(merged_phonemes)、输入ID序列(input_ids)和音素序列(phonemes)。数据集分为训练集(train),共有约1949339个样本,数据集总大小为24927984204字节。
This is a text dataset containing speech information, specifically including merged phonemes, input ID sequences, and phoneme sequences. The dataset is split into a training set (train) with approximately 1,949,339 samples and a total dataset size of 24,927,984,204 bytes.
提供机构:
Respair



