voa-engines/common_voice_resample
收藏Hugging Face2024-10-19 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/voa-engines/common_voice_resample
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多种语言的音频和文本数据,每种语言配置(如英语、西班牙语、法语等)都包含音频和句子两个特征。音频特征具有不同的采样率(如48000Hz),句子特征为字符串类型。数据集分为训练集,每个训练集的大小和样本数量因语言而异。例如,英语训练集包含89964个样本,大小为3745988816.3字节;西班牙语训练集包含20758个样本,大小为821712449.386字节。
The dataset contains audio and text data in multiple languages. Each language configuration (e.g., English, Spanish, French) includes two features: audio and sentence. The audio feature has different sampling rates (e.g., 48000Hz), and the sentence feature is of string type. The dataset is divided into training sets, with varying sizes and numbers of samples depending on the language. For example, the English training set contains 89,964 samples with a size of 3,745,988,816.3 bytes, while the Spanish training set contains 20,758 samples with a size of 821,712,449.386 bytes.
提供机构:
voa-engines



