jonathansuru/fon_tts_speaker_5
收藏Hugging Face2024-12-14 更新2024-12-21 收录
下载链接:
https://hf-mirror.com/datasets/jonathansuru/fon_tts_speaker_5
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含音频、文本、说话者和性别四个特征。音频的采样率为44100Hz,文本为字符串类型,说话者为整数类型,性别为字符串类型。数据集仅包含一个训练分割,共有1119个样本,总大小为239343628.41926366字节。
The dataset includes four features: audio, text, speaker, and gender. The audio feature has a sampling rate of 44100, the text feature is of string type, the speaker feature is of integer type, and the gender feature is of string type. The dataset is divided into a training set, containing 1119 samples. The download size of the dataset is 226605688 bytes, and the dataset size is 239343628.41926366 bytes. The configuration includes a default configuration, with the training data file path being data/train-*.
提供机构:
jonathansuru



