theodorr/mls10k_librittsr
Source: Hugging Face | Updated: 2024-07-25 | Indexed: 2024-12-14
Download link:
https://hf-mirror.com/datasets/theodorr/mls10k_librittsr
Description:
The dataset has four features: audio_token (audio tokens, a nested sequence of int64), audio_duration (audio duration, float64), text (string), and align (alignment information, a sequence of strings). It contains a single train split with 2,774,765 samples; the dataset size is 101,071,185,102 bytes and the download size is 18,095,604,545 bytes. The only configuration is named default, and its data files match data/train-*.
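The schema described above can be sketched in Python. This is a minimal illustration, not the dataset author's code: the helper names `load_train_stream` and `matches_schema` are hypothetical, and the sample record is made up for demonstration. The loader assumes the Hugging Face `datasets` library is installed and that the repository id matches the page title; streaming is used because the full dataset is about 101 GB.

```python
from typing import Any, Dict

REPO_ID = "theodorr/mls10k_librittsr"  # repository id taken from the page title


def load_train_stream():
    """Open the single train split lazily (needs `datasets` and network access)."""
    # Imported here so the schema check below works without the library installed.
    from datasets import load_dataset

    return load_dataset(REPO_ID, split="train", streaming=True)


def matches_schema(record: Dict[str, Any]) -> bool:
    """Check a record against the four features listed in the description."""
    tokens = record.get("audio_token")
    align = record.get("align")
    # audio_token: nested sequence of integers
    tokens_ok = isinstance(tokens, list) and all(
        isinstance(row, list) and all(isinstance(t, int) for t in row)
        for row in tokens
    )
    # audio_duration: float; text: string; align: sequence of strings
    duration_ok = isinstance(record.get("audio_duration"), float)
    text_ok = isinstance(record.get("text"), str)
    align_ok = isinstance(align, list) and all(isinstance(a, str) for a in align)
    return tokens_ok and duration_ok and text_ok and align_ok


# Hypothetical record, invented only to show the expected shape.
example = {
    "audio_token": [[12, 7, 301], [45, 2, 9]],
    "audio_duration": 3.2,
    "text": "hello world",
    "align": ["hello", "world"],
}
print(matches_schema(example))
```

To inspect real data, `next(iter(load_train_stream()))` would fetch the first record without downloading the whole split; the exact content and format of the align strings is not documented on this page, so it is worth checking on a real sample.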
Provided by:
theodorr



