seastar105/speech-token-dataset
Source: Hugging Face | Updated: 2025-03-05 | Indexed: 2025-04-12
Download link:
https://hf-mirror.com/datasets/seastar105/speech-token-dataset
Description:
This dataset includes, among others, the following fields: unique identifier (_id), CER value (cer), DNSMOS value (dnsmos), duration, language, phone count (phone_count), prediction result (pred), speaker, text, and code sequence (codes). The dataset is divided into splits, one of which is named emilia_yodas_ko_wavtok; it contains 928081 examples with a total size of 5369857808 bytes. The download size of the dataset is 1159644136 bytes.
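To put the byte counts quoted in the description into more familiar units, here is a minimal Python sketch. The helper names `summarize_sizes` and `stream_split` are hypothetical and introduced only for illustration. The second helper assumes the Hugging Face `datasets` library is installed and that emilia_yodas_ko_wavtok is exposed as a split name, which the listing suggests but does not confirm.

```python
# Figures quoted in the description above.
NUM_EXAMPLES = 928_081
DATASET_BYTES = 5_369_857_808
DOWNLOAD_BYTES = 1_159_644_136

REPO_ID = "seastar105/speech-token-dataset"
SPLIT = "emilia_yodas_ko_wavtok"  # assumption: usable as a split name


def summarize_sizes(num_examples, dataset_bytes, download_bytes):
    """Convert raw byte counts into GiB, per-example size, and size ratio."""
    gib = 1024 ** 3
    return {
        "dataset_gib": dataset_bytes / gib,
        "download_gib": download_bytes / gib,
        "bytes_per_example": dataset_bytes / num_examples,
        "size_ratio": dataset_bytes / download_bytes,
    }


def stream_split(repo_id=REPO_ID, split=SPLIT):
    """Return an iterable over the split without downloading it in full.

    The import is done lazily so that summarize_sizes works even when
    the `datasets` library is not installed.
    """
    from datasets import load_dataset

    return load_dataset(repo_id, split=split, streaming=True)


summary = summarize_sizes(NUM_EXAMPLES, DATASET_BYTES, DOWNLOAD_BYTES)
for name, value in summary.items():
    print(f"{name}: {value:.2f}")
```

With the figures above, the split comes to roughly 5 GiB of data, and the download is a little over 1 GiB. Calling `next(iter(stream_split()))` should yield a single record for inspecting the fields, provided the assumptions above hold and network access is available.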
Provider:
seastar105



