seastar105/speech-token-dataset

Name: seastar105/speech-token-dataset
Creator: seastar105
Published: 2025-03-05 04:23:39
License: 暂无描述

Hugging Face2025-03-05 更新2025-04-12 收录

下载链接：

https://hf-mirror.com/datasets/seastar105/speech-token-dataset

下载链接

链接失效反馈

官方服务：

资源简介：

这是一个包含了多个字段的数据集，字段包括但不限于唯一标识符_id、 cer值、dnsmos值、时长duration、语言language、电话号码计数phone_count、预测结果pred、说话者speaker、文本text和代码序列codes。数据集分为不同的部分，其中一个部分名为emilia_yodas_ko_wavtok，包含928081个示例，总文件大小为5369857808字节。数据集的下载大小为1159644136字节。

This dataset consists of multiple fields including but not limited to unique identifier _id, cer value, dnsmos value, duration, language, phone count, prediction result pred, speaker, text, and code sequence codes. The dataset is split into different parts, one of which is named emilia_yodas_ko_wavtok, containing 928081 examples with a total file size of 5369857808 bytes. The download size of the dataset is 1159644136 bytes.

提供机构：

seastar105

5,000+

优质数据集

54 个

任务类型

进入经典数据集