数据堂—35小时有声读物文本拼音标注语音数据

Name: 数据堂—35小时有声读物文本拼音标注语音数据
Creator: maas
Published: 2025-12-25 16:14:53
License: 暂无描述

魔搭社区2025-12-25 更新2024-05-15 收录

下载链接：

https://modelscope.cn/datasets/DatatangBeijing/35Hours_PinyinAnnotationSpeechDataOfAudioBookText

下载链接

链接失效反馈

官方服务：

资源简介：

35小时有声读物文本拼音标注语音数据由5名发音人参与录制，其中男性 3 人，女性 2 人，对语音内容做汉字和拼音标注，拼音标注声调。35小时有声读物文本拼音标注语音数据可用于语音识别、机器翻译、声纹识别等任务

The 35-hour audiobook speech dataset has its text content annotated with both Chinese characters and pinyin, and the pinyin is marked with tone marks. It was recorded by 5 speakers, including 3 males and 2 females. This dataset can be applied to tasks such as speech recognition, machine translation, speaker verification and other relevant tasks.

提供机构：

maas

创建时间：

2024-05-06

搜集汇总

数据集介绍