中英文唤醒词语音数据库
收藏北京国际大数据交易所2024-06-20 收录
下载链接:
https://webs.bjidex.com/sys-bsc-home/#/bscConsole/tradingMarket/detail?id=1923
下载链接
链接失效反馈官方服务:
资源简介:
AISHELL-WakeUp-1 database contains 1,561.12 hours speech data, including 3,936,003 wake-upwords speech files.◎ Database language: Chinese and English◎ Recording area: China◎ Wake-up words for recording: “Hi mia” and the Chinese of “你好,米雅”◎ Speakers: 254 participants◎ Environment: Real home environment◎ Device setup: 7 different positions are set for recording, including:1) Six 16-channel circular microphone arrays (16kHz,16bit) for the far-field recording;2) One Hi-Fi microphone for the close-talk recording (44.1kHz,16bit).AISHELL-WakeUp-1 database was transcribed by the professional speech annotators with high QAprocess, and the accuracy rate of word is 100%, which could be used in research of voiceprintrecognition, wake-up words recognition and so on.
AISHELL-WakeUp-1数据集包含1561.12小时语音数据,共计3936003条唤醒词语音文件。
◎ 数据集语言:中文与英文
◎ 录制区域:中国
◎ 录制唤醒词:"Hi mia"与中文唤醒词"你好,米雅"
◎ 说话者:254名参与者
◎ 录制环境:真实家庭环境
◎ 设备部署:共设置7个不同的录制点位,具体包括:1)6套16通道环形麦克风阵列(采样率16kHz、位深16bit),用于远场语音录制;2)1支高保真(Hi-Fi)麦克风,用于近场会话式录制(采样率44.1kHz、位深16bit)。
AISHELL-WakeUp-1数据集经专业语音标注人员结合高标准质量保证(QA)流程完成转录,字词准确率达100%,可应用于声纹识别、唤醒词识别等相关研究领域。
提供机构:
北京希尔贝壳科技有限公司
搜集汇总
数据集介绍

以上内容由遇见数据集搜集并总结生成



