
Datatang: 786 Hours of Dutch Scripted Monologue Speech Data (Smartphone)

ModelScope community. Updated 2025-11-19, indexed 2024-05-15.
Download link:
https://modelscope.cn/datasets/DatatangBeijing/786Hours-DutchScriptedMonologueSmartphoneSpeechDataset
Resource description:
Dutch speech data, scripted reading (mobile). Speakers read aloud from provided scripts in a simulated recording setup. A total of 681 speakers from the Netherlands took part, and all recordings were made in quiet, echo-free environments. The content covers a wide range of topics, with each speaker contributing approximately 1,000 utterances. The transcripts were manually proofread for high accuracy, making the dataset a rich resource for speech recognition research and applications. Multiple AI companies have verified that the data helps models perform well when facing real-world diversity. We strictly follow data protection and privacy regulations to safeguard users' privacy and legal rights throughout data collection, storage, and use. All data complies with GDPR, CCPA, and PIPL.
Provider:
maas
Created:
2024-05-11
Collected summary
Dataset introduction
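Background and challenges
Background overview
This dataset is a 786-hour Dutch read-speech resource designed for speech recognition research and application testing. It contains news and general-text speech recorded on mobile phones in quiet environments by 681 native Dutch speakers, with about 1,000 utterances per speaker. The transcripts are manually verified, with a stated accuracy of 95%. The data is provided as 16 kHz mono WAV files under the Apache 2.0 license, and Datatang holds the commercial copyright.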
The content above was collected and summarized by 遇见数据集.
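
The listing describes the audio as 16 kHz mono WAV. A minimal sketch of how a downloaded file could be checked against that description, using only the Python standard library, is shown here. The function names (`describe_wav`, `matches_listing`, `write_test_tone`) and the file name `sample.wav` are hypothetical and chosen for illustration; the listing does not document the dataset's directory layout or sample width, so the demo writes a synthetic tone rather than reading a real dataset file.

```python
import math
import os
import struct
import tempfile
import wave

EXPECTED_RATE = 16_000   # 16 kHz, as stated in the listing
EXPECTED_CHANNELS = 1    # mono, as stated in the listing


def describe_wav(path):
    """Return (sample_rate, channels, duration_seconds) read from a WAV header."""
    with wave.open(path, "rb") as wf:
        rate = wf.getframerate()
        channels = wf.getnchannels()
        duration = wf.getnframes() / rate
    return rate, channels, duration


def matches_listing(path):
    """True if the file is 16 kHz mono, the format named in the listing."""
    rate, channels, _ = describe_wav(path)
    return rate == EXPECTED_RATE and channels == EXPECTED_CHANNELS


def write_test_tone(path, seconds=1.0, rate=EXPECTED_RATE):
    """Write a synthetic 440 Hz mono tone (16-bit samples are an assumption)."""
    n = int(seconds * rate)
    frames = b"".join(
        struct.pack("<h", int(12000 * math.sin(2 * math.pi * 440 * i / rate)))
        for i in range(n)
    )
    with wave.open(path, "wb") as wf:
        wf.setnchannels(1)
        wf.setsampwidth(2)
        wf.setframerate(rate)
        wf.writeframes(frames)


if __name__ == "__main__":
    # Stand-in for a real dataset file, which this sketch does not have.
    with tempfile.TemporaryDirectory() as tmp:
        sample = os.path.join(tmp, "sample.wav")
        write_test_tone(sample)
        print(describe_wav(sample), matches_listing(sample))
```

Summing the durations returned by `describe_wav` over all files would also give a way to compare against the stated 786 hours. Taking the listing's figures at face value, 786 hours spread over roughly 681 × 1,000 utterances works out to about 4.2 seconds per utterance on average.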