1,136小时美国英语语音数据_对话(手机)【数据堂】
收藏OpenDataLab2024-05-23 更新2024-05-25 收录
下载链接:
https://opendatalab.org.cn/shujutang/shujutang1004
下载链接
链接失效反馈官方服务:
资源简介:
美国英语语音数据_对话(手机),由1000余名美国本地人参与录制,以自然方式进行交流,针对给定的数个话题自由发挥,领域广泛,语音自然流利,符合实际对话场景。准确性高,为语音识别相关研究及应用提供了丰富的资源,经多家AI公司验证:有助于模型面对真实世界的多样性时能够表现出色。我们严格遵循数据保护法规和隐私规定,确保数据采集、存储和使用的过程中维护用户的隐私和合法权益,所有数据均遵循GDPR, CCPA, PIPL
American English Spoken Dialogue Dataset (Mobile-Collected): This dataset was recorded by over 1,000 native American English speakers. Participants conducted natural, unconstrained conversations, freely elaborating on several pre-defined topics across diverse domains. The speech content is natural and fluent, conforming to real-world dialogue scenarios, and features high accuracy, providing abundant resources for research and applications related to speech recognition. Verified by multiple AI enterprises, this dataset enables models to achieve robust performance when coping with the diversity of real-world scenarios. We strictly abide by data protection laws and privacy regulations, ensuring that user privacy and legitimate rights and interests are fully protected throughout the entire process of data collection, storage and utilization. All data in this dataset complies with GDPR, CCPA and PIPL.
提供机构:
shujutang
创建时间:
2024-05-23
搜集汇总
数据集介绍

背景与挑战
背景概述
该数据集包含1,136小时的美国英语对话语音,由1,416名美国人使用手机在安静室内录制,涵盖通用领域话题,语音自然流利。数据格式为16kHz WAV文件,标注了文本、时间戳和说话人标识,句准确率达95%,为商业数据需企业合作购买。
以上内容由遇见数据集搜集并总结生成



