Datatang: 377 Hours of Henan Dialect Conversational Speech Data by Mobile Phone

Name: Datatang: 377 Hours of Henan Dialect Conversational Speech Data by Mobile Phone
Creator: maas
Published: 2026-05-23 21:04:45
License: No description available

ModelScope community; updated 2026-05-23; indexed 2024-05-15

Download link:

https://modelscope.cn/datasets/DatatangBeijing/377Hours_HenanDialectConversationalSpeechDataByMobilePhone

Resource overview:

This dataset contains 377 hours of natural conversational speech in the Henan dialect, collected on mobile phones and recorded by 762 native Henan speakers. The speakers span multiple age groups, and the gender ratio is balanced. The conversations are unscripted: no topic is assigned and no text is prepared in advance, and the speakers talk freely in pairs. The data can be used to train acoustic models for speech recognition and speaker (voiceprint) recognition models, or for related algorithm research.

Provider:

maas

Created:

2022-12-29

Collection summary

Dataset introduction