Nexdata/1002_Hours_Kunming_Dialect_Speech_Data_by_Mobile_Phone
Source: Hugging Face. Updated: 2024-04-19. Indexed: 2024-06-12.
Download link:
https://hf-mirror.com/datasets/Nexdata/1002_Hours_Kunming_Dialect_Speech_Data_by_Mobile_Phone
Resource description:
---
license: cc-by-nc-nd-4.0
---
## Description
2,284 native speakers of the Kunming dialect took part in the recording. The speakers have authentic accents and come from multiple age groups. The recording script covers a wide range of topics, including generic, interactive, on-board, and home scenarios. Local Kunming residents carried out the quality checks and proofreading, and the text is transcribed accurately. The data matches mainstream Android and Apple phones.
For more details, please refer to the link: https://www.nexdata.ai/dataset/943?source=Huggingface
## Format
16 kHz, 16-bit, uncompressed WAV, mono channel
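
A downloaded file can be checked against the format stated above with Python's standard `wave` module. This is a minimal sketch, not part of the dataset's tooling: the helper names `describe_wav` and `matches_card_format` and the `EXPECTED` table are hypothetical, and the card does not document how files are named or laid out on disk.

```python
import wave

# Values taken from the Format section: 16 kHz, 16-bit (2 bytes), mono.
EXPECTED = {"sample_rate_hz": 16000, "sample_width_bytes": 2, "channels": 1}


def describe_wav(path):
    """Read the header of an uncompressed PCM WAV file.

    The wave module raises wave.Error for files it cannot parse as PCM WAV,
    so a compressed file fails here instead of being misreported.
    """
    with wave.open(path, "rb") as wav:
        frame_rate = wav.getframerate()
        return {
            "sample_rate_hz": frame_rate,
            "sample_width_bytes": wav.getsampwidth(),
            "channels": wav.getnchannels(),
            "duration_s": wav.getnframes() / frame_rate,
        }


def matches_card_format(path):
    """Return True when the file has the sample rate, bit depth, and channel count listed above."""
    info = describe_wav(path)
    return all(info[key] == value for key, value in EXPECTED.items())
```

Summing `duration_s` over all files gives the corpus length. At this format each second of audio takes 16,000 × 2 = 32,000 bytes, so 1,002 hours corresponds to roughly 115 GB of raw audio.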
## Recording environments
quiet indoor environment, without echo
## Recording content (read speech)
generic category; human-machine interaction category; smart home command and control category; numbers; dialect
## Demographics
2,284 people; 60% are female; people aged 16-25 account for 80%; speakers are from Kunming or the surrounding areas
## Transcription content
text, noise symbols, special identifiers
## Device
Android mobile phone, iPhone
## Language
Kunming dialect
## Accuracy rate
95% (the accuracy rate of noise symbols and other identifiers is not included)
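
The card does not say how this figure is measured or how noise symbols are written in the transcripts. As a rough illustration only, the sketch below computes a character-level accuracy between a reference text and a transcript after removing tags. It assumes tags are written in square brackets (for example `[noise]`), which is a guess, and `strip_tags`, `edit_distance`, and `char_accuracy` are hypothetical helper names, not the vendor's method.

```python
import re

# Assumption: noise symbols and identifiers appear as bracketed tags such as
# "[noise]". The dataset's real tag syntax is not documented in this card.
TAG_PATTERN = re.compile(r"\[[^\]]*\]")


def strip_tags(text):
    """Remove bracketed tags and all whitespace from a transcript."""
    return re.sub(r"\s+", "", TAG_PATTERN.sub("", text))


def edit_distance(a, b):
    """Levenshtein distance, keeping only one row of the table at a time."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def char_accuracy(reference, transcript):
    """Character accuracy in [0, 1], ignoring tags and whitespace."""
    ref = strip_tags(reference)
    hyp = strip_tags(transcript)
    if not ref:
        return 1.0 if not hyp else 0.0
    return max(0.0, 1.0 - edit_distance(ref, hyp) / len(ref))
```

Under these assumptions a transcript that differs from its reference only in tags scores 1.0, which mirrors the statement that noise symbols and other identifiers are excluded from the accuracy rate.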
## Application scenarios
speech recognition, voiceprint recognition
## Licensing Information
Commercial License
Provider:
Nexdata
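Summary of original information
Dataset overview
Dataset description
- Participants: 2,284 native speakers of the Kunming dialect.
- Age distribution: mainly 16-25 years old, accounting for 80%.
- Gender ratio: 60% female.
- Region of origin: mainly Kunming and the surrounding areas.
- Recording content: covers the generic, human-machine interaction, smart home control, numbers, and dialect categories.
- Recording quality: quality-checked and proofread by local people; the text is transcribed accurately.
- Compatibility: matches mainstream Android and Apple phones.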
Dataset format
- Audio format: 16 kHz, 16-bit, mono, uncompressed WAV.
- Recording environment: quiet indoor environment, without echo.
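Demographics
- Total speakers: 2,284.
- Gender ratio: 60% female.
- Age distribution: ages 16-25 account for 80%.
- Geographic distribution: Kunming and the surrounding areas.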
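Transcription content
- Content types: text, noise symbols, special identifiers.
Device
- Recording devices: Android mobile phone, iPhone.
Language
- Dialect: Kunming dialect.
Accuracy rate
- Overall accuracy: 95% (the accuracy rate of noise symbols and other identifiers is not included).
Application scenarios
- Main applications: speech recognition, voiceprint recognition.
Licensing information
- License type: commercial license.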



