粤语asr数据集

Name: 粤语asr数据集
Creator: maas
Published: 2026-05-23 20:55:59
License: 暂无描述

魔搭社区2026-05-23 更新2024-05-15 收录

下载链接：

https://modelscope.cn/datasets/tomsawyerhu/cantonese-dialect

下载链接

链接失效反馈

官方服务：

资源简介：

### Clone with HTTP ```bash git clone https://www.modelscope.cn/datasets/tomsawyerhu/cantonese-dialect.git ``` ### 数据集介绍这个粤语音频数据集是一个有关自动语音识别（ASR）的重要资源，旨在促进粤语语音识别技术的发展和改进。该数据集包含了丰富而多样化的粤语音频样本，覆盖了不同方言、口音、语速和语音环境，以更好地模拟现实世界中的语音情境。数据集的主要特点包括：多样性和广泛性：数据集中包含了来自不同年龄、性别和地理背景的发言人的语音样本，以便更好地适应各种使用情境。多通道录制：语音样本通过多通道录制，以捕捉不同声音来源和环境中的声音变化。文本和音频对应：每个语音样本都与准确的文本转写相对应，提供了有助于ASR系统训练和评估的标签。数据质量：数据集经过仔细筛选和清洗，以确保高质量的语音和文本匹配。该数据集的发布旨在为粤语ASR技术的研究者和开发者提供宝贵的资源，帮助改进粤语语音识别的性能，从而为粤语使用者提供更好的语音识别体验。我们希望这一资源能够激发更多关于粤语ASR的研究和创新，推动语音识别技术在粤语社区的应用。

### Clone with HTTP bash git clone https://www.modelscope.cn/datasets/tomsawyerhu/cantonese-dialect.git ### Dataset Introduction This Cantonese audio dataset is a pivotal resource for automatic speech recognition (ASR), designed to advance the development and refinement of Cantonese speech recognition technologies. The dataset encompasses rich and diverse Cantonese audio samples, covering various dialects, accents, speaking rates and acoustic environments to better replicate real-world speech scenarios. Key features of the dataset are as follows: 1. **Diversity and Breadth**: The dataset includes speech samples from speakers spanning different age groups, genders and geographic backgrounds, enabling robust adaptation to diverse usage scenarios. 2. **Multi-channel Recording**: Speech samples are captured via multi-channel recording setups to capture sound variations across different sources and environments. 3. **Audio-text Alignment**: Each speech sample is paired with precise text transcriptions, providing labeled data to support the training and evaluation of ASR systems. 4. **High Data Quality**: The dataset has undergone rigorous screening and cleaning to ensure high-fidelity alignment between audio content and its corresponding text. This dataset is released to offer a valuable resource for researchers and developers working on Cantonese ASR technologies, aiding in the improvement of Cantonese speech recognition performance and ultimately delivering enhanced speech recognition experiences for Cantonese speakers. We hope this resource will spark further research and innovation in Cantonese ASR, and promote the deployment of speech recognition technologies within Cantonese-speaking communities.

提供机构：

maas

创建时间：

2023-10-25

搜集汇总

数据集介绍