ai4bharat/Rural_Women_ASR_v2
Source: Hugging Face. Updated: 2025-11-06. Indexed: 2025-11-15.
Download link:
https://hf-mirror.com/datasets/ai4bharat/Rural_Women_ASR_v2
Description:
Rural Women ASR Dataset (Hindi & Bhojpuri): This dataset is part of the *Recognizing Every Voice* initiative, focusing on building inclusive Automatic Speech Recognition (ASR) systems for rural women in India. It includes Hindi and Bhojpuri speech data collected from rural women speakers, covering diverse age groups, regions, and socio-economic backgrounds. The features include language, audio path, text, verbatim transcription, normalized transcription, audio raw duration, audio chunk duration, scenario, task name, speaker ID, gender, age group, occupation, qualification, area, district, state, verification report, and prompt text.
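As a rough illustration of how the per-segment features listed above might be used, the sketch below totals audio duration per language. The column names `language` and `audio_chunk_duration` are guesses based on the feature list, the unit (seconds) is an assumption, and the sample rows are made up; check the actual dataset schema before relying on any of them.

```python
from collections import defaultdict

# Hypothetical column names, guessed from the feature list above; the real
# dataset's column names may differ.
def hours_by_language(records, lang_key="language", dur_key="audio_chunk_duration"):
    """Sum segment durations (assumed to be in seconds) per language, in hours."""
    totals = defaultdict(float)
    for rec in records:
        totals[rec[lang_key]] += rec[dur_key]
    return {lang: secs / 3600.0 for lang, secs in totals.items()}

# Made-up rows for illustration only; not taken from the dataset.
sample = [
    {"language": "Hindi", "audio_chunk_duration": 1800.0},
    {"language": "Bhojpuri", "audio_chunk_duration": 900.0},
    {"language": "Hindi", "audio_chunk_duration": 1800.0},
]
print(hours_by_language(sample))
```

The same function should work on any iterable of dict-like rows, so it could be applied to rows loaded from the dataset once the real column names are confirmed.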
Provider:
ai4bharat



