five

sarannair/german_asr_2

收藏
Hugging Face2025-11-14 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/sarannair/german_asr_2
下载链接
链接失效反馈
官方服务:
资源简介:
--- tags: - audio - asr - speech - text language: - de license: cc-by-4.0 --- # German ASR Dataset (consolidated, with splits) This dataset aggregates multiple German speech sources with paired transcripts. ## Splits - **train**: ~90% of examples (shuffled) - **validation**: ~10% of examples (shuffled) ## Fields - **audio**: audio file (various formats) - **text**: transcript (plain text) ## Sources - HUI_Audio_Corpus_Clean - Bundestag ASR (Train_Dataset) - De_Distant_Speech_Data_Corpus_1415_Realtek - Stadtverwaltung Boppard Meeting Recording (Youtube) - Stadtverwaltung Koblenz Meeting Recording (Youtube) > Upload generated automatically from on-prem GPU paths.
提供机构:
sarannair
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作