five

typhoon-ai/TVSpeech

收藏
Hugging Face2026-01-21 更新2026-02-07 收录
下载链接:
https://hf-mirror.com/datasets/typhoon-ai/TVSpeech
下载链接
链接失效反馈
官方服务:
资源简介:
TVSpeech是一个专门设计用于评估自动语音识别(ASR)模型在真实世界泰语音频中鲁棒性的基准数据集。该数据集包含570个从YouTube公共媒体频道精选的语音样本(总计3.75小时),涵盖了金融、技术和多样化的vlog等内容类别。数据集特别注重语音的声学和语义复杂性,包括领域特定术语、专有名词和技术行话等低频率词汇。所有样本均经过人工转录,采样率为16 kHz,并采用Creative Commons Attribution (CC-BY)许可。该数据集仅包含测试集,用于评估ASR模型在嘈杂环境和复杂语义下的表现。

TVSpeech is a Thai speech recognition benchmark dataset specifically designed as a Robustness Track for evaluating ASR models on real-world, in-the-wild Thai audio. The dataset consists of 570 utterances (3.75 hours) curated from diverse public media channels on YouTube under the Creative Commons Attribution (CC-BY) license, representing challenging acoustic and semantic complexity found in natural speech. It includes domain-specific terminology, proper names, and technical jargon, with a focus on low-frequency terms. The dataset is manually transcribed, sampled at 16 kHz, and contains only a test split for evaluation purposes.
提供机构:
typhoon-ai
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作