five

nvidia/hifitts-2

收藏
Hugging Face2025-11-18 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/nvidia/hifitts-2
下载链接
链接失效反馈
官方服务:
资源简介:
HiFiTTS-2是一个大规模的高带宽语音数据集,从LibriVox有声读物中衍生而来。该数据集包含了大约36.7k小时的音频,来自5000名演讲者,音频可以从LibriVox以48 kHz的采样率下载。数据集的元数据包含了估计的带宽,用于推断录音的原始采样率。基础数据集经过过滤,适用于22 kHz的语音模型训练,同时也提供了一个适用于44 kHz训练的预计算子集。用户可以修改下载脚本来使用任何采样率和带宽阈值,这可能更适合他们的工作。

HiFiTTS-2 is a large-scale high bandwidth speech dataset derived from LibriVox audiobooks. The dataset contains approximately 36.7k hours of audio from 5k speakers that can be downloaded from LibriVox at a 48 kHz sampling rate. The metadata includes an estimated bandwidth, which is used to infer the original sampling rate the audio was recorded at. The base dataset is filtered for a bandwidth appropriate for training speech models at 22 kHz, and a precomputed subset is provided for 44 kHz training. Users can modify the download script to use any sampling rate and bandwidth threshold that might be more appropriate for their work.
提供机构:
nvidia
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作