sarpba/CV22-hun-cleaned
Hugging Face | Updated 2025-10-04 | Indexed 2025-10-25
Download link:
https://hf-mirror.com/datasets/sarpba/CV22-hun-cleaned
Resource description:
This dataset consists of audio files, each paired with a same-named .txt transcript. Fields: id (identifier derived from the filename), audio (audio loadable with the datasets library), text (the complete UTF-8 transcript), text_vibevoice (the transcript with a "Speaker 1:" prefix, as used by, for example, the VibeVoice format), relpath (path relative to the data root), duration (approximate length), sample_rate (sampling rate), channels (number of channels), bitrate_kbps (estimated bitrate), age, gender, and accents.
Provider:
sarpba
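
The field layout described above can be illustrated with a minimal sketch that pairs audio files with same-named .txt transcripts and derives the id, text, text_vibevoice, and relpath fields. This is not the dataset author's build script: the function name build_records, the AUDIO_EXTS set, and the exact spacing of the "Speaker 1: " prefix are assumptions made for illustration, and the audio-derived fields (duration, sample_rate, channels, bitrate_kbps) and speaker metadata are left out.

```python
from pathlib import Path

# Assumption: the listing does not say which audio formats are used.
AUDIO_EXTS = {".mp3", ".wav", ".flac"}


def build_records(data_root):
    """Pair each audio file with its same-named .txt transcript (hypothetical helper)."""
    root = Path(data_root)
    records = []
    for audio_path in sorted(root.rglob("*")):
        if not audio_path.is_file() or audio_path.suffix.lower() not in AUDIO_EXTS:
            continue
        txt_path = audio_path.with_suffix(".txt")
        if not txt_path.is_file():
            continue  # skip audio that has no transcript
        text = txt_path.read_text(encoding="utf-8").strip()
        records.append(
            {
                "id": audio_path.stem,  # filename-based identifier
                "text": text,  # complete UTF-8 transcript
                # Assumption: the prefix is joined to the text with one space.
                "text_vibevoice": f"Speaker 1: {text}",
                "relpath": audio_path.relative_to(root).as_posix(),
            }
        )
    return records
```

Calling build_records on a local data root returns one dictionary per audio/transcript pair; audio files without a matching transcript are skipped.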



