ntnu-smil/LTTC-Dev-1964-0520
Source: Hugging Face | Updated: 2025-03-16 | Indexed: 2025-04-19
Download link:
https://hf-mirror.com/datasets/ntnu-smil/LTTC-Dev-1964-0520
Description:
The dataset contains features of speech samples, including speaker ID, audio transcription, audio file path, speech rate, pitch, volume, and pause duration. It also includes features derived from audio signal processing, such as energy, entropy, and frequency. The data is provided as a training split and can be used for research in speech recognition and speech synthesis.
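A minimal sketch of how the training split might be loaded with the Hugging Face `datasets` library. It assumes the repository is publicly readable and exposes a split named `train`. The helper `load_train_split` is hypothetical and not part of the dataset. Column names are not listed on this page, so the sketch prints them instead of guessing.

```python
DATASET_ID = "ntnu-smil/LTTC-Dev-1964-0520"  # repo ID taken from this listing


def load_train_split(dataset_id: str = DATASET_ID):
    """Load the training split (hypothetical helper, not part of the dataset)."""
    # Imported lazily so this module can be imported without `datasets` installed.
    from datasets import load_dataset

    # Assumption: the repository is publicly readable and has a "train" split.
    return load_dataset(dataset_id, split="train")


if __name__ == "__main__":
    ds = load_train_split()
    # Column names are not given on this page, so inspect them rather than guess.
    print(ds.column_names)
    print(ds[0])
```

Setting the `HF_ENDPOINT` environment variable to `https://hf-mirror.com` before running should route the download through the mirror linked on this page, assuming the mirror serves this repository.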
Provider:
ntnu-smil



