Lingalingeswaran/filtered_common_voice_tamil_english-preprocessed-quantized
收藏Hugging Face2024-12-16 更新2024-12-21 收录
下载链接:
https://hf-mirror.com/datasets/Lingalingeswaran/filtered_common_voice_tamil_english-preprocessed-quantized
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个特征,如用户ID、文件路径、音频数据、句子文本、点赞数、点踩数、年龄、性别、口音、地区、片段、输入ID、注意力掩码和标签。音频数据的采样率为48000。数据集分为一个训练集,包含2000个样本,总大小为2003491176字节。下载大小为1392336426字节。
This is a dataset containing speech data and related metadata. The dataset includes multiple fields such as client_id, path, audio, sentence, etc., where the audio field contains audio data with a sampling rate of 48000. Additionally, the dataset includes metadata related to speech, such as age, gender, accent, locale, etc. The dataset is divided into a training set, containing 2000 samples.
提供机构:
Lingalingeswaran



