Lingalingeswaran/common-voice-tamil-english-labeled-Data-v2

Name: Lingalingeswaran/common-voice-tamil-english-labeled-Data-v2
Creator: Lingalingeswaran
Published: 2024-12-15 16:52:08
License: Apache 2.0 (as stated in the summary; the listing's own license field is empty)

Source: Hugging Face. Updated 2024-12-15; indexed 2024-12-21.

Download link:

https://hf-mirror.com/datasets/Lingalingeswaran/common-voice-tamil-english-labeled-Data-v2
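The repository ID in the link above can be passed to the Hugging Face `datasets` library. The following is a minimal sketch, not a documented recipe from the dataset author: it assumes the `datasets` package is installed, that the repository is publicly readable, and that streaming is preferable to downloading the full archive. The helper name `stream_split` is hypothetical.

```python
# Repository ID taken from the download link of this listing.
REPO_ID = "Lingalingeswaran/common-voice-tamil-english-labeled-Data-v2"


def stream_split(split="test", limit=3):
    """Yield up to `limit` rows from one split without a full download.

    Hypothetical helper for illustration; the split names "train" and
    "test" come from the listing's summary.
    """
    # Imported lazily so the module loads even if `datasets` is missing.
    from datasets import load_dataset

    # streaming=True returns an IterableDataset that fetches rows on demand.
    ds = load_dataset(REPO_ID, split=split, streaming=True)
    for row in ds.take(limit):
        yield row
```

Calling `for row in stream_split("test"): print(row["language"])` would print the language label of the first few rows. Reading `row["audio"]` additionally requires an audio decoding backend, which depends on the installed `datasets` version.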

Summary:

The dataset contains two features: audio and language. The audio feature has a sampling rate of 48,000 Hz, and the language feature is of string type. The dataset is divided into a training set with 85,548 samples and a test set with 21,388 samples (106,936 in total). It covers Tamil (ta) and English (en) and is licensed under Apache 2.0. The total dataset size is 61,847,014,733 bytes, and the download size is 55,740,381,981 bytes. The listing's size category is n<1K (fewer than 1,000 samples), which is inconsistent with the split counts above.
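The figures in the summary can be sanity-checked with a little arithmetic. The sketch below uses only the numbers quoted in this listing; the function name `summarize` and the choice of GiB (2^30 bytes) as the unit are illustrative assumptions.

```python
# Values copied from the listing's summary.
TRAIN_ROWS = 85_548
TEST_ROWS = 21_388
DATASET_BYTES = 61_847_014_733
DOWNLOAD_BYTES = 55_740_381_981


def summarize():
    """Derive total row count, test fraction, and sizes in GiB."""
    total = TRAIN_ROWS + TEST_ROWS
    return {
        "total_rows": total,
        "test_fraction": TEST_ROWS / total,
        "dataset_gib": DATASET_BYTES / 2**30,
        "download_gib": DOWNLOAD_BYTES / 2**30,
    }


if __name__ == "__main__":
    for key, value in summarize().items():
        print(key, value)
```

The split works out to roughly 80% train and 20% test, and the row total is far above 1,000, so the n<1K size category does not match the stated split counts.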

Provider:

Lingalingeswaran
