Dev372/Medical_STT_Dataset_1.0
收藏Hugging Face2024-07-19 更新2024-07-22 收录
下载链接:
https://hf-mirror.com/datasets/Dev372/Medical_STT_Dataset_1.0
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含音频和对应的文本转录。音频的采样率为16000Hz,文本转录为字符串类型。数据集分为训练集和测试集,训练集包含1776个样本,测试集包含445个样本。数据集的下载大小为481125899字节,总大小为581736960.792字节。数据文件路径分别为data/train-*和data/test-*。
This dataset is primarily used for audio and speech recognition tasks, containing audio files and their corresponding transcriptions. The audio files have a sampling rate of 16000 Hz, and the transcriptions are in string format. The dataset is divided into a training set and a test set, with 1776 samples in the training set and 445 samples in the test set. The total download size of the dataset is 481125899 bytes, and the total size is 581736960.792 bytes. The dataset is configured as the default configuration, with the training set and test set data files stored in the data/train-* and data/test-* paths, respectively.
提供机构:
Dev372



