Dev372/Cardiology_Medical_STT_Dataset
收藏Hugging Face2024-07-17 更新2024-07-22 收录
下载链接:
https://hf-mirror.com/datasets/Dev372/Cardiology_Medical_STT_Dataset
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含音频和对应的转录文本,音频的采样率为16000Hz,转录文本为字符串类型。数据集包含一个训练分割,共有1530个样本,总大小为351247868.07字节,下载大小为278663057字节。默认配置下的数据文件路径为data/train-*。
This dataset is primarily used for audio processing and speech recognition tasks. It includes two main features: audio and transcripts. The audio feature has a sampling rate of 16000 Hz, suitable for high-precision audio analysis. The transcripts feature is of string type, used for training speech recognition models. The dataset is divided into a training set, containing 1530 samples, with a total size of 351247868.07 bytes. The download size of the dataset is 278663057 bytes. The dataset configuration is set to default, with the training data file path being data/train-*.
提供机构:
Dev372
原始信息汇总
数据集概述
数据特征
- 音频
- 采样率: 16000
- 转录文本
- 数据类型: 字符串
数据分割
- 训练集
- 文件大小: 351247868.07 字节
- 样本数量: 1530
数据集大小
- 下载大小: 278663057 字节
- 总大小: 351247868.07 字节
配置
- 默认配置
- 数据文件路径: data/train-*



