procit002/procit002voicesample_from_stc_july_12_2

Name: procit002/procit002voicesample_from_stc_july_12_2
Creator: procit002
Published: 2024-07-12 06:18:20
License: 暂无描述

Hugging Face2024-07-12 更新2024-07-13 收录

下载链接：

https://hf-mirror.com/datasets/procit002/procit002voicesample_from_stc_july_12_2

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含1500个训练样本，每个样本包含以下特征字段：speaker_id（说话者ID）、speaker_name（说话者姓名）、accent（口音）、text（文本）、audiopath（音频路径）和gender（性别）。所有字段的数据类型均为字符串。数据集的总大小为292667字节，下载大小为47488字节。数据集的默认配置指定了训练数据文件的路径。

The dataset contains 1500 training samples, each with the following feature fields: speaker_id, speaker_name, accent, text, audiopath, and gender. All fields are of string data type. The total size of the dataset is 292667 bytes, and the download size is 47488 bytes. The default configuration of the dataset specifies the path to the training data files.

提供机构：

procit002

原始信息汇总

数据集概述

数据特征

speaker_id: 说话者ID，数据类型为字符串。
speaker_name: 说话者姓名，数据类型为字符串。
accent: 口音，数据类型为字符串。
text: 文本内容，数据类型为字符串。
audiopath: 音频路径，数据类型为字符串。
gender: 性别，数据类型为字符串。

数据集划分

train: 训练集，包含1500个样本，总大小为292667字节。

数据集大小

下载大小: 47488字节
数据集总大小: 292667字节

配置信息

config_name: default
- data_files:
  - split: train
  - path: data/train-*

5,000+

优质数据集

54 个

任务类型

进入经典数据集