aleadag/cv2_tags
收藏Hugging Face2024-04-19 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/aleadag/cv2_tags
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
config_name: zh-CN
features:
- name: client_id
dtype: string
- name: path
dtype: string
- name: text
dtype: string
- name: up_votes
dtype: int64
- name: down_votes
dtype: int64
- name: age
dtype: string
- name: gender
dtype: string
- name: accent
dtype: string
- name: locale
dtype: string
- name: segment
dtype: string
- name: utterance_pitch_mean
dtype: float32
- name: utterance_pitch_std
dtype: float32
- name: snr
dtype: float64
- name: c50
dtype: float64
- name: speaking_rate
dtype: float64
- name: phonemes
dtype: string
splits:
- name: train
num_bytes: 1304065
num_examples: 2301
- name: test
num_bytes: 1092081
num_examples: 1950
- name: validation
num_bytes: 1106655
num_examples: 1947
- name: other
num_bytes: 10166
num_examples: 19
- name: invalidated
num_bytes: 441144
num_examples: 777
download_size: 1735823
dataset_size: 3954111
configs:
- config_name: zh-CN
data_files:
- split: train
path: zh-CN/train-*
- split: test
path: zh-CN/test-*
- split: validation
path: zh-CN/validation-*
- split: other
path: zh-CN/other-*
- split: invalidated
path: zh-CN/invalidated-*
---
提供机构:
aleadag
原始信息汇总
数据集概述
数据集特征
- client_id: 字符串类型
- path: 字符串类型
- text: 字符串类型
- up_votes: 整数类型
- down_votes: 整数类型
- age: 字符串类型
- gender: 字符串类型
- accent: 字符串类型
- locale: 字符串类型
- segment: 字符串类型
- utterance_pitch_mean: 浮点数类型
- utterance_pitch_std: 浮点数类型
- snr: 浮点数类型
- c50: 浮点数类型
- speaking_rate: 浮点数类型
- phonemes: 字符串类型
数据集分割
- train: 2301个样本,占用1304065字节
- test: 1950个样本,占用1092081字节
- validation: 1947个样本,占用1106655字节
- other: 19个样本,占用10166字节
- invalidated: 777个样本,占用441144字节
数据集大小
- 下载大小: 1735823字节
- 数据集总大小: 3954111字节



