quocanh34/new_nlu_tts3
收藏Hugging Face2023-10-28 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/quocanh34/new_nlu_tts3
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: id
dtype: string
- name: audio
struct:
- name: array
sequence: float32
- name: path
dtype: string
- name: sampling_rate
dtype: int64
- name: pred_str
dtype: string
- name: pred_str_norm
dtype: string
- name: intent
dtype: string
- name: entities
list:
- name: filler
dtype: string
- name: type
dtype: string
- name: file
dtype: string
splits:
- name: train
num_bytes: 568309188
num_examples: 2139
download_size: 462242612
dataset_size: 568309188
---
# Dataset Card for "new_nlu_tts3"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
The dataset includes multiple features such as id, audio data (including array, path, and sampling rate), predicted string, normalized predicted string, intent, list of entities, and file name. The dataset is divided into a training set, containing 2139 samples. The download size and actual size of the dataset are also clearly recorded. However, the README file does not provide a detailed description of the dataset, only the structure of the dataset and some statistical information.
提供机构:
quocanh34
原始信息汇总
数据集概述
数据集信息
- 特征列表:
- id: 数据类型为字符串。
- audio: 包含以下子结构:
- array: 序列类型,数据类型为float32。
- path: 数据类型为字符串。
- sampling_rate: 数据类型为int64。
- pred_str: 数据类型为字符串。
- pred_str_norm: 数据类型为字符串。
- intent: 数据类型为字符串。
- entities: 列表类型,包含以下子结构:
- filler: 数据类型为字符串。
- type: 数据类型为字符串。
- file: 数据类型为字符串。
数据分割
- train:
- 字节数: 568309188
- 样本数: 2139
数据集大小
- 下载大小: 462242612字节
- 数据集大小: 568309188字节



