JST-SUPERB/slue-sqa5-test-LLM_unit
Source: Hugging Face. Updated 2024-06-09; indexed 2024-06-29.
Download link:
https://hf-mirror.com/datasets/JST-SUPERB/slue-sqa5-test-LLM_unit
Resource description:
---
configs:
- config_name: default
  data_files:
  - split: academicodec_hifi_16k_320d
    path: data/academicodec_hifi_16k_320d-*
  - split: academicodec_hifi_16k_320d_large_uni
    path: data/academicodec_hifi_16k_320d_large_uni-*
  - split: academicodec_hifi_24k_320d
    path: data/academicodec_hifi_24k_320d-*
  - split: audiodec_24k_320d
    path: data/audiodec_24k_320d-*
  - split: dac_16k
    path: data/dac_16k-*
  - split: dac_24k
    path: data/dac_24k-*
  - split: dac_44k
    path: data/dac_44k-*
  - split: encodec_24k_12bps
    path: data/encodec_24k_12bps-*
  - split: encodec_24k_1_5bps
    path: data/encodec_24k_1_5bps-*
  - split: encodec_24k_24bps
    path: data/encodec_24k_24bps-*
  - split: encodec_24k_3bps
    path: data/encodec_24k_3bps-*
  - split: encodec_24k_6bps
    path: data/encodec_24k_6bps-*
  - split: funcodec_en_libritts_16k_gr1nq32ds320
    path: data/funcodec_en_libritts_16k_gr1nq32ds320-*
  - split: funcodec_en_libritts_16k_gr8nq32ds320
    path: data/funcodec_en_libritts_16k_gr8nq32ds320-*
  - split: funcodec_en_libritts_16k_nq32ds320
    path: data/funcodec_en_libritts_16k_nq32ds320-*
  - split: funcodec_en_libritts_16k_nq32ds640
    path: data/funcodec_en_libritts_16k_nq32ds640-*
  - split: funcodec_zh_en_16k_nq32ds320
    path: data/funcodec_zh_en_16k_nq32ds320-*
  - split: funcodec_zh_en_16k_nq32ds640
    path: data/funcodec_zh_en_16k_nq32ds640-*
  - split: speech_tokenizer_16k
    path: data/speech_tokenizer_16k-*
dataset_info:
  features:
  - name: raw_question_text
    dtype: string
  - name: LLM-answer
    dtype: string
  - name: id
    dtype: int64
  - name: unit
    sequence:
      sequence: int64
  splits:
  - name: academicodec_hifi_16k_320d
    num_bytes: 20733309
    num_examples: 1978
  - name: academicodec_hifi_16k_320d_large_uni
    num_bytes: 20733309
    num_examples: 1978
  - name: academicodec_hifi_24k_320d
    num_bytes: 30589501
    num_examples: 1978
  - name: audiodec_24k_320d
    num_bytes: 64252765
    num_examples: 1978
  - name: dac_16k
    num_bytes: 78268061
    num_examples: 1978
  - name: dac_24k
    num_bytes: 308745757
    num_examples: 1978
  - name: dac_44k
    num_bytes: 100256813
    num_examples: 1978
  - name: encodec_24k_12bps
    num_bytes: 119611037
    num_examples: 1978
  - name: encodec_24k_1_5bps
    num_bytes: 15789389
    num_examples: 1978
  - name: encodec_24k_24bps
    num_bytes: 238264349
    num_examples: 1978
  - name: encodec_24k_3bps
    num_bytes: 30621053
    num_examples: 1978
  - name: encodec_24k_6bps
    num_bytes: 60284381
    num_examples: 1978
  - name: funcodec_en_libritts_16k_gr1nq32ds320
    num_bytes: 159668765
    num_examples: 1978
  - name: funcodec_en_libritts_16k_gr8nq32ds320
    num_bytes: 159668765
    num_examples: 1978
  - name: funcodec_en_libritts_16k_nq32ds320
    num_bytes: 159162397
    num_examples: 1978
  - name: funcodec_en_libritts_16k_nq32ds640
    num_bytes: 80312861
    num_examples: 1978
  - name: funcodec_zh_en_16k_nq32ds320
    num_bytes: 159162397
    num_examples: 1978
  - name: funcodec_zh_en_16k_nq32ds640
    num_bytes: 80312861
    num_examples: 1978
  - name: speech_tokenizer_16k
    num_bytes: 40508893
    num_examples: 1978
  download_size: 292720180
  dataset_size: 1926946663
---
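The `splits` metadata gives only byte counts, so a quick calculation helps show how storage grows across the five `encodec_24k_*` splits. The sketch below copies the `num_bytes` values from the card; the helper name `bytes_per_example` is illustrative, and reading the numeric suffix in each split name as a bandwidth setting is an assumption, since the card does not define it.

```python
NUM_EXAMPLES = 1978  # every split in the card reports this count

# num_bytes for the EnCodec splits, copied from the card metadata.
ENCODEC_BYTES = {
    "encodec_24k_1_5bps": 15789389,
    "encodec_24k_3bps": 30621053,
    "encodec_24k_6bps": 60284381,
    "encodec_24k_12bps": 119611037,
    "encodec_24k_24bps": 238264349,
}


def bytes_per_example(num_bytes: int, num_examples: int = NUM_EXAMPLES) -> float:
    """Average stored size of one example in a split."""
    return num_bytes / num_examples


# Compare each split against the smallest one.
baseline = ENCODEC_BYTES["encodec_24k_1_5bps"]
for name, size in sorted(ENCODEC_BYTES.items(), key=lambda kv: kv[1]):
    print(f"{name:20s} {bytes_per_example(size):10.1f} B/example  x{size / baseline:.2f}")
```

Each step up in the suffix roughly doubles the stored size, which would be consistent with more code sequences per example, though the card itself does not say so.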
# Dataset Card for "slue-sqa5-LLM_unit"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
The slue-sqa5-LLM_unit dataset contains 19 splits, one per audio codec configuration (for example academicodec_hifi_16k_320d, dac_24k, and encodec_24k_6bps); the split names encode the codec and, in most cases, its sampling rate. Every split holds 1,978 examples and is stored under its own data/<split>-* path. Each example has four features: raw_question_text (string), LLM-answer (string), id (int64), and unit, a nested sequence (a sequence of sequences) of int64 values. The download size is 292,720,180 bytes and the full dataset size is 1,926,946,663 bytes.
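A minimal sketch of how one split might be loaded and its `unit` field inspected, assuming the Hugging Face `datasets` library is installed and the repository is reachable under the ID shown in the download link. The names `load_split` and `unit_shape` are illustrative and not part of the dataset. The card does not say what the outer and inner dimensions of `unit` represent, so the helper only reports their sizes.

```python
from typing import List, Tuple

REPO_ID = "JST-SUPERB/slue-sqa5-test-LLM_unit"  # taken from the download link


def unit_shape(unit: List[List[int]]) -> Tuple[int, List[int]]:
    """Return (number of inner sequences, length of each inner sequence)."""
    return len(unit), [len(row) for row in unit]


def load_split(split: str = "dac_16k"):
    """Load one codec split. Requires network access and `pip install datasets`."""
    from datasets import load_dataset  # lazy import so unit_shape works offline

    return load_dataset(REPO_ID, split=split)


# Example usage (not executed here because it downloads data):
#   ds = load_split("encodec_24k_1_5bps")
#   example = ds[0]
#   print(example["id"], example["raw_question_text"][:80])
#   print(unit_shape(example["unit"]))
```

Loading a single named split avoids downloading the other 18, which matters because the largest split alone (dac_24k) is over 300 MB uncompressed.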
Provided by:
JST-SUPERB
Summary of original information
Dataset overview
Dataset name
slue-sqa5-LLM_unit
Dataset configuration
- Config name: default
- Data file paths:
  academicodec_hifi_16k_320d: data/academicodec_hifi_16k_320d-*
  academicodec_hifi_16k_320d_large_uni: data/academicodec_hifi_16k_320d_large_uni-*
  academicodec_hifi_24k_320d: data/academicodec_hifi_24k_320d-*
  audiodec_24k_320d: data/audiodec_24k_320d-*
  dac_16k: data/dac_16k-*
  dac_24k: data/dac_24k-*
  dac_44k: data/dac_44k-*
  encodec_24k_12bps: data/encodec_24k_12bps-*
  encodec_24k_1_5bps: data/encodec_24k_1_5bps-*
  encodec_24k_24bps: data/encodec_24k_24bps-*
  encodec_24k_3bps: data/encodec_24k_3bps-*
  encodec_24k_6bps: data/encodec_24k_6bps-*
  funcodec_en_libritts_16k_gr1nq32ds320: data/funcodec_en_libritts_16k_gr1nq32ds320-*
  funcodec_en_libritts_16k_gr8nq32ds320: data/funcodec_en_libritts_16k_gr8nq32ds320-*
  funcodec_en_libritts_16k_nq32ds320: data/funcodec_en_libritts_16k_nq32ds320-*
  funcodec_en_libritts_16k_nq32ds640: data/funcodec_en_libritts_16k_nq32ds640-*
  funcodec_zh_en_16k_nq32ds320: data/funcodec_zh_en_16k_nq32ds320-*
  funcodec_zh_en_16k_nq32ds640: data/funcodec_zh_en_16k_nq32ds640-*
  speech_tokenizer_16k: data/speech_tokenizer_16k-*
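Several split names share a prefix (for example `academicodec_hifi_16k_320d` and `academicodec_hifi_16k_320d_large_uni`), so it is worth checking that the glob patterns do not overlap. The sketch below uses Python's `fnmatch` as a stand-in for the Hub's own pattern resolution, which may differ in details. The shard file names are hypothetical and serve only to exercise the patterns.

```python
from fnmatch import fnmatch
from typing import List

# Split name -> glob pattern, copied from the data file paths above (subset).
DATA_FILES = {
    "academicodec_hifi_16k_320d": "data/academicodec_hifi_16k_320d-*",
    "academicodec_hifi_16k_320d_large_uni": "data/academicodec_hifi_16k_320d_large_uni-*",
    "dac_16k": "data/dac_16k-*",
}


def splits_for(path: str) -> List[str]:
    """Return every split whose glob pattern matches the given file path."""
    return [name for name, pattern in DATA_FILES.items() if fnmatch(path, pattern)]


# Hypothetical shard name; the real file names are not listed in the card.
shard = "data/academicodec_hifi_16k_320d_large_uni-00000-of-00001.parquet"
print(splits_for(shard))
```

Because each pattern ends in a literal `-` before the wildcard, the shorter pattern does not match files belonging to the longer split name.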
Dataset information
- Features:
  raw_question_text: string
  LLM-answer: string
  id: int64
  unit: sequence of sequences of int64
- Splits:
  academicodec_hifi_16k_320d: num_bytes: 20733309, num_examples: 1978
  academicodec_hifi_16k_320d_large_uni: num_bytes: 20733309, num_examples: 1978
  academicodec_hifi_24k_320d: num_bytes: 30589501, num_examples: 1978
  audiodec_24k_320d: num_bytes: 64252765, num_examples: 1978
  dac_16k: num_bytes: 78268061, num_examples: 1978
  dac_24k: num_bytes: 308745757, num_examples: 1978
  dac_44k: num_bytes: 100256813, num_examples: 1978
  encodec_24k_12bps: num_bytes: 119611037, num_examples: 1978
  encodec_24k_1_5bps: num_bytes: 15789389, num_examples: 1978
  encodec_24k_24bps: num_bytes: 238264349, num_examples: 1978
  encodec_24k_3bps: num_bytes: 30621053, num_examples: 1978
  encodec_24k_6bps: num_bytes: 60284381, num_examples: 1978
  funcodec_en_libritts_16k_gr1nq32ds320: num_bytes: 159668765, num_examples: 1978
  funcodec_en_libritts_16k_gr8nq32ds320: num_bytes: 159668765, num_examples: 1978
  funcodec_en_libritts_16k_nq32ds320: num_bytes: 159162397, num_examples: 1978
  funcodec_en_libritts_16k_nq32ds640: num_bytes: 80312861, num_examples: 1978
  funcodec_zh_en_16k_nq32ds320: num_bytes: 159162397, num_examples: 1978
  funcodec_zh_en_16k_nq32ds640: num_bytes: 80312861, num_examples: 1978
  speech_tokenizer_16k: num_bytes: 40508893, num_examples: 1978
- Download size: 292720180 bytes
- Dataset size: 1926946663 bytes
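The reported sizes can be cross-checked with simple arithmetic: the per-split `num_bytes` values should add up to the dataset size, and the ratio of download size to dataset size indicates how much smaller the stored files are. The sketch below copies all figures from the listing above. Attributing the difference to file compression is an assumption, since the page does not state the storage format.

```python
# num_bytes per split, copied from the split listing above.
SPLIT_BYTES = {
    "academicodec_hifi_16k_320d": 20733309,
    "academicodec_hifi_16k_320d_large_uni": 20733309,
    "academicodec_hifi_24k_320d": 30589501,
    "audiodec_24k_320d": 64252765,
    "dac_16k": 78268061,
    "dac_24k": 308745757,
    "dac_44k": 100256813,
    "encodec_24k_12bps": 119611037,
    "encodec_24k_1_5bps": 15789389,
    "encodec_24k_24bps": 238264349,
    "encodec_24k_3bps": 30621053,
    "encodec_24k_6bps": 60284381,
    "funcodec_en_libritts_16k_gr1nq32ds320": 159668765,
    "funcodec_en_libritts_16k_gr8nq32ds320": 159668765,
    "funcodec_en_libritts_16k_nq32ds320": 159162397,
    "funcodec_en_libritts_16k_nq32ds640": 80312861,
    "funcodec_zh_en_16k_nq32ds320": 159162397,
    "funcodec_zh_en_16k_nq32ds640": 80312861,
    "speech_tokenizer_16k": 40508893,
}
DOWNLOAD_SIZE = 292720180
DATASET_SIZE = 1926946663

total = sum(SPLIT_BYTES.values())          # sum of all per-split sizes
ratio = DOWNLOAD_SIZE / DATASET_SIZE       # fraction of the full size that is downloaded

print(f"splits: {len(SPLIT_BYTES)}")
print(f"sum of num_bytes: {total} (matches dataset size: {total == DATASET_SIZE})")
print(f"download size / dataset size: {ratio:.3f}")
```

The per-split figures sum exactly to the stated dataset size, and the download is roughly 15% of that total.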



