tchiayan/testing-dnsmos
收藏Hugging Face2026-04-15 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/tchiayan/testing-dnsmos
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: audio_filename
dtype: large_string
- name: text
dtype: large_string
- name: speaker
dtype: large_string
- name: OVRL_raw
dtype: float64
- name: SIG_raw
dtype: float64
- name: BAK_raw
dtype: float64
- name: OVRL
dtype: float64
- name: SIG
dtype: float64
- name: BAK
dtype: float64
- name: subset
dtype: large_string
splits:
- name: 700h_tr_turkish_text_to_speech
num_bytes: 795205
num_examples: 1965
- name: 9jalingo_hausa
num_bytes: 12834821
num_examples: 57173
- name: 9jalingo_igbo
num_bytes: 12404159
num_examples: 43863
- name: 9jalingo_pidgin
num_bytes: 1514487
num_examples: 5452
- name: Nanchang_Dialect_Conversational_Speech_Corpus
num_bytes: 461736
num_examples: 1195
- name: CommonPhoneDataset
num_bytes: 12526929
num_examples: 30986
- name: NepaliONE_tts
num_bytes: 2329239
num_examples: 6537
- name: camoes_SI
num_bytes: 831098
num_examples: 3890
- name: azerbaijani_audiobooks
num_bytes: 2997818
num_examples: 7281
- name: amharic_cleaned_testset_verified
num_bytes: 13215362
num_examples: 26409
- name: StoryTTS
num_bytes: 7679115
num_examples: 29264
- name: Lahaja
num_bytes: 1552188
num_examples: 4120
- name: urdu_voice_dataset
num_bytes: 1514760
num_examples: 5580
- name: IndicTTS
num_bytes: 2315112
num_examples: 6884
- name: DarijaTTS_clean
num_bytes: 2858707
num_examples: 11200
- name: Japanese_Anime_Speech_v2
num_bytes: 5768178
num_examples: 19313
- name: afrispeech_afrikaans
num_bytes: 539634
num_examples: 1681
- name: shrutilipi_sanskrit
num_bytes: 5470132
num_examples: 10281
- name: uzbekvoice_2k_each_accent
num_bytes: 1036
num_examples: 4
- name: ParlaSpeech_PL
num_bytes: 590966
num_examples: 1761
- name: MASC_Arabic
num_bytes: 12376897
num_examples: 50029
- name: assamese_speech_dataset1
num_bytes: 550243
num_examples: 1853
download_size: 39326089
dataset_size: 101127822
configs:
- config_name: default
data_files:
- split: 700h_tr_turkish_text_to_speech
path: data/700h_tr_turkish_text_to_speech-*
- split: 9jalingo_hausa
path: data/9jalingo_hausa-*
- split: 9jalingo_igbo
path: data/9jalingo_igbo-*
- split: 9jalingo_pidgin
path: data/9jalingo_pidgin-*
- split: Nanchang_Dialect_Conversational_Speech_Corpus
path: data/Nanchang_Dialect_Conversational_Speech_Corpus-*
- split: CommonPhoneDataset
path: data/CommonPhoneDataset-*
- split: NepaliONE_tts
path: data/NepaliONE_tts-*
- split: camoes_SI
path: data/camoes_SI-*
- split: azerbaijani_audiobooks
path: data/azerbaijani_audiobooks-*
- split: amharic_cleaned_testset_verified
path: data/amharic_cleaned_testset_verified-*
- split: StoryTTS
path: data/StoryTTS-*
- split: Lahaja
path: data/Lahaja-*
- split: urdu_voice_dataset
path: data/urdu_voice_dataset-*
- split: IndicTTS
path: data/IndicTTS-*
- split: DarijaTTS_clean
path: data/DarijaTTS_clean-*
- split: Japanese_Anime_Speech_v2
path: data/Japanese_Anime_Speech_v2-*
- split: afrispeech_afrikaans
path: data/afrispeech_afrikaans-*
- split: shrutilipi_sanskrit
path: data/shrutilipi_sanskrit-*
- split: uzbekvoice_2k_each_accent
path: data/uzbekvoice_2k_each_accent-*
- split: ParlaSpeech_PL
path: data/ParlaSpeech_PL-*
- split: MASC_Arabic
path: data/MASC_Arabic-*
- split: assamese_speech_dataset1
path: data/assamese_speech_dataset1-*
---
提供机构:
tchiayan



