EtMmohammedHafsati/darija_tts_clean_metadata_full

Hugging Face, updated 2026-04-04, indexed 2026-04-12
Download link:
https://hf-mirror.com/datasets/EtMmohammedHafsati/darija_tts_clean_metadata_full
Description:
---
dataset_info:
  features:
  - name: audio
    dtype:
      audio:
        decode: false
  - name: text
    dtype: string
  - name: duration_sec
    dtype: float64
  - name: num_tokens
    dtype: int64
  - name: median_tokens_reference
    dtype: float64
  - name: token_count_outlier
    dtype: bool
  - name: remove_short_audio_long_text
    dtype: bool
  - name: remove_token_outlier
    dtype: bool
  - name: num_speakers
    dtype: int64
  - name: multiple_speakers
    dtype: bool
  - name: speaker_turns
    dtype: int64
  - name: turns_per_minute
    dtype: float64
  - name: dominant_speaker_ratio
    dtype: float64
  - name: second_speaker_ratio
    dtype: float64
  - name: non_dominant_speech_ratio
    dtype: float64
  - name: speaker_balance_score
    dtype: float64
  - name: speaker_entropy
    dtype: float64
  - name: overlap_speech
    dtype: bool
  - name: overlap_duration_sec
    dtype: float64
  - name: overlap_ratio
    dtype: float64
  - name: num_overlap_regions
    dtype: int64
  - name: mean_overlap_region_sec
    dtype: float64
  - name: max_overlap_region_sec
    dtype: float64
  - name: max_concurrent_speakers
    dtype: int64
  - name: diarization_empty_output
    dtype: bool
  - name: asr_usability_score
    dtype: float64
  - name: asr_usable_single_speaker
    dtype: bool
  - name: source_split_original
    dtype: string
  - name: metadata_generated_by
    dtype: string
  - name: processing_error
    dtype: bool
  splits:
  - name: train
    num_bytes: 2019255987
    num_examples: 14550
  - name: validation
    num_bytes: 252441693
    num_examples: 1819
  - name: test
    num_bytes: 252441693
    num_examples: 1819
  download_size: 2694417381
  dataset_size: 2524139373
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
  - split: validation
    path: data/validation-*
  - split: test
    path: data/test-*
---
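The metadata above lists per-clip quality flags, so a natural first step is to filter on them. The following is a minimal sketch, not the dataset author's documented workflow. It assumes that a `True` value in `processing_error`, `remove_short_audio_long_text`, `remove_token_outlier`, or `overlap_speech` marks a clip to drop, and that `asr_usable_single_speaker` marks a clip to keep; the card does not define these semantics, so check a few rows before relying on them. The helper names `is_clean_row`, `total_hours`, and `load_split` are hypothetical.

```python
from typing import Any, Dict, Iterable

# Repository id taken from the listing above.
REPO_ID = "EtMmohammedHafsati/darija_tts_clean_metadata_full"


def is_clean_row(row: Dict[str, Any]) -> bool:
    """Keep a clip only if no drop flag is set (assumed flag semantics)."""
    return (
        bool(row.get("asr_usable_single_speaker", False))
        and not row.get("processing_error", False)
        and not row.get("remove_short_audio_long_text", False)
        and not row.get("remove_token_outlier", False)
        and not row.get("overlap_speech", False)
    )


def total_hours(rows: Iterable[Dict[str, Any]]) -> float:
    """Sum the duration_sec column and convert seconds to hours."""
    return sum(float(r["duration_sec"]) for r in rows) / 3600.0


def load_split(split: str = "train"):
    """Load one split from the Hub (needs `pip install datasets` and network access)."""
    from datasets import load_dataset  # imported lazily so the helpers above work offline

    return load_dataset(REPO_ID, split=split)
```

With the `datasets` library installed, `load_split("validation").filter(is_clean_row)` would apply the same predicate to the 1,819-example validation split. Because the `audio` feature is declared with `decode: false`, each row should carry the raw audio bytes or path rather than a decoded waveform.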
Provided by:
EtMmohammedHafsati