five

tsw0411/si

Source: Hugging Face. Updated 2026-04-25; indexed 2026-05-03.
Download link:
https://hf-mirror.com/datasets/tsw0411/si
Resource description:
---
dataset_info:
- config_name: si_1
  features:
  - name: session_id
    dtype: string
  - name: audio
    dtype: audio
  - name: targets
    sequence:
      sequence: int8
  - name: speaker_ids
    sequence: string
  - name: duration
    dtype: float64
  - name: num_speakers
    dtype: int32
  - name: valid_offsets
    sequence: float64
  splits:
  - name: train
    num_bytes: 113095160557.84
    num_examples: 4447
  download_size: 88431231829
  dataset_size: 113095160557.84
- config_name: si_10
  features:
  - name: session_id
    dtype: string
  - name: audio
    dtype: audio
  - name: targets
    sequence:
      sequence: int8
  - name: speaker_ids
    sequence: string
  - name: duration
    dtype: float64
  - name: num_speakers
    dtype: int32
  - name: valid_offsets
    sequence: float64
  splits:
  - name: train
    num_bytes: 99584645150.68
    num_examples: 4687
  download_size: 91697414585
  dataset_size: 99584645150.68
- config_name: si_11
  features:
  - name: session_id
    dtype: string
  - name: audio
    dtype: audio
  - name: targets
    sequence:
      sequence: int8
  - name: speaker_ids
    sequence: string
  - name: duration
    dtype: float64
  - name: num_speakers
    dtype: int32
  - name: valid_offsets
    sequence: float64
  splits:
  - name: train
    num_bytes: 102575788161.8
    num_examples: 4761
  download_size: 97803179637
  dataset_size: 102575788161.8
- config_name: si_12
  features:
  - name: session_id
    dtype: string
  - name: audio
    dtype: audio
  - name: targets
    sequence:
      sequence: int8
  - name: speaker_ids
    sequence: string
  - name: duration
    dtype: float64
  - name: num_speakers
    dtype: int32
  - name: valid_offsets
    sequence: float64
  splits:
  - name: train
    num_bytes: 110556179671.2
    num_examples: 4820
  download_size: 103736155830
  dataset_size: 110556179671.2
- config_name: si_13
  features:
  - name: session_id
    dtype: string
  - name: audio
    dtype: audio
  - name: targets
    sequence:
      sequence: int8
  - name: speaker_ids
    sequence: string
  - name: duration
    dtype: float64
  - name: num_speakers
    dtype: int32
  - name: valid_offsets
    sequence: float64
  splits:
  - name: train
    num_bytes: 116174825017.2
    num_examples: 4555
  download_size: 106994199901
  dataset_size: 116174825017.2
- config_name: si_6
  features:
  - name: session_id
    dtype: string
  - name: audio
    dtype: audio
  - name: targets
    sequence:
      sequence: int8
  - name: speaker_ids
    sequence: string
  - name: duration
    dtype: float64
  - name: num_speakers
    dtype: int32
  - name: valid_offsets
    sequence: float64
  splits:
  - name: train
    num_bytes: 119283349029.96
    num_examples: 4829
  download_size: 92072300548
  dataset_size: 119283349029.96
- config_name: si_7
  features:
  - name: session_id
    dtype: string
  - name: audio
    dtype: audio
  - name: targets
    sequence:
      sequence: int8
  - name: speaker_ids
    sequence: string
  - name: duration
    dtype: float64
  - name: num_speakers
    dtype: int32
  - name: valid_offsets
    sequence: float64
  splits:
  - name: train
    num_bytes: 93391269908.0
    num_examples: 4950
  download_size: 86579589870
  dataset_size: 93391269908.0
- config_name: si_8
  features:
  - name: session_id
    dtype: string
  - name: audio
    dtype: audio
  - name: targets
    sequence:
      sequence: int8
  - name: speaker_ids
    sequence: string
  - name: duration
    dtype: float64
  - name: num_speakers
    dtype: int32
  - name: valid_offsets
    sequence: float64
  splits:
  - name: train
    num_bytes: 86974924179.36
    num_examples: 4946
  download_size: 69557464795
  dataset_size: 86974924179.36
- config_name: si_9
  features:
  - name: session_id
    dtype: string
  - name: audio
    dtype: audio
  - name: targets
    sequence:
      sequence: int8
  - name: speaker_ids
    sequence: string
  - name: duration
    dtype: float64
  - name: num_speakers
    dtype: int32
  - name: valid_offsets
    sequence: float64
  splits:
  - name: train
    num_bytes: 78312787325.08
    num_examples: 4733
  download_size: 74770278342
  dataset_size: 78312787325.08
configs:
- config_name: si_1
  data_files:
  - split: train
    path: si_1/train-*
- config_name: si_10
  data_files:
  - split: train
    path: si_10/train-*
- config_name: si_11
  data_files:
  - split: train
    path: si_11/train-*
- config_name: si_12
  data_files:
  - split: train
    path: si_12/train-*
- config_name: si_13
  data_files:
  - split: train
    path: si_13/train-*
- config_name: si_6
  data_files:
  - split: train
    path: si_6/train-*
- config_name: si_7
  data_files:
  - split: train
    path: si_7/train-*
- config_name: si_8
  data_files:
  - split: train
    path: si_8/train-*
- config_name: si_9
  data_files:
  - split: train
    path: si_9/train-*
---
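Since each config in the card above is tens of gigabytes, it may help to tally the listed sizes before downloading anything and to stream a single config rather than fetch it whole. The following is a minimal sketch, not an official loader: `CONFIGS`, `summarize`, and `stream_config` are hypothetical names introduced here, the numbers are copied from the card, and `stream_config` assumes the Hugging Face `datasets` library is installed and that the repository is reachable under the ID `tsw0411/si`.

```python
# Per-config metadata copied from the dataset card:
# config name -> (num_examples, download_size in bytes)
CONFIGS = {
    "si_1": (4447, 88431231829),
    "si_6": (4829, 92072300548),
    "si_7": (4950, 86579589870),
    "si_8": (4946, 69557464795),
    "si_9": (4733, 74770278342),
    "si_10": (4687, 91697414585),
    "si_11": (4761, 97803179637),
    "si_12": (4820, 103736155830),
    "si_13": (4555, 106994199901),
}


def summarize(configs=CONFIGS):
    """Return (total examples, total download bytes) across all configs."""
    total_examples = sum(n for n, _ in configs.values())
    total_bytes = sum(b for _, b in configs.values())
    return total_examples, total_bytes


def stream_config(name):
    """Hypothetical helper: open one config's train split in streaming mode.

    Assumes the `datasets` library is installed and the repo ID is
    `tsw0411/si`. Streaming avoids downloading the full config up front.
    """
    if name not in CONFIGS:
        raise ValueError(f"unknown config: {name!r}")
    from datasets import load_dataset  # imported lazily; needs network access

    return load_dataset("tsw0411/si", name, split="train", streaming=True)


if __name__ == "__main__":
    examples, size = summarize()
    print(f"{len(CONFIGS)} configs, {examples} examples, {size / 1e9:.1f} GB to download")
```

By the card's own figures, the nine configs together hold 42,728 training examples and roughly 812 GB of download. Iterating over the object returned by `stream_config("si_8")` would, under the assumptions above, yield records with the fields listed in the card (`session_id`, `audio`, `targets`, `speaker_ids`, `duration`, `num_speakers`, `valid_offsets`).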
Provider:
tsw0411