five

n-order/Thai-dialect-corpus

收藏
Hugging Face2024-06-14 更新2025-11-01 收录
下载链接:
https://hf-mirror.com/datasets/n-order/Thai-dialect-corpus
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: features: - name: utterance dtype: string - name: sentence dtype: string - name: audio dtype: audio - name: thai_sentence dtype: string - name: dialect_type dtype: string splits: - name: central_train num_bytes: 7973415857.192 num_examples: 335674 - name: central_test num_bytes: 116837506.8 num_examples: 5465 - name: khummuang_train num_bytes: 373167194.288 num_examples: 23738 - name: khummuang_test num_bytes: 16927646.0 num_examples: 805 - name: korat_train num_bytes: 539250138.9679999 num_examples: 39624 - name: korat_test num_bytes: 17135897.240000002 num_examples: 1080 - name: pattani_train num_bytes: 454590637.16 num_examples: 38527 - name: pattani_test num_bytes: 14139981.0 num_examples: 834 download_size: 8940871781 dataset_size: 9505464858.648 configs: - config_name: default data_files: - split: central_train path: data/central_train-* - split: central_test path: data/central_test-* - split: khummuang_train path: data/khummuang_train-* - split: khummuang_test path: data/khummuang_test-* - split: korat_train path: data/korat_train-* - split: korat_test path: data/korat_test-* - split: pattani_train path: data/pattani_train-* - split: pattani_test path: data/pattani_test-* ---
提供机构:
n-order
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作