five

krishnAbadikelA/massive-unseen-ipa-romanized

收藏
Hugging Face2026-04-22 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/krishnAbadikelA/massive-unseen-ipa-romanized
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: - config_name: ar-SA features: - name: id dtype: string - name: locale dtype: string - name: partition dtype: string - name: scenario dtype: string - name: intent dtype: int64 - name: utt dtype: string - name: annot_utt dtype: string - name: worker_id dtype: string - name: slot_method list: - name: slot dtype: string - name: method dtype: string - name: judgments list: - name: worker_id dtype: string - name: intent_score dtype: int64 - name: slots_score dtype: int64 - name: grammar_score dtype: int64 - name: spelling_score dtype: int64 - name: language_identification dtype: string - name: ipa_stripped dtype: string - name: romanized dtype: string splits: - name: train num_bytes: 4881005 num_examples: 11514 - name: validation num_bytes: 856514 num_examples: 2033 - name: test num_bytes: 1250080 num_examples: 2974 download_size: 1896874 dataset_size: 6987599 - config_name: bn-BD features: - name: id dtype: string - name: locale dtype: string - name: partition dtype: string - name: scenario dtype: string - name: intent dtype: int64 - name: utt dtype: string - name: annot_utt dtype: string - name: worker_id dtype: string - name: slot_method list: - name: slot dtype: string - name: method dtype: string - name: judgments list: - name: worker_id dtype: string - name: intent_score dtype: int64 - name: slots_score dtype: int64 - name: grammar_score dtype: int64 - name: spelling_score dtype: int64 - name: language_identification dtype: string - name: ipa_stripped dtype: string - name: romanized dtype: string splits: - name: train num_bytes: 6250519 num_examples: 11514 - name: validation num_bytes: 1091337 num_examples: 2033 - name: test num_bytes: 1595870 num_examples: 2974 download_size: 2305735 dataset_size: 8937726 - config_name: el-GR features: - name: id dtype: string - name: locale dtype: string - name: partition dtype: string - name: scenario dtype: string - name: intent dtype: int64 - name: utt dtype: string - name: annot_utt dtype: string - name: worker_id dtype: string - name: slot_method list: - name: slot dtype: string - name: method dtype: string - name: judgments list: - name: worker_id dtype: string - name: intent_score dtype: int64 - name: slots_score dtype: int64 - name: grammar_score dtype: int64 - name: spelling_score dtype: int64 - name: language_identification dtype: string - name: ipa_stripped dtype: string - name: romanized dtype: string splits: - name: train num_bytes: 5715487 num_examples: 11514 - name: validation num_bytes: 1001581 num_examples: 2033 - name: test num_bytes: 1451794 num_examples: 2974 download_size: 2255409 dataset_size: 8168862 - config_name: fr-FR features: - name: id dtype: string - name: locale dtype: string - name: partition dtype: string - name: scenario dtype: string - name: intent dtype: int64 - name: utt dtype: string - name: annot_utt dtype: string - name: worker_id dtype: string - name: slot_method list: - name: slot dtype: string - name: method dtype: string - name: judgments list: - name: worker_id dtype: string - name: intent_score dtype: int64 - name: slots_score dtype: int64 - name: grammar_score dtype: int64 - name: spelling_score dtype: int64 - name: language_identification dtype: string - name: ipa_stripped dtype: string - name: romanized dtype: string splits: - name: train num_bytes: 5018560 num_examples: 11514 - name: validation num_bytes: 877753 num_examples: 2033 - name: test num_bytes: 1282387 num_examples: 2974 download_size: 1943438 dataset_size: 7178700 - config_name: zh-CN features: - name: id dtype: string - name: locale dtype: string - name: partition dtype: string - name: scenario dtype: string - name: intent dtype: int64 - name: utt dtype: string - name: annot_utt dtype: string - name: worker_id dtype: string - name: slot_method list: - name: slot dtype: string - name: method dtype: string - name: judgments list: - name: worker_id dtype: string - name: intent_score dtype: int64 - name: slots_score dtype: int64 - name: grammar_score dtype: int64 - name: spelling_score dtype: int64 - name: language_identification dtype: string - name: ipa_stripped dtype: string - name: romanized dtype: string splits: - name: train num_bytes: 4789005 num_examples: 11514 - name: validation num_bytes: 838137 num_examples: 2033 - name: test num_bytes: 1223819 num_examples: 2974 download_size: 1751762 dataset_size: 6850961 configs: - config_name: ar-SA data_files: - split: train path: ar-SA/train-* - split: validation path: ar-SA/validation-* - split: test path: ar-SA/test-* - config_name: bn-BD data_files: - split: train path: bn-BD/train-* - split: validation path: bn-BD/validation-* - split: test path: bn-BD/test-* - config_name: el-GR data_files: - split: train path: el-GR/train-* - split: validation path: el-GR/validation-* - split: test path: el-GR/test-* - config_name: fr-FR data_files: - split: train path: fr-FR/train-* - split: validation path: fr-FR/validation-* - split: test path: fr-FR/test-* - config_name: zh-CN data_files: - split: train path: zh-CN/train-* - split: validation path: zh-CN/validation-* - split: test path: zh-CN/test-* ---
提供机构:
krishnAbadikelA
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作