five

peakji/peak-intent-50

收藏
Hugging Face2024-05-09 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/peakji/peak-intent-50
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: - config_name: ar features: - name: id dtype: int32 - name: query dtype: string - name: query_type dtype: string - name: query_language dtype: string - name: intent dtype: string - name: intent_type dtype: string splits: - name: train num_bytes: 1775678 num_examples: 17471 download_size: 414610 dataset_size: 1775678 - config_name: de features: - name: id dtype: int32 - name: query dtype: string - name: query_type dtype: string - name: query_language dtype: string - name: intent dtype: string - name: intent_type dtype: string splits: - name: train num_bytes: 1667010 num_examples: 19591 download_size: 463107 dataset_size: 1667010 - config_name: en features: - name: id dtype: int32 - name: query dtype: string - name: query_type dtype: string - name: query_language dtype: string - name: intent dtype: string - name: intent_type dtype: string splits: - name: train num_bytes: 3299902 num_examples: 38662 download_size: 893987 dataset_size: 3299902 - config_name: es features: - name: id dtype: int32 - name: query dtype: string - name: query_type dtype: string - name: query_language dtype: string - name: intent dtype: string - name: intent_type dtype: string splits: - name: train num_bytes: 1711650 num_examples: 19425 download_size: 449973 dataset_size: 1711650 - config_name: fr features: - name: id dtype: int32 - name: query dtype: string - name: query_type dtype: string - name: query_language dtype: string - name: intent dtype: string - name: intent_type dtype: string splits: - name: train num_bytes: 1711869 num_examples: 19580 download_size: 458000 dataset_size: 1711869 - config_name: it features: - name: id dtype: int32 - name: query dtype: string - name: query_type dtype: string - name: query_language dtype: string - name: intent dtype: string - name: intent_type dtype: string splits: - name: train num_bytes: 1638303 num_examples: 19009 download_size: 439461 dataset_size: 1638303 - config_name: ja features: - name: id dtype: int32 - name: query dtype: string - name: query_type dtype: string - name: query_language dtype: string - name: intent dtype: string - name: intent_type dtype: string splits: - name: train num_bytes: 1825461 num_examples: 18958 download_size: 475972 dataset_size: 1825461 - config_name: ko features: - name: id dtype: int32 - name: query dtype: string - name: query_type dtype: string - name: query_language dtype: string - name: intent dtype: string - name: intent_type dtype: string splits: - name: train num_bytes: 1717027 num_examples: 18456 download_size: 443155 dataset_size: 1717027 - config_name: pt-br features: - name: id dtype: int32 - name: query dtype: string - name: query_type dtype: string - name: query_language dtype: string - name: intent dtype: string - name: intent_type dtype: string splits: - name: train num_bytes: 1781721 num_examples: 19731 download_size: 453370 dataset_size: 1781721 - config_name: zh-hans features: - name: id dtype: int32 - name: query dtype: string - name: query_type dtype: string - name: query_language dtype: string - name: intent dtype: string - name: intent_type dtype: string splits: - name: train num_bytes: 3263857 num_examples: 37592 download_size: 751766 dataset_size: 3263857 - config_name: zh-hant features: - name: id dtype: int32 - name: query dtype: string - name: query_type dtype: string - name: query_language dtype: string - name: intent dtype: string - name: intent_type dtype: string splits: - name: train num_bytes: 3182320 num_examples: 36404 download_size: 744351 dataset_size: 3182320 configs: - config_name: ar data_files: - split: train path: ar/train-* - config_name: de data_files: - split: train path: de/train-* - config_name: en data_files: - split: train path: en/train-* - config_name: es data_files: - split: train path: es/train-* - config_name: fr data_files: - split: train path: fr/train-* - config_name: it data_files: - split: train path: it/train-* - config_name: ja data_files: - split: train path: ja/train-* - config_name: ko data_files: - split: train path: ko/train-* - config_name: pt-br data_files: - split: train path: pt-br/train-* - config_name: zh-hans data_files: - split: train path: zh-hans/train-* - config_name: zh-hant data_files: - split: train path: zh-hant/train-* ---
提供机构:
peakji
原始信息汇总

数据集概述

数据集配置及特征

配置名称 特征名称 数据类型
ar id int32
ar query string
ar query_type string
ar query_language string
ar intent string
ar intent_type string
de id int32
de query string
de query_type string
de query_language string
de intent string
de intent_type string
en id int32
en query string
en query_type string
en query_language string
en intent string
en intent_type string
es id int32
es query string
es query_type string
es query_language string
es intent string
es intent_type string
fr id int32
fr query string
fr query_type string
fr query_language string
fr intent string
fr intent_type string
it id int32
it query string
it query_type string
it query_language string
it intent string
it intent_type string
ja id int32
ja query string
ja query_type string
ja query_language string
ja intent string
ja intent_type string
ko id int32
ko query string
ko query_type string
ko query_language string
ko intent string
ko intent_type string
pt-br id int32
pt-br query string
pt-br query_type string
pt-br query_language string
pt-br intent string
pt-br intent_type string
zh-hans id int32
zh-hans query string
zh-hans query_type string
zh-hans query_language string
zh-hans intent string
zh-hans intent_type string
zh-hant id int32
zh-hant query string
zh-hant query_type string
zh-hant query_language string
zh-hant intent string
zh-hant intent_type string

数据集大小及下载信息

配置名称 训练集大小(字节) 训练集示例数 下载大小(字节)
ar 1775678 17471 414610
de 1667010 19591 463107
en 3299902 38662 893987
es 1711650 19425 449973
fr 1711869 19580 458000
it 1638303 19009 439461
ja 1825461 18958 475972
ko 1717027 18456 443155
pt-br 1781721 19731 453370
zh-hans 3263857 37592 751766
zh-hant 3182320 36404 744351
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作