
NanoMatriX/smoltalk2-20k

Hugging Face | Updated 2026-04-08 | Indexed 2026-04-12
Download link:
https://hf-mirror.com/datasets/NanoMatriX/smoltalk2-20k
Description:
---
dataset_info:
- config_name: smoltalk_smollm3_everyday_conversations_no_think
  features:
  - name: messages
    list:
    - name: content
      dtype: string
    - name: role
      dtype: string
  - name: chat_template_kwargs
    struct:
    - name: custom_instructions
      dtype: string
    - name: enable_thinking
      dtype: bool
    - name: python_tools
      list: 'null'
    - name: xml_tools
      list: 'null'
  - name: source
    dtype: string
  splits:
  - name: train
    num_bytes: 1558617
    num_examples: 1800
  - name: test
    num_bytes: 173179
    num_examples: 200
  download_size: 1635177
  dataset_size: 1731796
- config_name: smoltalk_smollm3_smol_magpie_ultra_no_think
  features:
  - name: messages
    list:
    - name: content
      dtype: string
    - name: role
      dtype: string
  - name: chat_template_kwargs
    struct:
    - name: custom_instructions
      dtype: string
    - name: enable_thinking
      dtype: bool
    - name: python_tools
      list: 'null'
    - name: xml_tools
      list: 'null'
  - name: source
    dtype: string
  splits:
  - name: train
    num_bytes: 37584123
    num_examples: 5400
  - name: test
    num_bytes: 4176013
    num_examples: 600
  download_size: 41554845
  dataset_size: 41760136
- config_name: smoltalk_smollm3_smol_summarize_no_think
  features:
  - name: messages
    list:
    - name: content
      dtype: string
    - name: role
      dtype: string
  - name: chat_template_kwargs
    struct:
    - name: custom_instructions
      dtype: string
    - name: enable_thinking
      dtype: bool
    - name: python_tools
      list: 'null'
    - name: xml_tools
      list: 'null'
  - name: source
    dtype: string
  splits:
  - name: train
    num_bytes: 4285494
    num_examples: 1800
  - name: test
    num_bytes: 476166
    num_examples: 200
  download_size: 4686337
  dataset_size: 4761660
- config_name: smoltalk_smollm3_systemchats_10k_no_think
  features:
  - name: messages
    list:
    - name: content
      dtype: string
    - name: role
      dtype: string
  - name: chat_template_kwargs
    struct:
    - name: custom_instructions
      dtype: string
    - name: enable_thinking
      dtype: bool
    - name: python_tools
      list: 'null'
    - name: xml_tools
      list: 'null'
  - name: source
    dtype: string
  splits:
  - name: train
    num_bytes: 23694867
    num_examples: 9000
  - name: test
    num_bytes: 2632763
    num_examples: 1000
  download_size: 25898246
  dataset_size: 26327630
configs:
- config_name: smoltalk_smollm3_everyday_conversations_no_think
  data_files:
  - split: train
    path: smoltalk_smollm3_everyday_conversations_no_think/train-*
  - split: test
    path: smoltalk_smollm3_everyday_conversations_no_think/test-*
- config_name: smoltalk_smollm3_smol_magpie_ultra_no_think
  data_files:
  - split: train
    path: smoltalk_smollm3_smol_magpie_ultra_no_think/train-*
  - split: test
    path: smoltalk_smollm3_smol_magpie_ultra_no_think/test-*
- config_name: smoltalk_smollm3_smol_summarize_no_think
  data_files:
  - split: train
    path: smoltalk_smollm3_smol_summarize_no_think/train-*
  - split: test
    path: smoltalk_smollm3_smol_summarize_no_think/test-*
- config_name: smoltalk_smollm3_systemchats_10k_no_think
  data_files:
  - split: train
    path: smoltalk_smollm3_systemchats_10k_no_think/train-*
  - split: test
    path: smoltalk_smollm3_systemchats_10k_no_think/test-*
---
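
The split sizes in the metadata can be tallied to see where the "20k" in the dataset name plausibly comes from, and to check the train/test ratio of each config. The following is a minimal sketch, not part of the original card: the helper names `total_examples`, `train_fraction`, and `load_config` are hypothetical, the repo id is assumed from the page title, and `load_config` assumes the Hugging Face `datasets` library is installed and the Hub is reachable (it is defined here but not called).

```python
from typing import Dict

# Repo id assumed from the page title.
REPO_ID = "NanoMatriX/smoltalk2-20k"

# num_examples per split, copied from the dataset_info metadata.
SPLITS: Dict[str, Dict[str, int]] = {
    "smoltalk_smollm3_everyday_conversations_no_think": {"train": 1800, "test": 200},
    "smoltalk_smollm3_smol_magpie_ultra_no_think": {"train": 5400, "test": 600},
    "smoltalk_smollm3_smol_summarize_no_think": {"train": 1800, "test": 200},
    "smoltalk_smollm3_systemchats_10k_no_think": {"train": 9000, "test": 1000},
}


def total_examples(splits: Dict[str, Dict[str, int]]) -> int:
    """Sum num_examples over every config and split."""
    return sum(sum(counts.values()) for counts in splits.values())


def train_fraction(counts: Dict[str, int]) -> float:
    """Share of one config's examples that fall in the train split."""
    return counts["train"] / (counts["train"] + counts["test"])


def load_config(config_name: str, split: str = "train"):
    """Load one config from the Hub (needs `datasets` and network access)."""
    from datasets import load_dataset  # imported lazily so the tally runs offline

    return load_dataset(REPO_ID, config_name, split=split)


if __name__ == "__main__":
    print("total examples:", total_examples(SPLITS))
    for name, counts in SPLITS.items():
        print(f"{name}: train fraction = {train_fraction(counts):.2f}")
```

Under these numbers the four configs add up to 20,000 examples, and each config is split 90% train / 10% test. A call such as `load_config("smoltalk_smollm3_smol_summarize_no_think")` would be one way to fetch a single config, assuming the repo is publicly accessible.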
Provider:
NanoMatriX