NanoMatriX/smoltalk2-20k
收藏Hugging Face2026-04-08 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/NanoMatriX/smoltalk2-20k
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: smoltalk_smollm3_everyday_conversations_no_think
features:
- name: messages
list:
- name: content
dtype: string
- name: role
dtype: string
- name: chat_template_kwargs
struct:
- name: custom_instructions
dtype: string
- name: enable_thinking
dtype: bool
- name: python_tools
list: 'null'
- name: xml_tools
list: 'null'
- name: source
dtype: string
splits:
- name: train
num_bytes: 1558617
num_examples: 1800
- name: test
num_bytes: 173179
num_examples: 200
download_size: 1635177
dataset_size: 1731796
- config_name: smoltalk_smollm3_smol_magpie_ultra_no_think
features:
- name: messages
list:
- name: content
dtype: string
- name: role
dtype: string
- name: chat_template_kwargs
struct:
- name: custom_instructions
dtype: string
- name: enable_thinking
dtype: bool
- name: python_tools
list: 'null'
- name: xml_tools
list: 'null'
- name: source
dtype: string
splits:
- name: train
num_bytes: 37584123
num_examples: 5400
- name: test
num_bytes: 4176013
num_examples: 600
download_size: 41554845
dataset_size: 41760136
- config_name: smoltalk_smollm3_smol_summarize_no_think
features:
- name: messages
list:
- name: content
dtype: string
- name: role
dtype: string
- name: chat_template_kwargs
struct:
- name: custom_instructions
dtype: string
- name: enable_thinking
dtype: bool
- name: python_tools
list: 'null'
- name: xml_tools
list: 'null'
- name: source
dtype: string
splits:
- name: train
num_bytes: 4285494
num_examples: 1800
- name: test
num_bytes: 476166
num_examples: 200
download_size: 4686337
dataset_size: 4761660
- config_name: smoltalk_smollm3_systemchats_10k_no_think
features:
- name: messages
list:
- name: content
dtype: string
- name: role
dtype: string
- name: chat_template_kwargs
struct:
- name: custom_instructions
dtype: string
- name: enable_thinking
dtype: bool
- name: python_tools
list: 'null'
- name: xml_tools
list: 'null'
- name: source
dtype: string
splits:
- name: train
num_bytes: 23694867
num_examples: 9000
- name: test
num_bytes: 2632763
num_examples: 1000
download_size: 25898246
dataset_size: 26327630
configs:
- config_name: smoltalk_smollm3_everyday_conversations_no_think
data_files:
- split: train
path: smoltalk_smollm3_everyday_conversations_no_think/train-*
- split: test
path: smoltalk_smollm3_everyday_conversations_no_think/test-*
- config_name: smoltalk_smollm3_smol_magpie_ultra_no_think
data_files:
- split: train
path: smoltalk_smollm3_smol_magpie_ultra_no_think/train-*
- split: test
path: smoltalk_smollm3_smol_magpie_ultra_no_think/test-*
- config_name: smoltalk_smollm3_smol_summarize_no_think
data_files:
- split: train
path: smoltalk_smollm3_smol_summarize_no_think/train-*
- split: test
path: smoltalk_smollm3_smol_summarize_no_think/test-*
- config_name: smoltalk_smollm3_systemchats_10k_no_think
data_files:
- split: train
path: smoltalk_smollm3_systemchats_10k_no_think/train-*
- split: test
path: smoltalk_smollm3_systemchats_10k_no_think/test-*
---
提供机构:
NanoMatriX



