kreasof-ai/GLM-Kimi-OpenThoughts-HunterAlpha-Filtered
Hugging Face · Updated 2026-04-19 · Indexed 2026-04-26
Download link:
https://hf-mirror.com/datasets/kreasof-ai/GLM-Kimi-OpenThoughts-HunterAlpha-Filtered
Resource description:
---
dataset_info:
  features:
  - name: id
    dtype: string
  - name: conversations
    list:
    - name: from
      dtype: string
    - name: value
      dtype: string
  - name: input
    dtype: string
  - name: output
    dtype: string
  - name: domain
    dtype: string
  - name: meta
    struct:
    - name: input_tokens
      dtype: int64
    - name: output_tokens
      dtype: int64
    - name: teacher_model
      dtype: string
  splits:
  - name: train
    num_bytes: 28585612649.612514
    num_examples: 1226331
  download_size: 12202401911
  dataset_size: 28585612649.612514
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
---
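
The metadata above describes a single `train` split of 1,226,331 examples with a download size of roughly 12.2 GB, so streaming a few records is a cheap way to inspect the schema before committing to a full download. The following is a minimal sketch, assuming the Hugging Face `datasets` library is installed and the repository is publicly readable. The helper names `summarize` and `preview` are hypothetical and not part of the dataset; the field names are taken from `dataset_info`.

```python
from itertools import islice


def summarize(record):
    """Return a compact view of one record, using field names from dataset_info."""
    meta = record.get("meta") or {}
    return {
        "id": record["id"],
        "domain": record["domain"],
        # Each conversation turn is a dict with "from" and "value" keys.
        "turns": len(record["conversations"]),
        "speakers": [turn["from"] for turn in record["conversations"]],
        "input_tokens": meta.get("input_tokens"),
        "output_tokens": meta.get("output_tokens"),
    }


def preview(n=3):
    """Stream the first n training records without downloading every shard."""
    # Imported lazily so summarize() stays usable without the library installed.
    from datasets import load_dataset

    ds = load_dataset(
        "kreasof-ai/GLM-Kimi-OpenThoughts-HunterAlpha-Filtered",
        split="train",
        streaming=True,  # iterate over the remote files instead of caching them all
    )
    return [summarize(record) for record in islice(ds, n)]
```

Calling `preview(3)` requires network access and returns three summary dictionaries. The values that appear under `speakers` depend on the role labels used in the data, which the card does not document.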
| domain | Mean_In | P95_In | Mean_Out | P95_Out | Total_Tokens |
|----------------------|-----------|----------|------------|-----------|----------------|
| General-Distillation | 92.81 | 378 | 2032.89 | 3777 | 784362123 |
| General-Math | 51.65 | 72 | 3422 | 4007 | 5578684 |
| Math | 58.61 | 85 | 3323.74 | 3991 | 899705 |
| Multilingual-STEM | 64.01 | 89 | 3548.07 | 3974 | 159495080 |
| MultilingualSTEM | 76.5 | 119 | 3097.35 | 3917 | 147082677 |
| PHD-Science | 44.39 | 55 | 3082.65 | 3857 | 532525113 |
| code | 404.92 | 1209 | 2554.44 | 3721 | 3536439 |
| general | 62.62 | 294 | 1427.94 | 3496 | 301834703 |
| main | 96.01 | 390 | 2170.94 | 3776 | 832633275 |
| math | 76.88 | 147 | 3313.32 | 3979 | 2590112 |
| science | 171.11 | 559 | 2483.47 | 3778 | 60853717 |
💰 TOTAL TOKENS: 2,831,391,628
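
As a quick consistency check, the stated total can be reproduced by summing the `Total_Tokens` column of the table above. This sketch only re-adds the published figures; it does not recount tokens from the data. The names `TOTAL_TOKENS`, `grand_total`, and `share` are illustrative.

```python
# Total_Tokens per domain label, copied from the table above.
TOTAL_TOKENS = {
    "General-Distillation": 784_362_123,
    "General-Math": 5_578_684,
    "Math": 899_705,
    "Multilingual-STEM": 159_495_080,
    "MultilingualSTEM": 147_082_677,
    "PHD-Science": 532_525_113,
    "code": 3_536_439,
    "general": 301_834_703,
    "main": 832_633_275,
    "math": 2_590_112,
    "science": 60_853_717,
}

grand_total = sum(TOTAL_TOKENS.values())


def share(domain):
    """Fraction of all tokens contributed by one domain label."""
    return TOTAL_TOKENS[domain] / grand_total


print(f"{grand_total:,}")  # 2,831,391,628

# List domain labels from largest to smallest token share.
for name in sorted(TOTAL_TOKENS, key=TOTAL_TOKENS.get, reverse=True):
    print(f"{name:<22}{share(name):7.2%}")
```

The labels are kept exactly as they appear in the `domain` column, so near-duplicates such as `Math` and `math`, or `Multilingual-STEM` and `MultilingualSTEM`, are counted as separate groups.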
Source:
- https://huggingface.co/datasets/Jackrong/GLM-5.1-Reasoning-1M-Cleaned
- https://huggingface.co/datasets/Jackrong/Kimi-K2.5-Reasoning-1M-Cleaned
- https://huggingface.co/datasets/open-thoughts/OpenThoughts3-1.2M
- https://huggingface.co/datasets/ianncity/Hunter-Alpha-SFT-300000x
Provider:
kreasof-ai



