five

kreasof-ai/GLM-Kimi-OpenThoughts-HunterAlpha-Filtered

收藏
Hugging Face2026-04-19 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/kreasof-ai/GLM-Kimi-OpenThoughts-HunterAlpha-Filtered
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: features: - name: id dtype: string - name: conversations list: - name: from dtype: string - name: value dtype: string - name: input dtype: string - name: output dtype: string - name: domain dtype: string - name: meta struct: - name: input_tokens dtype: int64 - name: output_tokens dtype: int64 - name: teacher_model dtype: string splits: - name: train num_bytes: 28585612649.612514 num_examples: 1226331 download_size: 12202401911 dataset_size: 28585612649.612514 configs: - config_name: default data_files: - split: train path: data/train-* --- | domain | Mean_In | P95_In | Mean_Out | P95_Out | Total_Tokens | |----------------------|-----------|----------|------------|-----------|----------------| | General-Distillation | 92.81 | 378 | 2032.89 | 3777 | 784362123 | | General-Math | 51.65 | 72 | 3422 | 4007 | 5578684 | | Math | 58.61 | 85 | 3323.74 | 3991 | 899705 | | Multilingual-STEM | 64.01 | 89 | 3548.07 | 3974 | 159495080 | | MultilingualSTEM | 76.5 | 119 | 3097.35 | 3917 | 147082677 | | PHD-Science | 44.39 | 55 | 3082.65 | 3857 | 532525113 | | code | 404.92 | 1209 | 2554.44 | 3721 | 3536439 | | general | 62.62 | 294 | 1427.94 | 3496 | 301834703 | | main | 96.01 | 390 | 2170.94 | 3776 | 832633275 | | math | 76.88 | 147 | 3313.32 | 3979 | 2590112 | | science | 171.11 | 559 | 2483.47 | 3778 | 60853717 | 💰 TOTAL TOKENS: 2,831,391,628 Source: - https://huggingface.co/datasets/Jackrong/GLM-5.1-Reasoning-1M-Cleaned - https://huggingface.co/datasets/Jackrong/Kimi-K2.5-Reasoning-1M-Cleaned - https://huggingface.co/datasets/open-thoughts/OpenThoughts3-1.2M - https://huggingface.co/datasets/ianncity/Hunter-Alpha-SFT-300000x
提供机构:
kreasof-ai
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作