five

SL-AI/GRaPE-Base-Mix

收藏
Hugging Face2025-11-01 更新2026-05-10 收录
下载链接:
https://hf-mirror.com/datasets/SL-AI/GRaPE-Base-Mix
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: apache-2.0 --- # GRaPE Base Mix This dataset provides the initial training for GRaPE, taking a base model and making it conversational, and smart in many fields such as: * Code * Math * Music Theory * Instruction Following * Tool Usage * Long Context There are a max token amount of 137,000 tokens. There are 3M Examples from various datasets, all shuffled to ensure training goes smoothly. ### Datasets Used #### General & Reasoning SFT * **hermes-3**: General supervised fine-tuning * **slimorca**: General SFT with a focus on reasoning * **ultrafeedback**: General SFT derived from binarized feedback #### Code & STEAM * **glaive_code**: Code assistant SFT * **evol_code**: Code instruction SFT * **metamath**: Math reasoning SFT * **music_theory**: Music theory SFT * **music_lyrics**: Music lyrics SFT #### Agentic & Function Calling * **andy_4**: Agentic SFT with function calling for playing Minecraft * **glaive_function_calling**: Function calling SFT * **tool_ace**: Tool-use and agentic SFT * **agentic_long_qa**: Agentic SFT for long-context question answering #### Uncensored & Conditional SFT * **toxic_qa_sft**: Uncensored SFT * **nitral_ai**: Uncensored SFT * **toxic_dpo1**: DPO dataset converted to conditional SFT * **toxic_dpo2**: DPO dataset converted to conditional SFT *(Note that these datasets were used to train the model when it is and when it is not appropriate to be uncensored.)*
提供机构:
SL-AI
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作