Rhombus18/53M-Token-Instruction-Code-QA-Dataset
收藏Hugging Face2025-08-25 更新2025-11-29 收录
下载链接:
https://hf-mirror.com/datasets/Rhombus18/53M-Token-Instruction-Code-QA-Dataset
下载链接
链接失效反馈官方服务:
资源简介:
Water演示数据集是一个为10M参数的“Water”模型(Brahma架构)训练而设计的高质量、多样化的指令微调数据集,适用于指令跟随、推理、代码、对话和多语言任务,旨在为小型LLM提供稳健的泛化。
The Water Demonstrator Dataset is a high-quality, diverse instruction-tuned dataset designed for training the 10M parameter Water model (Brahma architecture) for instruction-following, reasoning, code, dialogue, and multilingual tasks, aiming to provide robust generalization for small LLMs.
提供机构:
Rhombus18



