five

gemma3n-slicing-configs

收藏
魔搭社区2025-12-05 更新2025-06-28 收录
下载链接:
https://modelscope.cn/datasets/google/gemma3n-slicing-configs
下载链接
链接失效反馈
官方服务:
资源简介:
This repository contains configurations to slice Gemma 3n E4B, which is enabled thanks to it being a MatFormer. The E4B model can be sliced into small models, trading off quality and latency/compute requirements. We recommend exploring the [MatFormer Lab](TODO: add link) to getting started with slicing Gemma 3n E4B yourself. For each configuration, we calculate the MMLU accuracy. Although these are not the only configurations possible, they are optimal configurations identified by calculating the accuracy of the pre-trained model To learn more about MatFormers, please review the and generate your own submodels with the [MatFormer Lab](TODO: add link). ![alt text](https://storage.googleapis.com/gweb-developer-goog-blog-assets/images/Artboard_1.original.png "This chart show’s MMLU performance vs model size of Gemma 3n Mix-n-Match (pretrained) capability.") This chart show’s MMLU performance vs model size of Gemma 3n Mix-n-Match (pretrained) capability. Some additional resources: * [Gemma 3n launch blog](https://developers.googleblog.com/en/introducing-gemma-3n-developer-guide) * [MatFormer paper](https://huggingface.co/papers/2310.07707)

本仓库包含Gemma 3n E4B的切片配置,该模型凭借其MatFormer架构得以支持切片操作。E4B模型可被切片为多个小型模型,以此在模型质量与延迟/计算资源需求之间进行权衡。我们建议您访问[MatFormer实验室(MatFormer Lab)](TODO: add link),以自行上手开展Gemma 3n E4B的切片工作。 针对每一种配置,我们均计算了其MMLU准确率。尽管此类配置并非全部可行方案,但它们均为通过测算预训练模型准确率所得到的最优配置。 若欲深入了解MatFormer架构,请查阅相关资料,并通过[MatFormer实验室(MatFormer Lab)](TODO: add link)生成自定义子模型。 ![alt text](https://storage.googleapis.com/gweb-developer-goog-blog-assets/images/Artboard_1.original.png "Gemma 3n Mix-n-Match(预训练)模型的MMLU性能与模型尺寸关系图。") 本图表展示了Gemma 3n Mix-n-Match(预训练)模型的MMLU性能与模型尺寸的对应关系。 其他相关资源: * [Gemma 3n发布博客](https://developers.googleblog.com/en/introducing-gemma-3n-developer-guide) * [MatFormer研究论文](https://huggingface.co/papers/2310.07707)
提供机构:
maas
创建时间:
2025-06-27
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作