
dgrx-systems/Spectra-Mamba-KAN-Hybrid-Language-Model

Source: Hugging Face · Updated 2026-03-24 · Indexed 2026-03-29
Download link:
https://hf-mirror.com/datasets/dgrx-systems/Spectra-Mamba-KAN-Hybrid-Language-Model
Resource description:
---
license: mit
tags:
- architecture
- language-model
- mamba
- state-space-model
- KAN
- kolmogorov-arnold-networks
- research
- nlp
pretty_name: Spectra Architecture Reference
---

# Spectra: A Mamba-KAN Hybrid Language Model

A complete architectural, mathematical, and implementation reference for **Spectra**, a hybrid language model that fuses **Selective State Space Models (Mamba)** with **Kolmogorov-Arnold Networks (KANs)** in a parallel block design with learnable mixing weights.

**Key properties:**

- 🚀 **O(L) complexity**: eliminates the quadratic attention memory wall
- 🧠 **Constant inference memory**: SSM state replaces the growing KV cache
- 🔍 **Interpretable**: every KAN edge is a plottable univariate B-spline
- 📐 **339M parameters** at base configuration (d_model=768, 24 layers)

---

![Page 1](images/page_01.png)
![Page 2](images/page_02.png)
![Page 3](images/page_03.png)
![Page 4](images/page_04.png)
![Page 5](images/page_05.png)
![Page 6](images/page_06.png)
![Page 7](images/page_07.png)
![Page 8](images/page_08.png)
![Page 9](images/page_09.png)
![Page 10](images/page_10.png)
![Page 11](images/page_11.png)
![Page 12](images/page_12.png)
![Page 13](images/page_13.png)
![Page 14](images/page_14.png)
![Page 15](images/page_15.png)

---

## Files

- [`Spectra_Architecture_fixed.pdf`](Spectra_Architecture_fixed.pdf): full downloadable document

## Citation

```
@misc{spectra2024,
  title = {Spectra: A Mamba-KAN Hybrid Language Model},
  year  = {2024},
  note  = {Architecture reference document, MIT License}
}
```
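As a rough illustration of the "parallel block design with learnable mixing weights" mentioned in the description, the sketch below runs a state-space branch and a KAN-style branch side by side and blends them with softmax-normalized weights. It is a toy under stated assumptions and is not taken from the Spectra reference document: the function names (`ssm_scan`, `edge_fn`, `parallel_block`), the softmax mixing, the non-selective diagonal recurrence standing in for Mamba, and the single shared piecewise-linear (degree-1 B-spline) function standing in for a full KAN layer are all hypothetical simplifications.

```python
import numpy as np

def softmax(z):
    # Numerically stable softmax over a 1-D array of logits.
    z = z - np.max(z)
    e = np.exp(z)
    return e / e.sum()

def ssm_scan(x, a, b, c):
    # Assumed stand-in for the SSM branch: a diagonal linear recurrence
    #   h_t = a * h_{t-1} + b * x_t,   y_t = c * h_t
    # The state h has size d regardless of sequence length L, so one pass
    # costs O(L) time and the carried memory does not grow with L.
    L, d = x.shape
    h = np.zeros(d)
    y = np.empty((L, d))
    for t in range(L):
        h = a * h + b * x[t]
        y[t] = c * h
    return y

def edge_fn(x, grid, coef):
    # Assumed stand-in for one KAN edge: a univariate piecewise-linear
    # function (a degree-1 B-spline expansion equals linear interpolation
    # through the points (grid[i], coef[i])). Plotting coef against grid
    # shows the learned function directly.
    return np.interp(x, grid, coef)

def parallel_block(x, params):
    # Both branches see the same input; two mixing logits (hypothetical
    # parameter name) are normalized and used to blend the outputs.
    w = softmax(params["mix_logits"])
    y_ssm = ssm_scan(x, params["a"], params["b"], params["c"])
    y_kan = edge_fn(x, params["grid"], params["coef"])
    return w[0] * y_ssm + w[1] * y_kan

# Usage: a length-4 sequence with 2 channels and equal mixing weights.
params = {
    "mix_logits": np.array([0.0, 0.0]),
    "a": 0.5, "b": 1.0, "c": 1.0,
    "grid": np.array([-2.0, 2.0]),
    "coef": np.array([-2.0, 2.0]),  # identity function on [-2, 2]
}
out = parallel_block(np.ones((4, 2)), params)
```

A real KAN layer sums a separate learned spline per input-output pair, and Mamba makes the recurrence parameters depend on the input; both are omitted here to keep the mixing step visible.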
Provider:
dgrx-systems