five

Automatic compilation mechanism for dynamic horizontal fusion on accelerators

收藏
中国科学数据2026-03-25 更新2026-04-25 收录
下载链接:
https://www.sciengine.com/AA/doi/10.1360/SSI-2025-0283
下载链接
链接失效反馈
官方服务:
资源简介:
Accelerators such as graphics processing units (GPUs) have been widely adopted to accelerate compute-intensive tasks like deep learning, owing to their powerful parallel processing capabilities. To fully exploit the hardware potential of GPUs, system-level optimization techniques are crucial, among which kernel fusion has become a widely used approach in mainstream frameworks. Horizontal fusion enables parallel scheduling of multiple independent kernels, thereby improving resource utilization. However, existing horizontal fusion methods are typically designed for static computation graphs and struggle to support tasks with dynamic branching structures—such as mixture-of-experts (MoE) models—where inputs are dynamically routed to different sub-networks at runtime, making it difficult to predefine fused kernels.To address this challenge, we propose Fluxer an automated horizontal fusion technique tailored for dynamic branches. Fluxer
创建时间:
2025-10-15
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作