Packrat: Automatic Reconfiguration for Latency Minimization in CPU-based DNN Serving
收藏DataCite Commons2025-01-02 更新2025-04-16 收录
下载链接:
https://service.tib.eu/ldmservice/dataset/fc4baa33-bc6a-4e87-9c87-fa39a6d1c70d
下载链接
链接失效反馈官方服务:
资源简介:
Packrat is a serving system for online inference that automatically determines the number of threads that need to be allocated to model instances to minimize inference latency.
提供机构:
TIB
创建时间:
2025-01-02



