five

Energy Consumption and Runtime Traces of Lightweight LLM Inference

收藏
IEEE2026-04-17 收录
下载链接:
https://ieee-dataport.org/documents/energy-consumption-and-runtime-traces-lightweight-llm-inference
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset contains runtime and energy consumption measurements for multiple lightweight large language models (LLMs) under varying inference configurations. Experiments were conducted on a CPU\u2013GPU hybrid system equipped with an NVIDIA GeForce RTX 3060 (8 GB VRAM) GPU and an Intel i7 CPU. The evaluated models include GPT-2, Falcon-rw-1B, LLaMA-3.2-1B, and DeepSeek-R1-Distill-Qwen-1.5B, along with their quantized variants.Energy consumption was profiled using PyJoules, which integrates with pynvml (for GPU) and Intel RAPL (for CPU) to collect fine-grained hardware-level energy data during inference.Each inference trace corresponds to a unique combination of:Input token length T_in,Output token length T_outQuantization threshold Qn \u2208 {1,2,4,6}Token lengths vary in powers of two (T_in, T_out \u2208 {8,16,32,64,128,256,512}.Each configuration was executed 25 times in randomized order for robustness, yielding 24,500 total traces across all models.The dataset provides a reproducible benchmark for analyzing LLM inference efficiency, scalability, and energy-performance trade-offs under hardware constraints.
提供机构:
Aadhith M
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作