Energy Consumption and Runtime Traces of Lightweight LLM Inference

Name: Energy Consumption and Runtime Traces of Lightweight LLM Inference
Creator: Aadhith M
License: 暂无描述

IEEE2026-04-17 收录

下载链接：

https://ieee-dataport.org/documents/energy-consumption-and-runtime-traces-lightweight-llm-inference

下载链接

链接失效反馈

官方服务：

资源简介：

This dataset contains runtime and energy consumption measurements for multiple lightweight large language models (LLMs) under varying inference configurations. Experiments were conducted on a CPU\u2013GPU hybrid system equipped with an NVIDIA GeForce RTX 3060 (8 GB VRAM) GPU and an Intel i7 CPU. The evaluated models include GPT-2, Falcon-rw-1B, LLaMA-3.2-1B, and DeepSeek-R1-Distill-Qwen-1.5B, along with their quantized variants.Energy consumption was profiled using PyJoules, which integrates with pynvml (for GPU) and Intel RAPL (for CPU) to collect fine-grained hardware-level energy data during inference.Each inference trace corresponds to a unique combination of:Input token length T_in,Output token length T_outQuantization threshold Qn \u2208 {1,2,4,6}Token lengths vary in powers of two (T_in, T_out \u2208 {8,16,32,64,128,256,512}.Each configuration was executed 25 times in randomized order for robustness, yielding 24,500 total traces across all models.The dataset provides a reproducible benchmark for analyzing LLM inference efficiency, scalability, and energy-performance trade-offs under hardware constraints.

提供机构：

Aadhith M

5,000+

优质数据集

54 个

任务类型

进入经典数据集