five

AutoFEnergy -- Synthetic Data Used to Teach LLM Feature Engineering for Energy

收藏
IEEE2026-04-17 收录
下载链接:
https://ieee-dataport.org/documents/autofenergy-synthetic-data-used-teach-llm-feature-engineering-energy
下载链接
链接失效反馈
官方服务:
资源简介:
The synthetic AutoFEnergy dataset is the first publicly available dataset designed to train LLMs with engineer-level feature-engineering tool-calling capabilities in the energy domain, capabilities that are not inherently present in general-purpose LLMs.This work is submitted as a dataset entry to the ``Good Datasets for AI Model Training in the Power and Energy Domain'' competition, with the goal of facilitating research understanding and reproducibility.The AutoFEnergy dataset aims to advance feature-engineering tasks across a wide range of forecasting and classification scenarios in the energy domain toward LLM-based end-to-end automation, thereby reducing reliance on experienced engineers and significantly lowering both human labor costs and time expenditure.All data generation, fine-tuning, and validation procedures are implemented through reproducible, open-source Python workflows, and are accompanied by accuracy evaluations to verify the effectiveness of the dataset.
提供机构:
Pingyang Sun; Zihang Qiu
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作