five

"Power-Flow Benchmark for LLM-based Power System Agent Evaluation (PFBench)"

收藏
DataCite Commons2026-03-18 更新2026-05-03 收录
下载链接:
https://ieee-dataport.org/documents/power-flow-benchmark-llm-based-power-system-agent-evaluation-pfbench
下载链接
链接失效反馈
官方服务:
资源简介:
"PFBench is a reproducible benchmark dataset for power-flow reasoning, structured output generation, and tool-using power-system AI. This release packages frozen scenario records and benchmark question items derived from standard transmission test cases under deterministic perturbations. Each scenario record stores the base-grid reference, mutation specification, full post-mutation input state, AC and DC solver outputs, provenance, and explicit data-quality flags that preserve inherited source-case artifacts rather than silently normalizing them away. Each question item references a parent scenario and includes a prompt, response schema, solver-derived gold answer, and programmatic grading rule. The release is accompanied by schema validation, integrity metadata, archived configuration files, and external pandapower cross-validation, supporting transparent reuse, archival deposition, and reproducible evaluation of power-system agents and structured reasoning systems."
提供机构:
IEEE DataPort
创建时间:
2026-03-18
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作