Benchmark dataset for safety assessment under blast loading
收藏Mendeley Data2026-04-09 收录
下载链接:
https://data.mendeley.com/datasets/j53z3y7bx8/1
下载链接
链接失效反馈官方服务:
资源简介:
This dataset provides a benchmark comprising 50 curated assessment tasks designed to evaluate the performance of large language model (LLM)-based multi-agent frameworks. Each item represents an independent task with both simple and complex task descriptions, along with corresponding ground-truth references.



