ByteDance-Seed/AInsteinBench
收藏Hugging Face2026-01-28 更新2026-02-07 收录
下载链接:
https://hf-mirror.com/datasets/ByteDance-Seed/AInsteinBench
下载链接
链接失效反馈官方服务:
资源简介:
AInsteinBench是一个用于评估AI代理在解决科学计算问题方面能力的基准测试。它目前支持Einstein Toolkit和Multi-SWE-bench格式的编码问题。数据集包含244个来自多个科学仓库的科学计算任务,这些任务已经过执行验证,并由相应的领域专家审核,以确保软件工程和科学内容的准确性。任务涵盖数值相对论、量子信息、分子动力学、化学信息学和量子化学等领域。
AInsteinBench is a benchmark for evaluating the capabilities of AI agents in solving scientific computing problems. It currently supports Einstein Toolkit and Multi-SWE-bench formats of coding questions. The dataset provides 244 scientific computing tasks derived from multiple scientific repositories. These tasks have been verified on execution and also reviewed by corresponding domain experts to verify both software engineering and scientific content accuracy. The tasks cover numerical relativity, quantum information, molecular dynamics, cheminformatics and quantum chemistry.
提供机构:
ByteDance-Seed



