five

ByteDance-Seed/AInsteinBench

收藏
Hugging Face2026-01-28 更新2026-02-07 收录
下载链接:
https://hf-mirror.com/datasets/ByteDance-Seed/AInsteinBench
下载链接
链接失效反馈
官方服务:
资源简介:
AInsteinBench是一个用于评估AI代理在解决科学计算问题方面能力的基准测试。它目前支持Einstein Toolkit和Multi-SWE-bench格式的编码问题。数据集包含244个来自多个科学仓库的科学计算任务,这些任务已经过执行验证,并由相应的领域专家审核,以确保软件工程和科学内容的准确性。任务涵盖数值相对论、量子信息、分子动力学、化学信息学和量子化学等领域。

AInsteinBench is a benchmark for evaluating the capabilities of AI agents in solving scientific computing problems. It currently supports Einstein Toolkit and Multi-SWE-bench formats of coding questions. The dataset provides 244 scientific computing tasks derived from multiple scientific repositories. These tasks have been verified on execution and also reviewed by corresponding domain experts to verify both software engineering and scientific content accuracy. The tasks cover numerical relativity, quantum information, molecular dynamics, cheminformatics and quantum chemistry.
提供机构:
ByteDance-Seed
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作