
Dataset: Molecular Simulations Assisted by an Artificial Intelligence Agent

Source: DataCite Commons
Updated: 2026-05-04
Indexed: 2026-05-10
Download link:
https://dataverse.nl/citation?persistentId=doi:10.34894/RNPTDS
Description:
This dataset accompanies the paper "Molecular Simulations Assisted by an Artificial Intelligence Agent" (ArIA). It contains the code and full datasets needed to reproduce the results shown in the paper.

Dataset Structure
This set contains three main directories, described below. All scripts require uv (https://docs.astral.sh/uv/getting-started/installation/). All test scripts were tested on our local cluster with an L40S GPU.

App_deployment
This directory is used for deploying the ArIA chatbot. LoRA adapters trained in the `Model_development` directory are transferred here for use within the LangGraph framework.

Make_prompt
This directory contains scripts for generating synthetic prompts from ORCA input files, as well as synthetic reasoning texts (CoT, CoVe, ToT, GoT, and intrinsic reasoning) used for model fine-tuning. The ORCA input files were generated with the method used in this paper: https://doi.org/10.1039/D4DD00366G. Scripts for calculating F1 scores and classifying errors are also included in this directory.

Model_development
This directory is dedicated to LoRA adapter development. It includes ORCA input file execution to ensure runnability, along with validation and feedback agents.

The directories are independent of one another. Enter a directory to load and use the corresponding module.
Provider:
DataverseNL
Created:
2026-02-09