
AMaze: fully discrete training with three regimes (direct, scaffolding, interactive) and two algorithms (A2C, PPO)

Indexed by NIAID Data Ecosystem on 2026-05-02
Download link:
https://zenodo.org/record/10622913
Description:
Dataset containing all training artifacts (final models, training curves, intermediate visualizations, ...) as well as the raw data used to assess generalization capabilities. The associated archive final_behavior.tar.gz provides a visualization of every replicate's final behavior for easier navigation. Distribution files contain a sampling across 1000 seeds, 5 probabilities for traps and lures, 4 sizes and 5 set sizes, resulting in 486356 mazes. Descriptive graphs provide an overview of the accessible "maze space".

v2: Added a script to aggregate run dynamics (mean reward, errors, maze lengths, ...) together with the resulting generated dataset (csv) and plots (pdf).
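The final_behavior.tar.gz archive can be inspected without unpacking it. The following is a minimal sketch, assuming the archive has already been downloaded from the record linked above into the working directory. The helper name list_archive_members and its suffix filter are hypothetical, and the internal layout of the archive is not documented here, so the sketch only lists whatever file names it finds.

```python
import tarfile
from pathlib import Path


def list_archive_members(archive_path, suffix=None):
    """Return the sorted names of regular files in a .tar.gz archive.

    `suffix` is a hypothetical convenience filter (e.g. ".png") that keeps
    only the names ending with it; None keeps every file.
    """
    with tarfile.open(archive_path, mode="r:gz") as archive:
        names = [m.name for m in archive.getmembers() if m.isfile()]
    if suffix is not None:
        names = [n for n in names if n.endswith(suffix)]
    return sorted(names)


if __name__ == "__main__":
    # Assumed local path: the archive must be downloaded manually first.
    path = Path("final_behavior.tar.gz")
    if path.exists():
        for name in list_archive_members(path)[:20]:
            print(name)
```

Listing the members first makes it possible to extract only the replicates of interest rather than the whole archive.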
Created:
2025-02-19