five

anonneuripsuser/stem-gym-benchmark

收藏
Hugging Face2026-04-02 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/anonneuripsuser/stem-gym-benchmark
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: cc-by-4.0 task_categories: - reinforcement-learning tags: - stem-microscopy - electron-microscopy - materials-science - gymnasium - benchmark - dose-efficiency - autonomous-microscopy pretty_name: STEMGym Benchmark size_categories: - 1K<n<10K --- # STEMGym: A Gymnasium Environment for Benchmarking Dose-Efficient Autonomous Scanning Transmission Electron Microscopy A Gymnasium-based benchmark environment for evaluating autonomous dose-efficient scanning transmission electron microscopy (STEM) agents. ## Dataset Description STEMGym provides simulated STEM specimens as HDF5 world files. Each world contains: - **Overview image**: Low-magnification survey of the full specimen - **Tile grid**: High-resolution STEM-HAADF images (128x128 px tiles, 4px overlap, stride=124) arranged in an 8x8 grid - **Ground truth annotations**: Atom positions, defect types, and phase maps for scoring - **Metadata**: Pixel size, accelerating voltage, detector geometry, material parameters ### Materials | Material | Zone Axis | FOV | Defect Types | Task | |----------|-----------|-----|-------------|------| | SrTiO3 | [001] | ~100 nm | O vacancies clustered along grain boundary | Defect Census | | BaTiO3 | [001] | ~100 nm | Cubic/tetragonal phase boundaries + O vacancies | Phase Mapping + Defect Census | | SiGe | [110] | ~50 nm | Ge substitutions concentrated in one quadrant | Targeted Characterization | | GaN | [11-20] | ~80 nm | InGaN quantum-well substitutions | Defect Census | | Pt nanoparticles | — | ~60 nm | Pt nanoparticles on amorphous carbon | Particle Census | ### Difficulty Levels Each material is provided at three difficulty levels: | Level | Vacancy Rate | Phonon Configs | Dose (e-/A^2) | |-------|-------------|----------------|---------------| | Easy | 5% | 4 | 1e4 | | Medium | 3% | 8 | 5e3 | | Hard | 1% | 16 | 1e3 | ## Files ### Simulated Worlds (`worlds/`) | File | Material | Difficulty | Size | |------|----------|-----------|------| | `test_world.h5` | Synthetic (Gaussian blobs) | — | 9.7 MB | | `replay_world.h5` | Synthetic (replay validation) | — | 2.5 MB | | `srtio3_clustered_easy.h5` | SrTiO3 | Easy | 884 MB | | `srtio3_clustered_medium.h5` | SrTiO3 | Medium | 884 MB | | `srtio3_clustered_hard.h5` | SrTiO3 | Hard | 885 MB | | `batio3_interface_easy.h5` | BaTiO3 | Easy | 911 MB | | `batio3_interface_medium.h5` | BaTiO3 | Medium | 911 MB | | `batio3_interface_hard.h5` | BaTiO3 | Hard | 911 MB | | `sige_gradient_clustered_easy.h5` | SiGe | Easy | 150 MB | | `sige_gradient_clustered_medium.h5` | SiGe | Medium | 148 MB | | `sige_gradient_clustered_hard.h5` | SiGe | Hard | 147 MB | | `gan_easy.h5` | GaN | Easy | 451 MB | | `gan_medium.h5` | GaN | Medium | 449 MB | | `gan_hard.h5` | GaN | Hard | 448 MB | | `pt_nanoparticles_easy.h5` | Pt | Easy | 129 MB | | `pt_nanoparticles_medium.h5` | Pt | Medium | 129 MB | | `pt_nanoparticles_hard.h5` | Pt | Hard | 129 MB | ### Model Checkpoints (`checkpoints/`) | File | Model | Description | Size | |------|-------|-------------|------| | `atom_finder.pt` | AtomFinderUNet | Atomic column detection ensemble (3 members) | 88 MB | | `defect_classifier.pt` | DefectClassifierCNN | Defect type classification | 1.4 MB | | `phase_identifier.pt` | PhaseIdentifierResNet | Material phase identification | 7.4 MB | | `dqn_agent.zip` | DQN (SB3) | RL navigation baseline | 0.8 MB | | `ppo_agent.zip` | PPO (SB3) | RL navigation baseline | 1.0 MB | | `sac_agent.zip` | SAC (SB3) | RL navigation baseline | 8.4 MB | ### Transfer Checkpoints (`checkpoints/transfer/`) Material-specific analyst models trained on individual materials for transfer experiments: | Directory | Contents | |-----------|----------| | `transfer/srtio3/` | atom_finder.pt, defect_classifier.pt, phase_identifier.pt, training_metadata.json | | `transfer/batio3/` | atom_finder.pt, defect_classifier.pt, phase_identifier.pt, training_metadata.json | | `transfer/sige/` | atom_finder.pt, defect_classifier.pt, phase_identifier.pt, training_metadata.json | | `transfer/gan/` | atom_finder.pt, defect_classifier.pt, phase_identifier.pt, training_metadata.json | ## HDF5 World Format Each world file follows this layout: ``` /metadata/ pixel_size_nm # Physical pixel size tile_size_px # Tile dimensions (128) grid_shape # Grid dimensions (rows, cols) fov_nm # Field of view in nm scenario # Material/scenario name difficulty # easy / medium / hard /overview # (H, W) low-res overview image /tiles/{row}_{col} # (128, 128) float32 normalized HAADF tiles /ground_truth/ atom_positions # (N, 2) in nm atom_types # (N,) int [0=pristine, 1=vacancy, 2=substitution] defect_mask # (N,) bool defect_types # (N,) str ["pristine"/"vacancy"/"substitution"] phase_map # (H, W) int32, optional /valid_region # (H, W) bool ``` ## Usage ```bash pip install -e . python stem_gym/scripts/download_data.py stemgym run --agent raster_equipped --task defect_census --world srtio3_clustered_easy --seeds 3 ``` ## License This dataset is released under the [CC-BY-4.0](https://creativecommons.org/licenses/by/4.0/) license. ## Citation ```bibtex @inproceedings{anonymous2026stemgym, title={STEMGym: A Gymnasium Environment for Benchmarking Dose-Efficient Autonomous Scanning Transmission Electron Microscopy}, author={Anonymous}, year={2026} } ```
提供机构:
anonneuripsuser
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作