Flexible Trajectory Planning with the Dyna Reinforcement Learning Architecture Under Computation Constraints
收藏DataCite Commons2026-01-18 更新2026-05-03 收录
下载链接:
http://dataverse.jpl.nasa.gov/citation?persistentId=doi:10.48577/jpl.8NWX5E
下载链接
链接失效反馈官方服务:
资源简介:
Spacecraft autonomy is increasingly critical for missions involving long communication delays, spacecraft fleets, or fault-prone actuators. Prior work demonstrated that Dyna-style reinforcement learning (RL) architectures, which combine model-based planning with model-free updates, can adapt to actuator degradation and changing dynamics. However, these implementations assumed fixed computational availability, limiting robustness under dynamic processor loads. This work extends the prior Dyna framework by incorporating time- and processor-aware planning, dynamic batching of critic updates, and improved activation functions to mitigate neuron inactivation. Hardware experiments on a planar air-bearing platform and large-scale simulations demonstrate that the modified architecture reduces control cost, lowers failure probability under thruster degradation, and decouples performance from processor speed. These results highlight the importance of integrating computational awareness into RL-based controllers and provide a practical methodology for improving the resilience and adaptability of spacecraft guidance, navigation, and control systems.
提供机构:
Root
创建时间:
2026-01-18



