five

Bidirectional Q-Learning for recycling path planning of used appliances under strong and weak constraints

收藏
ETS-Data2024-09-09 更新2026-02-07 收录
下载链接:
https://doi.org/10.26599/ETSD.2024.9190030
下载链接
链接失效反馈
官方服务:
资源简介:
The Layered Bidirectional Q-Learning (LBQ) algorithm is designed for path planning, tackling the complexities inherent in multilayer path planning during the recycling process. This approach incorporates a bidirectional update mechanism that minimizes the unpredictability associated with initial exploration phases. Additionally, the algorithm employs a hierarchical reinforcement learning strategy, which breaks down intricate tasks into more manageable subtasks. Through the strategic design of reward functions that address various constraints, the LBQ algorithm successfully optimizes paths under multiple conditions.
二维码
社区交流群
二维码
科研交流群
商业服务