The data of the article "Combining Deep Reinforcement Learning with Heuristics for Solving the Traveling Salesman Problem"
收藏DataCite Commons2025-04-27 更新2025-05-18 收录
下载链接:
https://www.scidb.cn/detail?dataSetId=43a1f08c812b4da2a3cc258fab65bd61
下载链接
链接失效反馈官方服务:
资源简介:
We have developed a new framework for learning improvement heuristics, which automatically discovers better improvement policies for heuristics to iteratively solve the TSP. Our framework proposes a novel approach (named RL-SA), which combines deep RL with SA algorithm, to improve 2-opt heuristic algorithm’s performance in learning a pairwise selected policy of next promising solutions. The novelty of the RL-SA approach also lies in that it leverages the Whale Optimization Algorithm (WOA) to generate an initial solution for better sampling efficiency, and that it adopts the Gaussian perturbation strategy for perturbing the state to tackle the sparse reward problem of the RL algorithm. Our program achieved excellent performance in solving the Traveling Salesman Problem.
提供机构:
Science Data Bank
创建时间:
2024-11-19



