Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Shaochen Han

Generalization in Deep RL for TSP Problems via Equivariance and Local Search

Oct 07, 2021

Wenbin Ouyang, Yisen Wang, Paul Weng, Shaochen Han

Figure 1 for Generalization in Deep RL for TSP Problems via Equivariance and Local Search

Figure 2 for Generalization in Deep RL for TSP Problems via Equivariance and Local Search

Figure 3 for Generalization in Deep RL for TSP Problems via Equivariance and Local Search

Figure 4 for Generalization in Deep RL for TSP Problems via Equivariance and Local Search

Abstract:Deep reinforcement learning (RL) has proved to be a competitive heuristic for solving small-sized instances of traveling salesman problems (TSP), but its performance on larger-sized instances is insufficient. Since training on large instances is impractical, we design a novel deep RL approach with a focus on generalizability. Our proposition consisting of a simple deep learning architecture that learns with novel RL training techniques, exploits two main ideas. First, we exploit equivariance to facilitate training. Second, we interleave efficient local search heuristics with the usual RL training to smooth the value landscape. In order to validate the whole approach, we empirically evaluate our proposition on random and realistic TSP problems against relevant state-of-the-art deep RL methods. Moreover, we present an ablation study to understand the contribution of each of its component

* 18 pages, 2 figures

Via

Access Paper or Ask Questions

Improving Generalization of Deep Reinforcement Learning-based TSP Solvers

Oct 06, 2021

Wenbin Ouyang, Yisen Wang, Shaochen Han, Zhejian Jin, Paul Weng

Figure 1 for Improving Generalization of Deep Reinforcement Learning-based TSP Solvers

Figure 2 for Improving Generalization of Deep Reinforcement Learning-based TSP Solvers

Figure 3 for Improving Generalization of Deep Reinforcement Learning-based TSP Solvers

Figure 4 for Improving Generalization of Deep Reinforcement Learning-based TSP Solvers

Abstract:Recent work applying deep reinforcement learning (DRL) to solve traveling salesman problems (TSP) has shown that DRL-based solvers can be fast and competitive with TSP heuristics for small instances, but do not generalize well to larger instances. In this work, we propose a novel approach named MAGIC that includes a deep learning architecture and a DRL training method. Our architecture, which integrates a multilayer perceptron, a graph neural network, and an attention model, defines a stochastic policy that sequentially generates a TSP solution. Our training method includes several innovations: (1) we interleave DRL policy gradient updates with local search (using a new local search technique), (2) we use a novel simple baseline, and (3) we apply curriculum learning. Finally, we empirically demonstrate that MAGIC is superior to other DRL-based methods on random TSP instances, both in terms of performance and generalizability. Moreover, our method compares favorably against TSP heuristics and other state-of-the-art approach in terms of performance and computational time.

* 8 pages, 2 figures

Via

Access Paper or Ask Questions