Title: Deep Policy Dynamic Programming for Vehicle Routing Problems

Feb 23, 2021

Wouter Kool, Herke van Hoof, Joaquim Gromicho, Max Welling

Abstract: Routing problems are a class of combinatorial problems with many practical applications. Recently, end-to-end deep learning methods have been proposed to learn approximate solution heuristics for such problems. In contrast, classical dynamic programming (DP) algorithms can find optimal solutions, but scale poorly with problem size. We propose Deep Policy Dynamic Programming (DPDP), which aims to combine the strengths of learned neural heuristics with those of DP algorithms. DPDP prioritizes and restricts the DP state space using a policy derived from a deep neural network, which is trained to predict edges from example solutions. We evaluate our framework on the travelling salesman problem (TSP) and the vehicle routing problem (VRP) and show that the neural policy improves the performance of (restricted) DP algorithms, making them competitive with strong alternatives such as LKH, while also outperforming other 'neural approaches' for solving TSPs and VRPs with 100 nodes.

* 12 pages, 7 figures
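
To make the idea of a restricted DP concrete, here is a minimal Python sketch for the TSP. It is an illustration under stated assumptions, not the authors' implementation: the function name `restricted_dp_tsp`, the `beam_size` parameter, and the additive scoring rule are all hypothetical, and the `heat` matrix is a simple inverse-distance stand-in for the edge predictions that the abstract says come from a trained neural network. Each DP state is a pair (set of visited nodes, current node); for every state only the cheapest partial tour is kept, and after each step only the `beam_size` highest-scoring states survive.

```python
import math


def restricted_dp_tsp(dist, heat, beam_size):
    """Beam-restricted DP for the TSP, starting and ending at node 0.

    dist[i][j]: travel cost; heat[i][j]: edge score (assumed here to come
    from some policy; in this sketch it is NOT a neural network output).
    """
    n = len(dist)
    # State key: (bitmask of visited nodes, current node).
    # State value: (cost so far, accumulated heat score, path).
    beam = {(1, 0): (0.0, 0.0, [0])}
    for _ in range(n - 1):
        expanded = {}
        for (mask, cur), (cost, score, path) in beam.items():
            for nxt in range(n):
                if mask & (1 << nxt):
                    continue  # node already visited
                key = (mask | (1 << nxt), nxt)
                cand = (cost + dist[cur][nxt], score + heat[cur][nxt], path + [nxt])
                # DP dominance: keep only the cheapest way to reach each state.
                if key not in expanded or cand[0] < expanded[key][0]:
                    expanded[key] = cand
        # Restriction: keep the beam_size states the policy scores highest.
        ranked = sorted(expanded.items(), key=lambda kv: -kv[1][1])
        beam = dict(ranked[:beam_size])
    # Close the tour by returning to node 0 and pick the cheapest result.
    (_, last), (cost, _, path) = min(
        beam.items(), key=lambda kv: kv[1][0] + dist[kv[0][1]][0]
    )
    return cost + dist[last][0], path


# Toy instance: the four corners of a unit square.
pts = [(0, 0), (0, 1), (1, 1), (1, 0)]
dist = [[math.dist(p, q) for q in pts] for p in pts]
# Hypothetical stand-in for a learned heatmap: shorter edges score higher.
heat = [[1.0 / (1.0 + d) for d in row] for row in dist]

exact_cost, exact_path = restricted_dp_tsp(dist, heat, beam_size=10**6)
greedy_cost, greedy_path = restricted_dp_tsp(dist, heat, beam_size=1)
```

With a beam large enough to hold every state, the sketch reduces to the exact Held-Karp style DP; shrinking `beam_size` trades optimality guarantees for speed, which is where the quality of the scoring policy starts to matter.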

View paper on OpenReview
