Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Wouter van Heeswijk

Machine Learning Predictions for Traffic Equilibria in Road Renovation Scheduling

Jun 06, 2025

Robbert Bosch, Wouter van Heeswijk, Patricia Rogetzer, Martijn Mes

Abstract:Accurately estimating the impact of road maintenance schedules on traffic conditions is important because maintenance operations can substantially worsen congestion if not carefully planned. Reliable estimates allow planners to avoid excessive delays during periods of roadwork. Since the exact increase in congestion is difficult to predict analytically, traffic simulations are commonly used to assess the redistribution of the flow of traffic. However, when applied to long-term maintenance planning involving many overlapping projects and scheduling alternatives, these simulations must be run thousands of times, resulting in a significant computational burden. This paper investigates the use of machine learning-based surrogate models to predict network-wide congestion caused by simultaneous road renovations. We frame the problem as a supervised learning task, using one-hot encodings, engineered traffic features, and heuristic approximations. A range of linear, ensemble-based, probabilistic, and neural regression models is evaluated under an online learning framework in which data progressively becomes available. The experimental results show that the Costliest Subset Heuristic provides a reasonable approximation when limited training data is available, and that most regression models fail to outperform it, with the exception of XGBoost, which achieves substantially better accuracy. In overall performance, XGBoost significantly outperforms alternatives in a range of metrics, most strikingly Mean Absolute Percentage Error (MAPE) and Pinball loss, where it achieves a MAPE of 11% and outperforms the next-best model by 20% and 38% respectively. This modeling approach has the potential to reduce the computational burden of large-scale traffic assignment problems in maintenance planning.

* 15 pages, 2 figures, submitted as conference paper to ICCL 2025

Via

Access Paper or Ask Questions

The Stochastic Dynamic Post-Disaster Inventory Allocation Problem with Trucks and UAVs

Nov 30, 2023

Robert van Steenbergen, Wouter van Heeswijk, Martijn Mes

Figure 1 for The Stochastic Dynamic Post-Disaster Inventory Allocation Problem with Trucks and UAVs

Figure 2 for The Stochastic Dynamic Post-Disaster Inventory Allocation Problem with Trucks and UAVs

Figure 3 for The Stochastic Dynamic Post-Disaster Inventory Allocation Problem with Trucks and UAVs

Figure 4 for The Stochastic Dynamic Post-Disaster Inventory Allocation Problem with Trucks and UAVs

Abstract:Humanitarian logistics operations face increasing difficulties due to rising demands for aid in disaster areas. This paper investigates the dynamic allocation of scarce relief supplies across multiple affected districts over time. It introduces a novel stochastic dynamic post-disaster inventory allocation problem with trucks and unmanned aerial vehicles delivering relief goods under uncertain supply and demand. The relevance of this humanitarian logistics problem lies in the importance of considering the inter-temporal social impact of deliveries. We achieve this by incorporating deprivation costs when allocating scarce supplies. Furthermore, we consider the inherent uncertainties of disaster areas and the potential use of cargo UAVs to enhance operational efficiency. This study proposes two anticipatory solution methods based on approximate dynamic programming, specifically decomposed linear value function approximation and neural network value function approximation to effectively manage uncertainties in the dynamic allocation process. We compare DL-VFA and NN-VFA with various state-of-the-art methods (exact re-optimization, PPO) and results show a 6-8% improvement compared to the best benchmarks. NN-VFA provides the best performance and captures nonlinearities in the problem, whereas DL-VFA shows excellent scalability against a minor performance loss. The experiments reveal that consideration of deprivation costs results in improved allocation of scarce supplies both across affected districts and over time. Finally, results show that deploying UAVs can play a crucial role in the allocation of relief goods, especially in the first stages after a disaster. The use of UAVs reduces transportation- and deprivation costs together by 16-20% and reduces maximum deprivation times by 19-40%, while maintaining similar levels of demand coverage, showcasing efficient and effective operations.

Via

Access Paper or Ask Questions

Handling Large Discrete Action Spaces via Dynamic Neighborhood Construction

May 31, 2023

Fabian Akkerman, Julius Luy, Wouter van Heeswijk, Maximilian Schiffer

Abstract:Large discrete action spaces remain a central challenge for reinforcement learning methods. Such spaces are encountered in many real-world applications, e.g., recommender systems, multi-step planning, and inventory replenishment. The mapping of continuous proxies to discrete actions is a promising paradigm for handling large discrete action spaces. Existing continuous-to-discrete mapping approaches involve searching for discrete neighboring actions in a static pre-defined neighborhood, which requires discrete neighbor lookups across the entire action space. Hence, scalability issues persist. To mitigate this drawback, we propose a novel Dynamic Neighborhood Construction (DNC) method, which dynamically constructs a discrete neighborhood to map the continuous proxy, thus efficiently exploiting the underlying action space. We demonstrate the robustness of our method by benchmarking it against three state-of-the-art approaches designed for large discrete action spaces across three different environments. Our results show that DNC matches or outperforms state-of-the-art approaches while being more computationally efficient. Furthermore, our method scales to action spaces that so far remained computationally intractable for existing methodologies.

Via

Access Paper or Ask Questions

Strategic bidding in freight transport using deep reinforcement learning

Feb 18, 2021

Wouter van Heeswijk

Figure 1 for Strategic bidding in freight transport using deep reinforcement learning

Figure 2 for Strategic bidding in freight transport using deep reinforcement learning

Figure 3 for Strategic bidding in freight transport using deep reinforcement learning

Figure 4 for Strategic bidding in freight transport using deep reinforcement learning

Abstract:This paper presents a multi-agent reinforcement learning algorithm to represent strategic bidding behavior in freight transport markets. Using this algorithm, we investigate whether feasible market equilibriums arise without any central control or communication between agents. Studying behavior in such environments may serve as a stepping stone towards self-organizing logistics systems like the Physical Internet. We model an agent-based environment in which a shipper and a carrier actively learn bidding strategies using policy gradient methods, posing bid- and ask prices at the individual container level. Both agents aim to learn the best response given the expected behavior of the opposing agent. A neutral broker allocates jobs based on bid-ask spreads. Our game-theoretical analysis and numerical experiments focus on behavioral insights. To evaluate system performance, we measure adherence to Nash equilibria, fairness of reward division and utilization of transport capacity. We observe good performance both in predictable, deterministic settings (~95% adherence to Nash equilibria) and highly stochastic environments (~85% adherence). Risk-seeking behavior may increase an agent's reward share, as long as the strategies are not overly aggressive. The results suggest a potential for full automation and decentralization of freight transport markets.

Via

Access Paper or Ask Questions

Smart Containers With Bidding Capacity: A Policy Gradient Algorithm for Semi-Cooperative Learning

May 01, 2020

Wouter van Heeswijk

Figure 1 for Smart Containers With Bidding Capacity: A Policy Gradient Algorithm for Semi-Cooperative Learning

Figure 2 for Smart Containers With Bidding Capacity: A Policy Gradient Algorithm for Semi-Cooperative Learning

Figure 3 for Smart Containers With Bidding Capacity: A Policy Gradient Algorithm for Semi-Cooperative Learning

Figure 4 for Smart Containers With Bidding Capacity: A Policy Gradient Algorithm for Semi-Cooperative Learning

Abstract:Smart modular freight containers -- as propagated in the Physical Internet paradigm -- are equipped with sensors, data storage capability and intelligence that enable them to route themselves from origin to destination without manual intervention or central governance. In this self-organizing setting, containers can autonomously place bids on transport services in a spot market setting. However, for individual containers it may be difficult to learn good bidding policies due to limited observations. By sharing information and costs between one another, smart containers can jointly learn bidding policies, even though simultaneously competing for the same transport capacity. We replicate this behavior by learning stochastic bidding policies in a semi-cooperative multi agent setting. To this end, we develop a reinforcement learning algorithm based on the policy gradient framework. Numerical experiments show that sharing solely bids and acceptance decisions leads to stable bidding policies. Additional system information only marginally improves performance; individual job properties suffice to place appropriate bids. Furthermore, we find that carriers may have incentives not to share information with the smart containers. The experiments give rise to several directions for follow-up research, in particular the interaction between smart containers and transport services in self-organizing logistics.

* 15 pages

Via

Access Paper or Ask Questions

Approximate Dynamic Programming with Neural Networks in Linear Discrete Action Spaces

Feb 26, 2019

Wouter van Heeswijk, Han La Poutré

Figure 1 for Approximate Dynamic Programming with Neural Networks in Linear Discrete Action Spaces

Figure 2 for Approximate Dynamic Programming with Neural Networks in Linear Discrete Action Spaces

Figure 3 for Approximate Dynamic Programming with Neural Networks in Linear Discrete Action Spaces

Figure 4 for Approximate Dynamic Programming with Neural Networks in Linear Discrete Action Spaces

Abstract:Real-world problems of operations research are typically high-dimensional and combinatorial. Linear programs are generally used to formulate and efficiently solve these large decision problems. However, in multi-period decision problems, we must often compute expected downstream values corresponding to current decisions. When applying stochastic methods to approximate these values, linear programs become restrictive for designing value function approximations (VFAs). In particular, the manual design of a polynomial VFA is challenging. This paper presents an integrated approach for complex optimization problems, focusing on applications in the domain of operations research. It develops a hybrid solution method that combines linear programming and neural networks as part of approximate dynamic programming. Our proposed solution method embeds neural network VFAs into linear decision problems, combining the nonlinear expressive power of neural networks with the efficiency of solving linear programs. As a proof of concept, we perform numerical experiments on a transportation problem. The neural network VFAs consistently outperform polynomial VFAs, with limited design and tuning effort.

Via

Access Paper or Ask Questions