Khaled Nakhleh

Simulation-Based Optimistic Policy Iteration For Multi-Agent MDPs with Kullback-Leibler Control Cost

Oct 19, 2024

SACPlanner: Real-World Collision Avoidance with a Soft Actor Critic Local Planner and Polar State Representations

Mar 21, 2023

DeepTOP: Deep Threshold-Optimal Policy for MDPs and RMABs

Sep 28, 2022

NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL

Oct 05, 2021