Picture for I-Hong Hou

I-Hong Hou

Deep Index Policy for Multi-Resource Restless Matching Bandit and Its Application in Multi-Channel Scheduling

Add code
Aug 13, 2024
Viaarxiv icon

Timely Communications for Remote Inference

Add code
Apr 25, 2024
Viaarxiv icon

Distributed No-Regret Learning for Multi-Stage Systems with End-to-End Bandit Feedback

Add code
Apr 06, 2024
Viaarxiv icon

Contextual Restless Multi-Armed Bandits with Application to Demand Response Decision-Making

Add code
Mar 22, 2024
Viaarxiv icon

DeepTOP: Deep Threshold-Optimal Policy for MDPs and RMABs

Add code
Sep 28, 2022
Figure 1 for DeepTOP: Deep Threshold-Optimal Policy for MDPs and RMABs
Figure 2 for DeepTOP: Deep Threshold-Optimal Policy for MDPs and RMABs
Figure 3 for DeepTOP: Deep Threshold-Optimal Policy for MDPs and RMABs
Figure 4 for DeepTOP: Deep Threshold-Optimal Policy for MDPs and RMABs
Viaarxiv icon

NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL

Add code
Oct 05, 2021
Figure 1 for NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL
Figure 2 for NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL
Figure 3 for NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL
Figure 4 for NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL
Viaarxiv icon