Picture for Aditya Mahajan

Aditya Mahajan

Sub-optimality bounds for certainty equivalent policies in partially observed systems

Add code
Feb 02, 2026
Viaarxiv icon

Task load dependent decision referrals for joint binary classification in human-automation teams

Add code
Apr 05, 2025
Viaarxiv icon

A Theoretical Justification for Asymmetric Actor-Critic Algorithms

Add code
Jan 31, 2025
Viaarxiv icon

Concentration of Cumulative Reward in Markov Decision Processes

Add code
Nov 27, 2024
Viaarxiv icon

Periodic agent-state based Q-learning for POMDPs

Add code
Jul 08, 2024
Figure 1 for Periodic agent-state based Q-learning for POMDPs
Figure 2 for Periodic agent-state based Q-learning for POMDPs
Figure 3 for Periodic agent-state based Q-learning for POMDPs
Figure 4 for Periodic agent-state based Q-learning for POMDPs
Viaarxiv icon

Model approximation in MDPs with unbounded per-step cost

Add code
Feb 13, 2024
Figure 1 for Model approximation in MDPs with unbounded per-step cost
Figure 2 for Model approximation in MDPs with unbounded per-step cost
Figure 3 for Model approximation in MDPs with unbounded per-step cost
Figure 4 for Model approximation in MDPs with unbounded per-step cost
Viaarxiv icon

Bridging State and History Representations: Understanding Self-Predictive RL

Add code
Jan 17, 2024
Figure 1 for Bridging State and History Representations: Understanding Self-Predictive RL
Figure 2 for Bridging State and History Representations: Understanding Self-Predictive RL
Figure 3 for Bridging State and History Representations: Understanding Self-Predictive RL
Figure 4 for Bridging State and History Representations: Understanding Self-Predictive RL
Viaarxiv icon

Approximate information state based convergence analysis of recurrent Q-learning

Add code
Jun 09, 2023
Viaarxiv icon

Dealing With Non-stationarity in Decentralized Cooperative Multi-Agent Deep Reinforcement Learning via Multi-Timescale Learning

Add code
Feb 06, 2023
Figure 1 for Dealing With Non-stationarity in Decentralized Cooperative Multi-Agent Deep Reinforcement Learning via Multi-Timescale Learning
Figure 2 for Dealing With Non-stationarity in Decentralized Cooperative Multi-Agent Deep Reinforcement Learning via Multi-Timescale Learning
Figure 3 for Dealing With Non-stationarity in Decentralized Cooperative Multi-Agent Deep Reinforcement Learning via Multi-Timescale Learning
Figure 4 for Dealing With Non-stationarity in Decentralized Cooperative Multi-Agent Deep Reinforcement Learning via Multi-Timescale Learning
Viaarxiv icon

On learning history based policies for controlling Markov decision processes

Add code
Nov 06, 2022
Viaarxiv icon