Picture for Pradeep Varakantham

Pradeep Varakantham

IRL for Restless Multi-Armed Bandits with Applications in Maternal and Child Health

Add code
Dec 11, 2024
Viaarxiv icon

Semantic loss guided data efficient supervised fine tuning for Safe Responses in LLMs

Add code
Dec 07, 2024
Viaarxiv icon

UNIQ: Offline Inverse Q-learning for Avoiding Undesirable Demonstrations

Add code
Oct 10, 2024
Viaarxiv icon

Towards Neural Network based Cognitive Models of Dynamic Decision-Making by Humans

Add code
Jul 24, 2024
Viaarxiv icon

Preserving the Privacy of Reward Functions in MDPs through Deception

Add code
Jul 13, 2024
Viaarxiv icon

Safety through feedback in Constrained RL

Add code
Jun 28, 2024
Viaarxiv icon

EduQate: Generating Adaptive Curricula through RMABs in Education Settings

Add code
Jun 20, 2024
Viaarxiv icon

Unlocking Large Language Model's Planning Capabilities with Maximum Diversity Fine-tuning

Add code
Jun 15, 2024
Viaarxiv icon

Bootstrapping Language Models with DPO Implicit Rewards

Add code
Jun 14, 2024
Viaarxiv icon

Probabilistic Perspectives on Error Minimization in Adversarial Reinforcement Learning

Add code
Jun 07, 2024
Viaarxiv icon