Picture for Pramod Kaushik

Pramod Kaushik

Multi-State-Action Tokenisation in Decision Transformers for Multi-Discrete Action Spaces

Add code
Jul 01, 2024
Viaarxiv icon

Exploring Value Biases: How LLMs Deviate Towards the Ideal

Add code
Feb 21, 2024
Viaarxiv icon

A Conservative Q-Learning approach for handling distribution shift in sepsis treatment strategies

Add code
Mar 25, 2022
Figure 1 for A Conservative Q-Learning approach for handling distribution shift in sepsis treatment strategies
Figure 2 for A Conservative Q-Learning approach for handling distribution shift in sepsis treatment strategies
Viaarxiv icon