Rahul Madhavan

SWEPO: Simultaneous Weighted Preference Optimization for Group Contrastive Alignment

Dec 05, 2024
Via arXiv

Time-Reversal Provides Unsupervised Feedback to LLMs

Dec 03, 2024
Via arXiv

Causal Contextual Bandits with Adaptive Context

May 28, 2024
Via arXiv

Causal ATE Mitigates Unintended Bias in Controlled Text Generation

Nov 19, 2023
Via arXiv

CFL: Causally Fair Language Models Through Token-level Attribute Controlled Generation

Jun 01, 2023
Via arXiv

Learning Good Interventions in Causal Graphs via Covering

May 08, 2023
Via arXiv

Intervention Efficient Algorithm for Two-Stage Causal MDPs

Nov 01, 2021
Via arXiv

Scale Invariant Solutions for Overdetermined Linear Systems with Applications to Reinforcement Learning

Apr 15, 2021
Via arXiv