Sumitra Ganesh

Approximate Equivariance in Reinforcement Learning

Nov 06, 2024
Via arXiv

Simulate and Optimise: A two-layer mortgage simulator for designing novel mortgage assistance products

Nov 01, 2024
Via arXiv

Scalable Representation Learning for Multimodal Tabular Transactions

Oct 10, 2024
Via arXiv

GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment

Oct 10, 2024
Via arXiv

Partially Observable Contextual Bandits with Linear Payoffs

Sep 17, 2024
Via arXiv

Learning and Calibrating Heterogeneous Bounded Rational Market Behaviour with Multi-Agent Reinforcement Learning

Feb 01, 2024
Via arXiv

Near-Optimal Fair Resource Allocation for Strategic Agents without Money: A Data-Driven Approach

Nov 18, 2023
Via arXiv

O3D: Offline Data-driven Discovery and Distillation for Sequential Decision-Making with Large Language Models

Oct 22, 2023
Via arXiv

Sequential Fair Resource Allocation under a Markov Decision Process Framework

Jan 10, 2023
Via arXiv

Inapplicable Actions Learning for Knowledge Transfer in Reinforcement Learning

Nov 28, 2022
Via arXiv