Picture for Kshitija Taywade

Kshitija Taywade

Personalizing Task-oriented Dialog Systems via Zero-shot Generalizable Reward Function

Add code
Mar 24, 2023
Viaarxiv icon

Using Non-Stationary Bandits for Learning in Repeated Cournot Games with Non-Stationary Demand

Add code
Jan 03, 2022
Figure 1 for Using Non-Stationary Bandits for Learning in Repeated Cournot Games with Non-Stationary Demand
Figure 2 for Using Non-Stationary Bandits for Learning in Repeated Cournot Games with Non-Stationary Demand
Figure 3 for Using Non-Stationary Bandits for Learning in Repeated Cournot Games with Non-Stationary Demand
Figure 4 for Using Non-Stationary Bandits for Learning in Repeated Cournot Games with Non-Stationary Demand
Viaarxiv icon

Modelling Cournot Games as Multi-agent Multi-armed Bandits

Add code
Jan 01, 2022
Figure 1 for Modelling Cournot Games as Multi-agent Multi-armed Bandits
Figure 2 for Modelling Cournot Games as Multi-agent Multi-armed Bandits
Figure 3 for Modelling Cournot Games as Multi-agent Multi-armed Bandits
Figure 4 for Modelling Cournot Games as Multi-agent Multi-armed Bandits
Viaarxiv icon

Reinforcement Learning for Decentralized Stable Matching

Add code
May 03, 2020
Figure 1 for Reinforcement Learning for Decentralized Stable Matching
Figure 2 for Reinforcement Learning for Decentralized Stable Matching
Viaarxiv icon