Picture for Udari Madhushani

Udari Madhushani

AI Risk Management Should Incorporate Both Safety and Security

Add code
May 29, 2024
Viaarxiv icon

O3D: Offline Data-driven Discovery and Distillation for Sequential Decision-Making with Large Language Models

Add code
Oct 22, 2023
Figure 1 for O3D: Offline Data-driven Discovery and Distillation for Sequential Decision-Making with Large Language Models
Figure 2 for O3D: Offline Data-driven Discovery and Distillation for Sequential Decision-Making with Large Language Models
Figure 3 for O3D: Offline Data-driven Discovery and Distillation for Sequential Decision-Making with Large Language Models
Figure 4 for O3D: Offline Data-driven Discovery and Distillation for Sequential Decision-Making with Large Language Models
Viaarxiv icon

Heterogeneous Social Value Orientation Leads to Meaningful Diversity in Sequential Social Dilemmas

Add code
May 01, 2023
Figure 1 for Heterogeneous Social Value Orientation Leads to Meaningful Diversity in Sequential Social Dilemmas
Figure 2 for Heterogeneous Social Value Orientation Leads to Meaningful Diversity in Sequential Social Dilemmas
Figure 3 for Heterogeneous Social Value Orientation Leads to Meaningful Diversity in Sequential Social Dilemmas
Figure 4 for Heterogeneous Social Value Orientation Leads to Meaningful Diversity in Sequential Social Dilemmas
Viaarxiv icon

Melting Pot 2.0

Add code
Dec 13, 2022
Viaarxiv icon

A Regret Minimization Approach to Multi-Agent Control

Add code
Feb 01, 2022
Figure 1 for A Regret Minimization Approach to Multi-Agent Control
Figure 2 for A Regret Minimization Approach to Multi-Agent Control
Viaarxiv icon

One More Step Towards Reality: Cooperative Bandits with Imperfect Communication

Add code
Nov 24, 2021
Figure 1 for One More Step Towards Reality: Cooperative Bandits with Imperfect Communication
Figure 2 for One More Step Towards Reality: Cooperative Bandits with Imperfect Communication
Figure 3 for One More Step Towards Reality: Cooperative Bandits with Imperfect Communication
Figure 4 for One More Step Towards Reality: Cooperative Bandits with Imperfect Communication
Viaarxiv icon

Provably Efficient Multi-Agent Reinforcement Learning with Fully Decentralized Communication

Add code
Oct 14, 2021
Figure 1 for Provably Efficient Multi-Agent Reinforcement Learning with Fully Decentralized Communication
Figure 2 for Provably Efficient Multi-Agent Reinforcement Learning with Fully Decentralized Communication
Viaarxiv icon

When to Call Your Neighbor? Strategic Communication in Cooperative Stochastic Bandits

Add code
Oct 08, 2021
Figure 1 for When to Call Your Neighbor? Strategic Communication in Cooperative Stochastic Bandits
Figure 2 for When to Call Your Neighbor? Strategic Communication in Cooperative Stochastic Bandits
Figure 3 for When to Call Your Neighbor? Strategic Communication in Cooperative Stochastic Bandits
Viaarxiv icon

Hamiltonian Q-Learning: Leveraging Importance-sampling for Data Efficient RL

Add code
Dec 06, 2020
Figure 1 for Hamiltonian Q-Learning: Leveraging Importance-sampling for Data Efficient RL
Figure 2 for Hamiltonian Q-Learning: Leveraging Importance-sampling for Data Efficient RL
Figure 3 for Hamiltonian Q-Learning: Leveraging Importance-sampling for Data Efficient RL
Figure 4 for Hamiltonian Q-Learning: Leveraging Importance-sampling for Data Efficient RL
Viaarxiv icon

Distributed Bandits: Probabilistic Communication on $d$-regular Graphs

Add code
Nov 16, 2020
Figure 1 for Distributed Bandits: Probabilistic Communication on $d$-regular Graphs
Figure 2 for Distributed Bandits: Probabilistic Communication on $d$-regular Graphs
Figure 3 for Distributed Bandits: Probabilistic Communication on $d$-regular Graphs
Figure 4 for Distributed Bandits: Probabilistic Communication on $d$-regular Graphs
Viaarxiv icon