Picture for Longbo Huang

Longbo Huang

Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential Exploration

Add code
Oct 25, 2024
Figure 1 for Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential Exploration
Figure 2 for Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential Exploration
Figure 3 for Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential Exploration
Figure 4 for Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential Exploration
Viaarxiv icon

uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed MABs

Add code
Oct 04, 2024
Viaarxiv icon

Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks

Add code
Oct 03, 2024
Figure 1 for Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks
Figure 2 for Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks
Figure 3 for Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks
Figure 4 for Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks
Viaarxiv icon

Value-Based Deep Multi-Agent Reinforcement Learning with Dynamic Sparse Training

Add code
Sep 28, 2024
Viaarxiv icon

Adversarial Network Optimization under Bandit Feedback: Maximizing Utility in Non-Stationary Multi-Hop Networks

Add code
Aug 29, 2024
Viaarxiv icon

Mixed Sparsity Training: Achieving 4$\times$ FLOP Reduction for Transformer Pretraining

Add code
Aug 21, 2024
Viaarxiv icon

RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning

Add code
Mar 07, 2024
Viaarxiv icon

Provably Efficient Partially Observable Risk-Sensitive Reinforcement Learning with Hindsight Observation

Add code
Feb 28, 2024
Viaarxiv icon

Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation

Add code
Feb 28, 2024
Viaarxiv icon

LCM-LoRA: A Universal Stable-Diffusion Acceleration Module

Add code
Nov 09, 2023
Viaarxiv icon