Picture for Yingru Li

Yingru Li

Logit Dynamics in Softmax Policy Gradient Methods

Add code
Jun 15, 2025
Viaarxiv icon

OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Add code
May 29, 2025
Viaarxiv icon

Divergence-Augmented Policy Optimization

Add code
Jan 25, 2025
Viaarxiv icon

Adaptive Foundation Models for Online Decisions: HyperAgent with Fast Incremental Uncertainty Estimation

Add code
Jul 18, 2024
Viaarxiv icon

Prior-dependent analysis of posterior sampling reinforcement learning with function approximation

Add code
Mar 17, 2024
Viaarxiv icon

Radar Anti-jamming Strategy Learning via Domain-knowledge Enhanced Online Convex Optimization

Add code
Feb 29, 2024
Viaarxiv icon

Probability Tools for Sequential Random Projection

Add code
Feb 27, 2024
Viaarxiv icon

Optimistic Thompson Sampling for No-Regret Learning in Unknown Games

Add code
Feb 25, 2024
Viaarxiv icon

Simple, unified analysis of Johnson-Lindenstrauss with applications

Add code
Feb 21, 2024
Viaarxiv icon

HyperAgent: A Simple, Scalable, Efficient and Provable Reinforcement Learning Framework for Complex Environments

Add code
Feb 05, 2024
Viaarxiv icon