Picture for Jiashuo Jiang

Jiashuo Jiang

Cross-Epoch Adaptive Rollout Optimization for RL Post-Training

Add code
Jun 04, 2026
Viaarxiv icon

Resource-Constrained Adaptive Inference for Sequential Pricing

Add code
Jun 02, 2026
Viaarxiv icon

The Value of Information in Resource-Constrained Pricing

Add code
Mar 26, 2026
Viaarxiv icon

Online Semi-infinite Linear Programming: Efficient Algorithms via Function Approximation

Add code
Mar 17, 2026
Viaarxiv icon

Ask, Clarify, Optimize: Human-LLM Agent Collaboration for Smarter Inventory Control

Add code
Dec 31, 2025
Viaarxiv icon

A Lyapunov Drift-Plus-Penalty Method Tailored for Reinforcement Learning with Queue Stability

Add code
Jun 04, 2025
Viaarxiv icon

Adaptive Resolving Methods for Reinforcement Learning with Function Approximations

Add code
May 17, 2025
Figure 1 for Adaptive Resolving Methods for Reinforcement Learning with Function Approximations
Figure 2 for Adaptive Resolving Methods for Reinforcement Learning with Function Approximations
Figure 3 for Adaptive Resolving Methods for Reinforcement Learning with Function Approximations
Viaarxiv icon

Adaptive Bidding Policies for First-Price Auctions with Budget Constraints under Non-stationarity

Add code
May 05, 2025
Viaarxiv icon

Efficiently Solving Discounted MDPs with Predictions on Transition Matrices

Add code
Feb 21, 2025
Figure 1 for Efficiently Solving Discounted MDPs with Predictions on Transition Matrices
Figure 2 for Efficiently Solving Discounted MDPs with Predictions on Transition Matrices
Figure 3 for Efficiently Solving Discounted MDPs with Predictions on Transition Matrices
Viaarxiv icon

Online Scheduling for LLM Inference with KV Cache Constraints

Add code
Feb 10, 2025
Figure 1 for Online Scheduling for LLM Inference with KV Cache Constraints
Figure 2 for Online Scheduling for LLM Inference with KV Cache Constraints
Figure 3 for Online Scheduling for LLM Inference with KV Cache Constraints
Figure 4 for Online Scheduling for LLM Inference with KV Cache Constraints
Viaarxiv icon