Picture for Raymond Zhang

Raymond Zhang

Thompson Sampling For Combinatorial Bandits: Polynomial Regret and Mismatched Sampling Paradox

Add code
Oct 07, 2024
Viaarxiv icon

ACPO: AI-Enabled Compiler-Driven Program Optimization

Add code
Dec 15, 2023
Figure 1 for ACPO: AI-Enabled Compiler-Driven Program Optimization
Figure 2 for ACPO: AI-Enabled Compiler-Driven Program Optimization
Figure 3 for ACPO: AI-Enabled Compiler-Driven Program Optimization
Figure 4 for ACPO: AI-Enabled Compiler-Driven Program Optimization
Viaarxiv icon

Reinforcement Learning for Traffic Signal Control: Comparison with Commercial Systems

Add code
Apr 30, 2021
Figure 1 for Reinforcement Learning for Traffic Signal Control: Comparison with Commercial Systems
Figure 2 for Reinforcement Learning for Traffic Signal Control: Comparison with Commercial Systems
Figure 3 for Reinforcement Learning for Traffic Signal Control: Comparison with Commercial Systems
Figure 4 for Reinforcement Learning for Traffic Signal Control: Comparison with Commercial Systems
Viaarxiv icon

On the Suboptimality of Thompson Sampling in High Dimensions

Add code
Feb 10, 2021
Figure 1 for On the Suboptimality of Thompson Sampling in High Dimensions
Figure 2 for On the Suboptimality of Thompson Sampling in High Dimensions
Figure 3 for On the Suboptimality of Thompson Sampling in High Dimensions
Figure 4 for On the Suboptimality of Thompson Sampling in High Dimensions
Viaarxiv icon