Picture for Zhaolin Gao

Zhaolin Gao

End-to-end Training for Recommendation with Language-based User Profiles

Add code
Oct 24, 2024
Viaarxiv icon

Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF

Add code
Oct 06, 2024
Figure 1 for Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
Figure 2 for Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
Figure 3 for Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
Figure 4 for Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
Viaarxiv icon

REBEL: Reinforcement Learning via Regressing Relative Rewards

Add code
Apr 25, 2024
Viaarxiv icon

Reviewer2: Optimizing Review Generation Through Prompt Generation

Add code
Feb 16, 2024
Viaarxiv icon

Shoestring: Graph-Based Semi-Supervised Learning with Severely Limited Labeled Data

Add code
Oct 28, 2019
Figure 1 for Shoestring: Graph-Based Semi-Supervised Learning with Severely Limited Labeled Data
Figure 2 for Shoestring: Graph-Based Semi-Supervised Learning with Severely Limited Labeled Data
Figure 3 for Shoestring: Graph-Based Semi-Supervised Learning with Severely Limited Labeled Data
Figure 4 for Shoestring: Graph-Based Semi-Supervised Learning with Severely Limited Labeled Data
Viaarxiv icon