Picture for Siyan Zhao

Siyan Zhao

MedMax: Mixed-Modal Instruction Tuning for Training Biomedical Assistants

Add code
Dec 17, 2024
Viaarxiv icon

DODT: Enhanced Online Decision Transformer Learning through Dreamer's Actor-Critic Trajectory Forecasting

Add code
Oct 15, 2024
Figure 1 for DODT: Enhanced Online Decision Transformer Learning through Dreamer's Actor-Critic Trajectory Forecasting
Figure 2 for DODT: Enhanced Online Decision Transformer Learning through Dreamer's Actor-Critic Trajectory Forecasting
Figure 3 for DODT: Enhanced Online Decision Transformer Learning through Dreamer's Actor-Critic Trajectory Forecasting
Figure 4 for DODT: Enhanced Online Decision Transformer Learning through Dreamer's Actor-Critic Trajectory Forecasting
Viaarxiv icon

Probing the Decision Boundaries of In-context Learning in Large Language Models

Add code
Jun 17, 2024
Viaarxiv icon

Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models

Add code
Apr 15, 2024
Viaarxiv icon

Group Preference Optimization: Few-Shot Alignment of Large Language Models

Add code
Oct 17, 2023
Viaarxiv icon

Decision Stacks: Flexible Reinforcement Learning via Modular Generative Models

Add code
Jun 09, 2023
Viaarxiv icon