Picture for Zhuoran Yang

Zhuoran Yang

In-Context Linear Regression Demystified: Training Dynamics and Mechanistic Interpretability of Multi-Head Softmax Attention

Add code
Mar 17, 2025
Viaarxiv icon

Nash Equilibrium Constrained Auto-bidding With Bi-level Reinforcement Learning

Add code
Mar 13, 2025
Viaarxiv icon

Reflective Planning: Vision-Language Models for Multi-Stage Long-Horizon Robotic Manipulation

Add code
Feb 23, 2025
Viaarxiv icon

DrugImproverGPT: A Large Language Model for Drug Optimization with Fine-Tuning via Structured Policy Optimization

Add code
Feb 11, 2025
Viaarxiv icon

Active Advantage-Aligned Online Reinforcement Learning with Offline Data

Add code
Feb 11, 2025
Viaarxiv icon

Learning Task Representations from In-Context Learning

Add code
Feb 08, 2025
Viaarxiv icon

An Instrumental Value for Data Production and its Application to Data Pricing

Add code
Dec 24, 2024
Viaarxiv icon

Enhancing Multi-Text Long Video Generation Consistency without Tuning: Time-Frequency Analysis, Prompt Alignment, and Theory

Add code
Dec 23, 2024
Viaarxiv icon

Physical Informed Driving World Model

Add code
Dec 13, 2024
Figure 1 for Physical Informed Driving World Model
Figure 2 for Physical Informed Driving World Model
Figure 3 for Physical Informed Driving World Model
Figure 4 for Physical Informed Driving World Model
Viaarxiv icon

Pysical Informed Driving World Model

Add code
Dec 11, 2024
Figure 1 for Pysical Informed Driving World Model
Figure 2 for Pysical Informed Driving World Model
Figure 3 for Pysical Informed Driving World Model
Figure 4 for Pysical Informed Driving World Model
Viaarxiv icon