Picture for Quanquan Gu

Quanquan Gu

Sharp Analysis for KL-Regularized Contextual Bandits and RLHF

Add code
Nov 07, 2024
Viaarxiv icon

CoPS: Empowering LLM Agents with Provable Cross-Task Experience Sharing

Add code
Oct 22, 2024
Viaarxiv icon

Unified Convergence Analysis for Score-Based Diffusion Models with Deterministic Samplers

Add code
Oct 18, 2024
Viaarxiv icon

DPLM-2: A Multimodal Diffusion Protein Language Model

Add code
Oct 17, 2024
Viaarxiv icon

Enhancing Multi-Step Reasoning Abilities of Language Models through Direct Q-Function Optimization

Add code
Oct 11, 2024
Viaarxiv icon

CryoFM: A Flow-based Foundation Model for Cryo-EM Densities

Add code
Oct 11, 2024
Viaarxiv icon

Accelerated Preference Optimization for Large Language Model Alignment

Add code
Oct 08, 2024
Viaarxiv icon

Convergence of Score-Based Discrete Diffusion Models: A Discrete-Time Analysis

Add code
Oct 03, 2024
Viaarxiv icon

LLaVA-Critic: Learning to Evaluate Multimodal Models

Add code
Oct 03, 2024
Viaarxiv icon

General Preference Modeling with Preference Representations for Aligning Language Models

Add code
Oct 03, 2024
Viaarxiv icon