Picture for Hanjun Dai

Hanjun Dai

SDDBench: A Benchmark for Synthesizable Drug Design

Add code
Nov 13, 2024
Viaarxiv icon

Faster WIND: Accelerating Iterative Best-of-$N$ Distillation for LLM Alignment

Add code
Oct 28, 2024
Viaarxiv icon

Matryoshka: Learning to Drive Black-Box LLMs with LLMs

Add code
Oct 28, 2024
Viaarxiv icon

Autoregressive Large Language Models are Computationally Universal

Add code
Oct 04, 2024
Viaarxiv icon

Exploring and Benchmarking the Planning Capabilities of Large Language Models

Add code
Jun 18, 2024
Viaarxiv icon

Preference Optimization for Molecule Synthesis with Conditional Residual Energy-based Models

Add code
Jun 04, 2024
Figure 1 for Preference Optimization for Molecule Synthesis with Conditional Residual Energy-based Models
Figure 2 for Preference Optimization for Molecule Synthesis with Conditional Residual Energy-based Models
Figure 3 for Preference Optimization for Molecule Synthesis with Conditional Residual Energy-based Models
Figure 4 for Preference Optimization for Molecule Synthesis with Conditional Residual Energy-based Models
Viaarxiv icon

Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF

Add code
May 29, 2024
Figure 1 for Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF
Figure 2 for Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF
Figure 3 for Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF
Viaarxiv icon

Beyond Expectations: Learning with Stochastic Dominance Made Practical

Add code
Feb 05, 2024
Viaarxiv icon

SQLPrompt: In-Context Text-to-SQL with Minimal Labeled Data

Add code
Nov 06, 2023
Viaarxiv icon

On Task-personalized Multimodal Few-shot Learning for Visually-rich Document Entity Retrieval

Add code
Nov 01, 2023
Viaarxiv icon