Dylan Zhang

Building A Proof-Oriented Programmer That Is 64% Better Than GPT-4o Under Data Scarcity

Feb 17, 2025

The Best Instruction-Tuning Data are Those That Fit

Feb 07, 2025

Improving Influence-based Instruction Tuning Data Selection for Balanced Learning of Diverse Capabilities

Jan 21, 2025

Entropy-Regularized Process Reward Model

Dec 15, 2024

Only-IF: Revealing the Decisive Effect of Instruction Diversity on Generalization

Oct 07, 2024

Visual Prompting in LLMs for Enhancing Emotion Recognition

Oct 03, 2024

QEDCartographer: Automating Formal Verification Using Reward-Free Reinforcement Learning

Aug 17, 2024

PLUM: Preference Learning Plus Test Cases Yields Better Code Language Models

Jun 11, 2024

From Symbolic Tasks to Code Generation: Diversification Yields Better Task Performers

May 31, 2024

Instruction Diversity Drives Generalization To Unseen Tasks

Feb 16, 2024