Picture for Dylan Zhang

Dylan Zhang

Entropy-Regularized Process Reward Model

Add code
Dec 15, 2024
Viaarxiv icon

$\textbf{Only-IF}$:Revealing the Decisive Effect of Instruction Diversity on Generalization

Add code
Oct 07, 2024
Viaarxiv icon

Visual Prompting in LLMs for Enhancing Emotion Recognition

Add code
Oct 03, 2024
Viaarxiv icon

QEDCartographer: Automating Formal Verification Using Reward-Free Reinforcement Learning

Add code
Aug 17, 2024
Viaarxiv icon

PLUM: Preference Learning Plus Test Cases Yields Better Code Language Models

Add code
Jun 11, 2024
Viaarxiv icon

From Symbolic Tasks to Code Generation: Diversification Yields Better Task Performers

Add code
May 31, 2024
Viaarxiv icon

Instruction Diversity Drives Generalization To Unseen Tasks

Add code
Feb 16, 2024
Viaarxiv icon

Transformer-Based Models Are Not Yet Perfect At Learning to Emulate Structural Recursion

Add code
Jan 23, 2024
Viaarxiv icon

PACE-LM: Prompting and Augmentation for Calibrated Confidence Estimation with GPT-4 in Cloud Incident Root Cause Analysis

Add code
Sep 29, 2023
Viaarxiv icon