Cho-Jui Hsieh

On the loss of context-awareness in general instruction fine-tuning

Nov 05, 2024

LoRA Done RITE: Robust Invariant Transformation Equilibration for LoRA Optimization

Oct 27, 2024

Accelerating Large Language Model Pretraining via LFR Pedagogy: Learn, Focus, and Review

Sep 10, 2024

CLUE: Concept-Level Uncertainty Estimation for Large Language Models

Sep 04, 2024

Embedding Space Selection for Detecting Memorization and Fingerprinting in Generative Models

Jul 30, 2024

Solving for X and Beyond: Can Large Language Models Solve Complex Math Problems with More-Than-Two Unknowns?

Jul 06, 2024

One Prompt is not Enough: Automated Construction of a Mixture-of-Expert Prompts

Jun 28, 2024

On Discrete Prompt Optimization for Diffusion Models

Jun 27, 2024

Large Language Models are Interpretable Learners

Jun 25, 2024

MOSSBench: Is Your Multimodal Language Model Oversensitive to Safe Queries?

Jun 22, 2024