Jeff Yang

IDE-Bench: Evaluating Large Language Models as IDE Agents on Real-World Software Engineering Tasks

Jan 28, 2026
Via arXiv

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Jan 15, 2026
Via arXiv

Automatic Prompt Selection for Large Language Models

Apr 03, 2024
Via arXiv

When Giant Language Brains Just Aren't Enough! Domain Pizzazz with Knowledge Sparkle Dust

May 12, 2023
Via arXiv