Jacob Andreas

Can Gradient Descent Simulate Prompting?

Jun 26, 2025
Via arXiv

Sequential-Parallel Duality in Prefix Scannable Models

Jun 12, 2025
Via arXiv

Line of Sight: On Linear Representations in VLLMs

Jun 05, 2025
Via arXiv

Log-Augmented Generation: Scaling Test-Time Reasoning with Reusable Computation

May 20, 2025
Via arXiv

Self-Steering Language Models

Apr 09, 2025
Via arXiv

ThinkPrune: Pruning Long Chain-of-Thought of LLMs via Reinforcement Learning

Apr 02, 2025
Via arXiv

(How) Do Language Models Track State?

Mar 04, 2025
Via arXiv

LM Agents for Coordinating Multi-User Information Gathering

Feb 17, 2025
Via arXiv

The Surprising Effectiveness of Test-Time Training for Abstract Reasoning

Nov 11, 2024
Via arXiv

LoRA vs Full Fine-tuning: An Illusion of Equivalence

Oct 28, 2024
Via arXiv