Ulyana Piterbarg

BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games

Nov 20, 2024
Via arXiv

Training Language Models on Synthetic Edit Sequences Improves Code Synthesis

Oct 03, 2024
Via arXiv

diff History for Long-Context Language Agents

Dec 12, 2023
Via arXiv

NetHack is Hard to Hack

May 30, 2023
Via arXiv