Tim Rocktäschel

DéjàQ: Open-Ended Evolution of Diverse, Learnable and Verifiable Problems

Jan 05, 2026
Via arXiv

Learning When to Plan: Efficiently Allocating Test-Time Compute for LLM Agents

Sep 03, 2025
Via arXiv

LLM-First Search: Self-Guided Exploration of the Solution Space

Jun 05, 2025
Via arXiv

D3PO: Preference-Based Alignment of Discrete Diffusion Models

Mar 11, 2025
Via arXiv

Investigating Non-Transitivity in LLM-as-a-Judge

Feb 19, 2025
Via arXiv

BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games

Nov 20, 2024
Via arXiv

Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models

Nov 19, 2024
Via arXiv

TICKing All the Boxes: Generated Checklists Improve LLM Evaluation and Generation

Oct 04, 2024
Via arXiv

Outliers and Calibration Sets have Diminishing Effect on Quantization of Modern LLMs

Jun 03, 2024
Via arXiv

Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts

Feb 26, 2024
Via arXiv