Picture for Julia Hockenmaier

Julia Hockenmaier

Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators

Add code
Mar 25, 2025
Viaarxiv icon

RAG-RL: Advancing Retrieval-Augmented Generation via RL and Curriculum Learning

Add code
Mar 17, 2025
Viaarxiv icon

Entailment-Preserving First-order Logic Representations in Natural Language Entailment

Add code
Feb 24, 2025
Viaarxiv icon

Evaluating Step-by-step Reasoning Traces: A Survey

Add code
Feb 17, 2025
Viaarxiv icon

BAP v2: An Enhanced Task Framework for Instruction Following in Minecraft Dialogues

Add code
Jan 18, 2025
Viaarxiv icon

Measuring the Reliability of Causal Probing Methods: Tradeoffs, Limitations, and the Plight of Nullifying Interventions

Add code
Aug 28, 2024
Viaarxiv icon

Analyzing the Performance of Large Language Models on Code Summarization

Add code
Apr 10, 2024
Viaarxiv icon

Attack and Reset for Unlearning: Exploiting Adversarial Noise toward Machine Unlearning through Parameter Re-initialization

Add code
Jan 17, 2024
Viaarxiv icon

A Framework for Bidirectional Decoding: Case Study in Morphological Inflection

Add code
May 21, 2023
Figure 1 for A Framework for Bidirectional Decoding: Case Study in Morphological Inflection
Figure 2 for A Framework for Bidirectional Decoding: Case Study in Morphological Inflection
Figure 3 for A Framework for Bidirectional Decoding: Case Study in Morphological Inflection
Figure 4 for A Framework for Bidirectional Decoding: Case Study in Morphological Inflection
Viaarxiv icon

Multimedia Generative Script Learning for Task Planning

Add code
Aug 25, 2022
Figure 1 for Multimedia Generative Script Learning for Task Planning
Figure 2 for Multimedia Generative Script Learning for Task Planning
Figure 3 for Multimedia Generative Script Learning for Task Planning
Figure 4 for Multimedia Generative Script Learning for Task Planning
Viaarxiv icon