Picture for Barbara Plank

Barbara Plank

Linear Script Representations in Speech Foundation Models Enable Zero-Shot Transliteration

Add code
Jan 06, 2026
Viaarxiv icon

Decoupling the Effect of Chain-of-Thought Reasoning: A Human Label Variation Perspective

Add code
Jan 06, 2026
Viaarxiv icon

EVADE: LLM-Based Explanation Generation and Validation for Error Detection in NLI

Add code
Nov 12, 2025
Viaarxiv icon

BoN Appetit Team at LeWiDi-2025: Best-of-N Test-time Scaling Can Not Stomach Annotation Disagreements (Yet)

Add code
Oct 14, 2025
Figure 1 for BoN Appetit Team at LeWiDi-2025: Best-of-N Test-time Scaling Can Not Stomach Annotation Disagreements (Yet)
Figure 2 for BoN Appetit Team at LeWiDi-2025: Best-of-N Test-time Scaling Can Not Stomach Annotation Disagreements (Yet)
Figure 3 for BoN Appetit Team at LeWiDi-2025: Best-of-N Test-time Scaling Can Not Stomach Annotation Disagreements (Yet)
Figure 4 for BoN Appetit Team at LeWiDi-2025: Best-of-N Test-time Scaling Can Not Stomach Annotation Disagreements (Yet)
Viaarxiv icon

Standard-to-Dialect Transfer Trends Differ across Text and Speech: A Case Study on Intent and Topic Classification in German Dialects

Add code
Oct 09, 2025
Figure 1 for Standard-to-Dialect Transfer Trends Differ across Text and Speech: A Case Study on Intent and Topic Classification in German Dialects
Figure 2 for Standard-to-Dialect Transfer Trends Differ across Text and Speech: A Case Study on Intent and Topic Classification in German Dialects
Figure 3 for Standard-to-Dialect Transfer Trends Differ across Text and Speech: A Case Study on Intent and Topic Classification in German Dialects
Figure 4 for Standard-to-Dialect Transfer Trends Differ across Text and Speech: A Case Study on Intent and Topic Classification in German Dialects
Viaarxiv icon

Is It Thinking or Cheating? Detecting Implicit Reward Hacking by Measuring Reasoning Effort

Add code
Oct 01, 2025
Figure 1 for Is It Thinking or Cheating? Detecting Implicit Reward Hacking by Measuring Reasoning Effort
Figure 2 for Is It Thinking or Cheating? Detecting Implicit Reward Hacking by Measuring Reasoning Effort
Figure 3 for Is It Thinking or Cheating? Detecting Implicit Reward Hacking by Measuring Reasoning Effort
Figure 4 for Is It Thinking or Cheating? Detecting Implicit Reward Hacking by Measuring Reasoning Effort
Viaarxiv icon

Evaluating Large Language Models for Cross-Lingual Retrieval

Add code
Sep 18, 2025
Viaarxiv icon

Revisiting Active Learning under (Human) Label Variation

Add code
Jul 03, 2025
Viaarxiv icon

Evaluation Should Not Ignore Variation: On the Impact of Reference Set Choice on Summarization Metrics

Add code
Jun 17, 2025
Viaarxiv icon

Do LLMs Give Psychometrically Plausible Responses in Educational Assessments?

Add code
Jun 11, 2025
Viaarxiv icon