Yonatan Belinkov

Trust Me, I'm Wrong: High-Certainty Hallucinations in LLMs

Feb 18, 2025
Via arXiv

Unsupervised Translation of Emergent Communication

Feb 11, 2025
Via arXiv

Position-aware Automatic Circuit Discovery

Feb 07, 2025
Via arXiv

Padding Tone: A Mechanistic Analysis of Padding Tokens in T2I Models

Jan 12, 2025
Via arXiv

Semantics and Spatiality of Emergent Communication

Nov 15, 2024
Via arXiv

Growing a Tail: Increasing Output Diversity in Large Language Models

Nov 05, 2024
Via arXiv

Distinguishing Ignorance from Error in LLM Hallucinations

Oct 29, 2024
Via arXiv

Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics

Oct 28, 2024
Via arXiv

Context-aware Prompt Tuning: Advancing In-Context Learning with Adversarial Methods

Oct 22, 2024
Via arXiv

LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations

Oct 03, 2024
Via arXiv