Yonatan Belinkov

Growing a Tail: Increasing Output Diversity in Large Language Models

Nov 05, 2024
Via arXiv

Distinguishing Ignorance from Error in LLM Hallucinations

Oct 29, 2024
Via arXiv

Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics

Oct 28, 2024
Via arXiv

Context-aware Prompt Tuning: Advancing In-Context Learning with Adversarial Methods

Oct 22, 2024
Via arXiv

LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations

Oct 03, 2024
Via arXiv

Fast Forwarding Low-Rank Training

Sep 06, 2024
Via arXiv

Jamba-1.5: Hybrid Transformer-Mamba Models at Scale

Aug 22, 2024
Via arXiv

The Quest for the Right Mediator: A History, Survey, and Theoretical Grounding of Causal Interpretability

Aug 02, 2024
Via arXiv

Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions

Jul 21, 2024
Via arXiv

Confidence Regulation Neurons in Language Models

Jun 24, 2024
Via arXiv