Picture for Zorik Gekhman

Zorik Gekhman

LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations

Add code
Oct 03, 2024
Figure 1 for LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations
Figure 2 for LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations
Figure 3 for LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations
Figure 4 for LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations
Viaarxiv icon

NL-Eye: Abductive NLI for Images

Add code
Oct 03, 2024
Viaarxiv icon

Can LLMs Learn Macroeconomic Narratives from Social Media?

Add code
Jun 17, 2024
Figure 1 for Can LLMs Learn Macroeconomic Narratives from Social Media?
Figure 2 for Can LLMs Learn Macroeconomic Narratives from Social Media?
Figure 3 for Can LLMs Learn Macroeconomic Narratives from Social Media?
Figure 4 for Can LLMs Learn Macroeconomic Narratives from Social Media?
Viaarxiv icon

Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations?

Add code
May 09, 2024
Figure 1 for Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations?
Figure 2 for Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations?
Figure 3 for Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations?
Figure 4 for Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations?
Viaarxiv icon

Using Text Injection to Improve Recognition of Personal Identifiers in Speech

Add code
Aug 14, 2023
Figure 1 for Using Text Injection to Improve Recognition of Personal Identifiers in Speech
Figure 2 for Using Text Injection to Improve Recognition of Personal Identifiers in Speech
Figure 3 for Using Text Injection to Improve Recognition of Personal Identifiers in Speech
Figure 4 for Using Text Injection to Improve Recognition of Personal Identifiers in Speech
Viaarxiv icon

Measuring the Robustness of Natural Language Processing Models to Domain Shifts

Add code
May 31, 2023
Viaarxiv icon

TrueTeacher: Learning Factual Consistency Evaluation with Large Language Models

Add code
May 18, 2023
Viaarxiv icon

On the Robustness of Dialogue History Representation in Conversational Question Answering: A Comprehensive Study and a New Prompt-based Method

Add code
Jun 29, 2022
Figure 1 for On the Robustness of Dialogue History Representation in Conversational Question Answering: A Comprehensive Study and a New Prompt-based Method
Figure 2 for On the Robustness of Dialogue History Representation in Conversational Question Answering: A Comprehensive Study and a New Prompt-based Method
Figure 3 for On the Robustness of Dialogue History Representation in Conversational Question Answering: A Comprehensive Study and a New Prompt-based Method
Figure 4 for On the Robustness of Dialogue History Representation in Conversational Question Answering: A Comprehensive Study and a New Prompt-based Method
Viaarxiv icon

RED-ACE: Robust Error Detection for ASR using Confidence Embeddings

Add code
Mar 14, 2022
Figure 1 for RED-ACE: Robust Error Detection for ASR using Confidence Embeddings
Figure 2 for RED-ACE: Robust Error Detection for ASR using Confidence Embeddings
Figure 3 for RED-ACE: Robust Error Detection for ASR using Confidence Embeddings
Figure 4 for RED-ACE: Robust Error Detection for ASR using Confidence Embeddings
Viaarxiv icon

KoBE: Knowledge-Based Machine Translation Evaluation

Add code
Sep 23, 2020
Figure 1 for KoBE: Knowledge-Based Machine Translation Evaluation
Figure 2 for KoBE: Knowledge-Based Machine Translation Evaluation
Figure 3 for KoBE: Knowledge-Based Machine Translation Evaluation
Figure 4 for KoBE: Knowledge-Based Machine Translation Evaluation
Viaarxiv icon