Picture for Aryo Pradipta Gema

Aryo Pradipta Gema

DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations

Add code
Oct 24, 2024
Figure 1 for DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations
Figure 2 for DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations
Figure 3 for DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations
Figure 4 for DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations
Viaarxiv icon

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

Add code
Oct 21, 2024
Figure 1 for Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
Figure 2 for Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
Figure 3 for Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
Figure 4 for Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
Viaarxiv icon

Analysing the Residual Stream of Language Models Under Knowledge Conflicts

Add code
Oct 21, 2024
Figure 1 for Analysing the Residual Stream of Language Models Under Knowledge Conflicts
Figure 2 for Analysing the Residual Stream of Language Models Under Knowledge Conflicts
Figure 3 for Analysing the Residual Stream of Language Models Under Knowledge Conflicts
Figure 4 for Analysing the Residual Stream of Language Models Under Knowledge Conflicts
Viaarxiv icon

CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical Reasoning

Add code
Oct 14, 2024
Figure 1 for CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical Reasoning
Figure 2 for CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical Reasoning
Figure 3 for CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical Reasoning
Figure 4 for CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical Reasoning
Viaarxiv icon

A Comparative Study on Patient Language across Therapeutic Domains for Effective Patient Voice Classification in Online Health Discussions

Add code
Jul 23, 2024
Viaarxiv icon

Are We Done with MMLU?

Add code
Jun 07, 2024
Figure 1 for Are We Done with MMLU?
Figure 2 for Are We Done with MMLU?
Figure 3 for Are We Done with MMLU?
Figure 4 for Are We Done with MMLU?
Viaarxiv icon

Edinburgh Clinical NLP at MEDIQA-CORR 2024: Guiding Large Language Models with Hints

Add code
May 28, 2024
Viaarxiv icon

The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models

Add code
Apr 08, 2024
Viaarxiv icon

Edinburgh Clinical NLP at SemEval-2024 Task 2: Fine-tune your model unless you have access to GPT-4

Add code
Mar 30, 2024
Viaarxiv icon

Can GPT-3.5 Generate and Code Discharge Summaries?

Add code
Jan 24, 2024
Viaarxiv icon