Picture for Aryo Pradipta Gema

Aryo Pradipta Gema

Self-Training Large Language Models for Tool-Use Without Demonstrations

Add code
Feb 09, 2025
Viaarxiv icon

Lost in Time: Clock and Calendar Understanding Challenges in Multimodal LLMs

Add code
Feb 07, 2025
Viaarxiv icon

DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations

Add code
Oct 24, 2024
Figure 1 for DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations
Figure 2 for DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations
Figure 3 for DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations
Figure 4 for DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations
Viaarxiv icon

Analysing the Residual Stream of Language Models Under Knowledge Conflicts

Add code
Oct 21, 2024
Figure 1 for Analysing the Residual Stream of Language Models Under Knowledge Conflicts
Figure 2 for Analysing the Residual Stream of Language Models Under Knowledge Conflicts
Figure 3 for Analysing the Residual Stream of Language Models Under Knowledge Conflicts
Figure 4 for Analysing the Residual Stream of Language Models Under Knowledge Conflicts
Viaarxiv icon

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

Add code
Oct 21, 2024
Figure 1 for Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
Figure 2 for Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
Figure 3 for Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
Figure 4 for Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
Viaarxiv icon

CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical Reasoning

Add code
Oct 14, 2024
Figure 1 for CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical Reasoning
Figure 2 for CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical Reasoning
Figure 3 for CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical Reasoning
Figure 4 for CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical Reasoning
Viaarxiv icon

A Comparative Study on Patient Language across Therapeutic Domains for Effective Patient Voice Classification in Online Health Discussions

Add code
Jul 23, 2024
Viaarxiv icon

Are We Done with MMLU?

Add code
Jun 07, 2024
Figure 1 for Are We Done with MMLU?
Figure 2 for Are We Done with MMLU?
Figure 3 for Are We Done with MMLU?
Figure 4 for Are We Done with MMLU?
Viaarxiv icon

Edinburgh Clinical NLP at MEDIQA-CORR 2024: Guiding Large Language Models with Hints

Add code
May 28, 2024
Viaarxiv icon

The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models

Add code
Apr 08, 2024
Viaarxiv icon