Picture for Jerry Huang

Jerry Huang

RAG-RL: Advancing Retrieval-Augmented Generation via RL and Curriculum Learning

Add code
Mar 17, 2025
Viaarxiv icon

Do Large Language Models Know How Much They Know?

Add code
Feb 26, 2025
Viaarxiv icon

ZETA: Leveraging Z-order Curves for Efficient Top-k Attention

Add code
Jan 24, 2025
Viaarxiv icon

Do Robot Snakes Dream like Electric Sheep? Investigating the Effects of Architectural Inductive Biases on Hallucination

Add code
Oct 22, 2024
Figure 1 for Do Robot Snakes Dream like Electric Sheep? Investigating the Effects of Architectural Inductive Biases on Hallucination
Figure 2 for Do Robot Snakes Dream like Electric Sheep? Investigating the Effects of Architectural Inductive Biases on Hallucination
Figure 3 for Do Robot Snakes Dream like Electric Sheep? Investigating the Effects of Architectural Inductive Biases on Hallucination
Figure 4 for Do Robot Snakes Dream like Electric Sheep? Investigating the Effects of Architectural Inductive Biases on Hallucination
Viaarxiv icon

Predicting adaptively chosen observables in quantum systems

Add code
Oct 20, 2024
Viaarxiv icon

Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models

Add code
Aug 16, 2024
Viaarxiv icon

How Well Can a Long Sequence Model Model Long Sequences? Comparing Architechtural Inductive Biases on Long-Context Abilities

Add code
Jul 11, 2024
Viaarxiv icon

Predicting the Impact of Model Expansion through the Minima Manifold: A Loss Landscape Perspective

Add code
May 24, 2024
Figure 1 for Predicting the Impact of Model Expansion through the Minima Manifold: A Loss Landscape Perspective
Figure 2 for Predicting the Impact of Model Expansion through the Minima Manifold: A Loss Landscape Perspective
Figure 3 for Predicting the Impact of Model Expansion through the Minima Manifold: A Loss Landscape Perspective
Figure 4 for Predicting the Impact of Model Expansion through the Minima Manifold: A Loss Landscape Perspective
Viaarxiv icon

Towards Practical Tool Usage for Continually Learning LLMs

Add code
Apr 14, 2024
Viaarxiv icon

EpiK-Eval: Evaluation for Language Models as Epistemic Models

Add code
Oct 23, 2023
Viaarxiv icon