Picture for Xuzheng He

Xuzheng He

Deeper Insights Without Updates: The Power of In-Context Learning Over Fine-Tuning

Add code
Oct 07, 2024
Figure 1 for Deeper Insights Without Updates: The Power of In-Context Learning Over Fine-Tuning
Figure 2 for Deeper Insights Without Updates: The Power of In-Context Learning Over Fine-Tuning
Figure 3 for Deeper Insights Without Updates: The Power of In-Context Learning Over Fine-Tuning
Figure 4 for Deeper Insights Without Updates: The Power of In-Context Learning Over Fine-Tuning
Viaarxiv icon

Interpretable Contrastive Monte Carlo Tree Search Reasoning

Add code
Oct 02, 2024
Figure 1 for Interpretable Contrastive Monte Carlo Tree Search Reasoning
Figure 2 for Interpretable Contrastive Monte Carlo Tree Search Reasoning
Figure 3 for Interpretable Contrastive Monte Carlo Tree Search Reasoning
Figure 4 for Interpretable Contrastive Monte Carlo Tree Search Reasoning
Viaarxiv icon

Demonstrating Mutual Reinforcement Effect through Information Flow

Add code
Mar 05, 2024
Figure 1 for Demonstrating Mutual Reinforcement Effect through Information Flow
Figure 2 for Demonstrating Mutual Reinforcement Effect through Information Flow
Figure 3 for Demonstrating Mutual Reinforcement Effect through Information Flow
Figure 4 for Demonstrating Mutual Reinforcement Effect through Information Flow
Viaarxiv icon

StableMask: Refining Causal Masking in Decoder-only Transformer

Add code
Feb 07, 2024
Figure 1 for StableMask: Refining Causal Masking in Decoder-only Transformer
Figure 2 for StableMask: Refining Causal Masking in Decoder-only Transformer
Figure 3 for StableMask: Refining Causal Masking in Decoder-only Transformer
Figure 4 for StableMask: Refining Causal Masking in Decoder-only Transformer
Viaarxiv icon

RWKV: Reinventing RNNs for the Transformer Era

Add code
May 22, 2023
Viaarxiv icon