Picture for Amirhossein Kazemnejad

Amirhossein Kazemnejad

VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment

Add code
Oct 02, 2024
Figure 1 for VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment
Figure 2 for VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment
Figure 3 for VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment
Figure 4 for VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment
Viaarxiv icon

The Impact of Positional Encoding on Length Generalization in Transformers

Add code
May 31, 2023
Viaarxiv icon

Measuring the Knowledge Acquisition-Utilization Gap in Pretrained Language Models

Add code
May 24, 2023
Viaarxiv icon

The Curious Case of Absolute Position Embeddings

Add code
Oct 23, 2022
Viaarxiv icon