Jeffrey Quesnelle

DeMo: Decoupled Momentum Optimization

Nov 29, 2024
Via arXiv

Hermes 3 Technical Report

Aug 15, 2024
Via arXiv

YaRN: Efficient Context Window Extension of Large Language Models

Aug 31, 2023
Via arXiv