Picture for Zijia Chen

Zijia Chen

Hymba: A Hybrid-head Architecture for Small Language Models

Add code
Nov 20, 2024
Viaarxiv icon