Yanwen Kong

Information Entropy Invariance: Enhancing Length Extrapolation in Attention Mechanisms

Jan 15, 2025
Via arXiv