Yuzhang Wu

Cross-layer Attention Sharing for Large Language Models

Aug 04, 2024
Via arXiv

Translate-and-Revise: Boosting Large Language Models for Constrained Translation

Jul 18, 2024
Via arXiv

Large Language Models are Parallel Multilingual Learners

Mar 14, 2024
Via arXiv