Picture for Meizhi Zhong

Meizhi Zhong

ZigZagkv: Dynamic KV Cache Compression for Long-context Modeling based on Layer Uncertainty

Add code
Dec 12, 2024
Viaarxiv icon

MoDification: Mixture of Depths Made Easy

Add code
Oct 18, 2024
Figure 1 for MoDification: Mixture of Depths Made Easy
Figure 2 for MoDification: Mixture of Depths Made Easy
Figure 3 for MoDification: Mixture of Depths Made Easy
Figure 4 for MoDification: Mixture of Depths Made Easy
Viaarxiv icon

Understanding the RoPE Extensions of Long-Context LLMs: An Attention Perspective

Add code
Jun 19, 2024
Figure 1 for Understanding the RoPE Extensions of Long-Context LLMs: An Attention Perspective
Figure 2 for Understanding the RoPE Extensions of Long-Context LLMs: An Attention Perspective
Figure 3 for Understanding the RoPE Extensions of Long-Context LLMs: An Attention Perspective
Figure 4 for Understanding the RoPE Extensions of Long-Context LLMs: An Attention Perspective
Viaarxiv icon

On the Hallucination in Simultaneous Machine Translation

Add code
Jun 11, 2024
Viaarxiv icon

Context Consistency between Training and Testing in Simultaneous Machine Translation

Add code
Nov 13, 2023
Viaarxiv icon