Picture for Haoran Lian

Haoran Lian

Breaking the Stage Barrier: A Novel Single-Stage Approach to Long Context Extension for Large Language Models

Add code
Dec 10, 2024
Viaarxiv icon

LBPE: Long-token-first Tokenization to Improve Large Language Models

Add code
Nov 08, 2024
Figure 1 for LBPE: Long-token-first Tokenization to Improve Large Language Models
Figure 2 for LBPE: Long-token-first Tokenization to Improve Large Language Models
Figure 3 for LBPE: Long-token-first Tokenization to Improve Large Language Models
Figure 4 for LBPE: Long-token-first Tokenization to Improve Large Language Models
Viaarxiv icon

MaskMoE: Boosting Token-Level Learning via Routing Mask in Mixture-of-Experts

Add code
Jul 13, 2024
Viaarxiv icon

Temporal Scaling Law for Large Language Models

Add code
Apr 27, 2024
Viaarxiv icon

Scaffold-BPE: Enhancing Byte Pair Encoding with Simple and Effective Scaffold Token Removal

Add code
Apr 27, 2024
Viaarxiv icon