Picture for Konrad Staniszewski

Konrad Staniszewski

Analysing The Impact of Sequence Composition on Language Model Pre-Training

Add code
Feb 21, 2024
Viaarxiv icon

Structured Packing in LLM Training Improves Long Context Utilization

Add code
Jan 02, 2024
Viaarxiv icon

Focused Transformer: Contrastive Training for Context Scaling

Add code
Jul 06, 2023
Viaarxiv icon