Shahar Katz

Segment-Based Attention Masking for GPTs

Dec 24, 2024
Via arXiv

Reversed Attention: On The Gradient Descent Of Attention Layers In GPT

Dec 22, 2024
Via arXiv

Backward Lens: Projecting Language Model Gradients into the Vocabulary Space

Feb 20, 2024
Via arXiv

Interpreting Transformer's Attention Dynamic Memory and Visualizing the Semantic Information Flow of GPT

May 22, 2023
Via arXiv