Ivan Titov

A Unified View of Attention and Residual Sinks: Outlier-Driven Rescaling is Essential for Transformer Training

Jan 30, 2026

Enhancing Long Document Long Form Summarisation with Self-Planning

Dec 19, 2025

Clarification as Supervision: Reinforcement Learning for Vision-Language Interfaces

Sep 30, 2025

Blending Supervised and Reinforcement Fine-Tuning with Prefix Sampling

Jul 02, 2025

M-Wanda: Improving One-Shot Pruning for Multilingual LLMs

May 27, 2025

Truthful or Fabricated? Using Causal Attribution to Mitigate Reward Hacking in Explanations

Apr 07, 2025

Joint Localization and Activation Editing for Low-Resource Fine-Tuning

Feb 03, 2025

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Jan 21, 2025

Language Agents Meet Causality -- Bridging LLMs and Causal World Models

Oct 25, 2024

What's New in My Data? Novelty Exploration via Contrastive Generation

Oct 18, 2024