Picture for Xiangming Gu

Xiangming Gu

The Illusion of Stochasticity in LLMs

Add code
Apr 08, 2026
Viaarxiv icon

Understanding Performance Gap Between Parallel and Sequential Sampling in Large Reasoning Models

Add code
Apr 07, 2026
Viaarxiv icon

Unlocking Large Audio-Language Models for Interactive Language Learning

Add code
Jan 21, 2026
Viaarxiv icon

Why do LLMs attend to the first token?

Add code
Apr 03, 2025
Viaarxiv icon

SkyLadder: Better and Faster Pretraining via Context Window Scheduling

Add code
Mar 19, 2025
Figure 1 for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
Figure 2 for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
Figure 3 for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
Figure 4 for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
Viaarxiv icon

When Attention Sink Emerges in Language Models: An Empirical View

Add code
Oct 14, 2024
Figure 1 for When Attention Sink Emerges in Language Models: An Empirical View
Figure 2 for When Attention Sink Emerges in Language Models: An Empirical View
Figure 3 for When Attention Sink Emerges in Language Models: An Empirical View
Figure 4 for When Attention Sink Emerges in Language Models: An Empirical View
Viaarxiv icon

On Calibration of LLM-based Guard Models for Reliable Content Moderation

Add code
Oct 14, 2024
Figure 1 for On Calibration of LLM-based Guard Models for Reliable Content Moderation
Figure 2 for On Calibration of LLM-based Guard Models for Reliable Content Moderation
Figure 3 for On Calibration of LLM-based Guard Models for Reliable Content Moderation
Figure 4 for On Calibration of LLM-based Guard Models for Reliable Content Moderation
Viaarxiv icon

Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast

Add code
Feb 13, 2024
Figure 1 for Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast
Figure 2 for Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast
Figure 3 for Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast
Figure 4 for Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast
Viaarxiv icon

On Memorization in Diffusion Models

Add code
Oct 04, 2023
Figure 1 for On Memorization in Diffusion Models
Figure 2 for On Memorization in Diffusion Models
Figure 3 for On Memorization in Diffusion Models
Figure 4 for On Memorization in Diffusion Models
Viaarxiv icon

Elucidate Gender Fairness in Singing Voice Transcription

Add code
Aug 05, 2023
Figure 1 for Elucidate Gender Fairness in Singing Voice Transcription
Figure 2 for Elucidate Gender Fairness in Singing Voice Transcription
Figure 3 for Elucidate Gender Fairness in Singing Voice Transcription
Figure 4 for Elucidate Gender Fairness in Singing Voice Transcription
Viaarxiv icon