Picture for Deqing Fu

Deqing Fu

Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning

Add code
Jul 22, 2025
Viaarxiv icon

Resa: Transparent Reasoning Models via SAEs

Add code
Jun 11, 2025
Viaarxiv icon

Textual Steering Vectors Can Improve Visual Understanding in Multimodal Large Language Models

Add code
May 20, 2025
Figure 1 for Textual Steering Vectors Can Improve Visual Understanding in Multimodal Large Language Models
Figure 2 for Textual Steering Vectors Can Improve Visual Understanding in Multimodal Large Language Models
Figure 3 for Textual Steering Vectors Can Improve Visual Understanding in Multimodal Large Language Models
Figure 4 for Textual Steering Vectors Can Improve Visual Understanding in Multimodal Large Language Models
Viaarxiv icon

VisualLens: Personalization through Visual History

Add code
Nov 25, 2024
Figure 1 for VisualLens: Personalization through Visual History
Figure 2 for VisualLens: Personalization through Visual History
Figure 3 for VisualLens: Personalization through Visual History
Figure 4 for VisualLens: Personalization through Visual History
Viaarxiv icon

TLDR: Token-Level Detective Reward Model for Large Vision Language Models

Add code
Oct 07, 2024
Figure 1 for TLDR: Token-Level Detective Reward Model for Large Vision Language Models
Figure 2 for TLDR: Token-Level Detective Reward Model for Large Vision Language Models
Figure 3 for TLDR: Token-Level Detective Reward Model for Large Vision Language Models
Figure 4 for TLDR: Token-Level Detective Reward Model for Large Vision Language Models
Viaarxiv icon

Pre-trained Large Language Models Use Fourier Features to Compute Addition

Add code
Jun 05, 2024
Figure 1 for Pre-trained Large Language Models Use Fourier Features to Compute Addition
Figure 2 for Pre-trained Large Language Models Use Fourier Features to Compute Addition
Figure 3 for Pre-trained Large Language Models Use Fourier Features to Compute Addition
Figure 4 for Pre-trained Large Language Models Use Fourier Features to Compute Addition
Viaarxiv icon

IsoBench: Benchmarking Multimodal Foundation Models on Isomorphic Representations

Add code
Apr 02, 2024
Figure 1 for IsoBench: Benchmarking Multimodal Foundation Models on Isomorphic Representations
Figure 2 for IsoBench: Benchmarking Multimodal Foundation Models on Isomorphic Representations
Figure 3 for IsoBench: Benchmarking Multimodal Foundation Models on Isomorphic Representations
Figure 4 for IsoBench: Benchmarking Multimodal Foundation Models on Isomorphic Representations
Viaarxiv icon

Simplicity Bias of Transformers to Learn Low Sensitivity Functions

Add code
Mar 11, 2024
Viaarxiv icon

DeLLMa: A Framework for Decision Making Under Uncertainty with Large Language Models

Add code
Feb 04, 2024
Viaarxiv icon

DreamSync: Aligning Text-to-Image Generation with Image Understanding Feedback

Add code
Nov 29, 2023
Figure 1 for DreamSync: Aligning Text-to-Image Generation with Image Understanding Feedback
Figure 2 for DreamSync: Aligning Text-to-Image Generation with Image Understanding Feedback
Figure 3 for DreamSync: Aligning Text-to-Image Generation with Image Understanding Feedback
Figure 4 for DreamSync: Aligning Text-to-Image Generation with Image Understanding Feedback
Viaarxiv icon