Picture for Qi Fan

Qi Fan

VMonarch: Efficient Video Diffusion Transformers with Structured Attention

Add code
Jan 29, 2026
Viaarxiv icon

FUSE-RSVLM: Feature Fusion Vision-Language Model for Remote Sensing

Add code
Dec 30, 2025
Viaarxiv icon

TimeBill: Time-Budgeted Inference for Large Language Models

Add code
Dec 26, 2025
Figure 1 for TimeBill: Time-Budgeted Inference for Large Language Models
Figure 2 for TimeBill: Time-Budgeted Inference for Large Language Models
Figure 3 for TimeBill: Time-Budgeted Inference for Large Language Models
Figure 4 for TimeBill: Time-Budgeted Inference for Large Language Models
Viaarxiv icon

A Benchmark for Ultra-High-Resolution Remote Sensing MLLMs

Add code
Dec 19, 2025
Viaarxiv icon

Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank

Add code
Dec 13, 2025
Figure 1 for Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank
Figure 2 for Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank
Figure 3 for Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank
Figure 4 for Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank
Viaarxiv icon

Denoising Vision Transformer Autoencoder with Spectral Self-Regularization

Add code
Nov 16, 2025
Viaarxiv icon

CUDA-LLM: LLMs Can Write Efficient CUDA Kernels

Add code
Jun 10, 2025
Figure 1 for CUDA-LLM: LLMs Can Write Efficient CUDA Kernels
Figure 2 for CUDA-LLM: LLMs Can Write Efficient CUDA Kernels
Figure 3 for CUDA-LLM: LLMs Can Write Efficient CUDA Kernels
Figure 4 for CUDA-LLM: LLMs Can Write Efficient CUDA Kernels
Viaarxiv icon

Interpretable and Reliable Detection of AI-Generated Images via Grounded Reasoning in MLLMs

Add code
Jun 08, 2025
Figure 1 for Interpretable and Reliable Detection of AI-Generated Images via Grounded Reasoning in MLLMs
Figure 2 for Interpretable and Reliable Detection of AI-Generated Images via Grounded Reasoning in MLLMs
Figure 3 for Interpretable and Reliable Detection of AI-Generated Images via Grounded Reasoning in MLLMs
Figure 4 for Interpretable and Reliable Detection of AI-Generated Images via Grounded Reasoning in MLLMs
Viaarxiv icon

Adapting In-Domain Few-Shot Segmentation to New Domains without Retraining

Add code
Apr 30, 2025
Figure 1 for Adapting In-Domain Few-Shot Segmentation to New Domains without Retraining
Figure 2 for Adapting In-Domain Few-Shot Segmentation to New Domains without Retraining
Figure 3 for Adapting In-Domain Few-Shot Segmentation to New Domains without Retraining
Figure 4 for Adapting In-Domain Few-Shot Segmentation to New Domains without Retraining
Viaarxiv icon

Domain-Rectifying Adapter for Cross-Domain Few-Shot Segmentation

Add code
Apr 16, 2024
Figure 1 for Domain-Rectifying Adapter for Cross-Domain Few-Shot Segmentation
Figure 2 for Domain-Rectifying Adapter for Cross-Domain Few-Shot Segmentation
Figure 3 for Domain-Rectifying Adapter for Cross-Domain Few-Shot Segmentation
Figure 4 for Domain-Rectifying Adapter for Cross-Domain Few-Shot Segmentation
Viaarxiv icon