Picture for Qi Fan

Qi Fan

Prompt-Free Universal Region Proposal Network

Add code
Mar 18, 2026
Viaarxiv icon

DreamWorld: Unified World Modeling in Video Generation

Add code
Feb 28, 2026
Viaarxiv icon

PointAlign: Feature-Level Alignment Regularization for 3D Vision-Language Models

Add code
Feb 28, 2026
Viaarxiv icon

Annotation-Free Visual Reasoning for High-Resolution Large Multimodal Models via Reinforcement Learning

Add code
Feb 27, 2026
Viaarxiv icon

Pathwise Test-Time Correction for Autoregressive Long Video Generation

Add code
Feb 05, 2026
Viaarxiv icon

VMonarch: Efficient Video Diffusion Transformers with Structured Attention

Add code
Jan 29, 2026
Viaarxiv icon

FUSE-RSVLM: Feature Fusion Vision-Language Model for Remote Sensing

Add code
Dec 30, 2025
Viaarxiv icon

TimeBill: Time-Budgeted Inference for Large Language Models

Add code
Dec 26, 2025
Figure 1 for TimeBill: Time-Budgeted Inference for Large Language Models
Figure 2 for TimeBill: Time-Budgeted Inference for Large Language Models
Figure 3 for TimeBill: Time-Budgeted Inference for Large Language Models
Figure 4 for TimeBill: Time-Budgeted Inference for Large Language Models
Viaarxiv icon

A Benchmark for Ultra-High-Resolution Remote Sensing MLLMs

Add code
Dec 19, 2025
Viaarxiv icon

Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank

Add code
Dec 13, 2025
Figure 1 for Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank
Figure 2 for Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank
Figure 3 for Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank
Figure 4 for Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank
Viaarxiv icon