Picture for Zhantao Yang

Zhantao Yang

OmniCamera: A Unified Framework for Multi-task Video Generation with Arbitrary Camera Control

Add code
Apr 07, 2026
Viaarxiv icon

Hierarchical SVG Tokenization: Learning Compact Visual Programs for Scalable Vector Graphics Modeling

Add code
Apr 06, 2026
Viaarxiv icon

OmniVideo-R1: Reinforcing Audio-visual Reasoning with Query Intention and Modality Attention

Add code
Feb 05, 2026
Viaarxiv icon

Addressing the ID-Matching Challenge in Long Video Captioning

Add code
Oct 08, 2025
Viaarxiv icon

Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference

Add code
Sep 09, 2025
Viaarxiv icon

STELAR-VISION: Self-Topology-Aware Efficient Learning for Aligned Reasoning in Vision

Add code
Aug 12, 2025
Viaarxiv icon

Accelerating Diffusion Sampling via Exploiting Local Transition Coherence

Add code
Mar 12, 2025
Viaarxiv icon

The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control

Add code
Dec 04, 2024
Figure 1 for The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control
Figure 2 for The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control
Figure 3 for The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control
Figure 4 for The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control
Viaarxiv icon

Hierarchical Knowledge Graph Construction from Images for Scalable E-Commerce

Add code
Oct 28, 2024
Figure 1 for Hierarchical Knowledge Graph Construction from Images for Scalable E-Commerce
Figure 2 for Hierarchical Knowledge Graph Construction from Images for Scalable E-Commerce
Figure 3 for Hierarchical Knowledge Graph Construction from Images for Scalable E-Commerce
Figure 4 for Hierarchical Knowledge Graph Construction from Images for Scalable E-Commerce
Viaarxiv icon

BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations

Add code
Jul 03, 2024
Figure 1 for BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations
Figure 2 for BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations
Figure 3 for BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations
Figure 4 for BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations
Viaarxiv icon