Picture for Chao Ma

Chao Ma

Patch-Discontinuity Mining for Generalized Deepfake Detection

Add code
Dec 26, 2025
Viaarxiv icon

LiteFusion: Taming 3D Object Detectors from Vision-Based to Multi-Modal with Minimal Adaptation

Add code
Dec 23, 2025
Viaarxiv icon

Grounding Everything in Tokens for Multimodal Large Language Models

Add code
Dec 11, 2025
Viaarxiv icon

Thermally Activated Dual-Modal Adversarial Clothing against AI Surveillance Systems

Add code
Nov 17, 2025
Viaarxiv icon

Scaling Up Occupancy-centric Driving Scene Generation: Dataset and Method

Add code
Oct 27, 2025
Figure 1 for Scaling Up Occupancy-centric Driving Scene Generation: Dataset and Method
Figure 2 for Scaling Up Occupancy-centric Driving Scene Generation: Dataset and Method
Figure 3 for Scaling Up Occupancy-centric Driving Scene Generation: Dataset and Method
Figure 4 for Scaling Up Occupancy-centric Driving Scene Generation: Dataset and Method
Viaarxiv icon

BachVid: Training-Free Video Generation with Consistent Background and Character

Add code
Oct 24, 2025
Viaarxiv icon

AnchorSync: Global Consistency Optimization for Long Video Editing

Add code
Aug 20, 2025
Viaarxiv icon

Cross-Architecture Distillation Made Simple with Redundancy Suppression

Add code
Jul 29, 2025
Viaarxiv icon

Corvid: Improving Multimodal Large Language Models Towards Chain-of-Thought Reasoning

Add code
Jul 10, 2025
Viaarxiv icon

Co-Reinforcement Learning for Unified Multimodal Understanding and Generation

Add code
May 23, 2025
Viaarxiv icon