Picture for Yanhao Zheng

Yanhao Zheng

JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization

Add code
Mar 30, 2025
Viaarxiv icon

Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation

Add code
Apr 12, 2024
Viaarxiv icon