Picture for Wenwu Zhu

Wenwu Zhu

CST

VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understanding

Add code
Oct 11, 2024
Viaarxiv icon

Multi-weather Cross-view Geo-localization Using Denoising Diffusion Models

Add code
Aug 05, 2024
Viaarxiv icon

U2UData: A Large-scale Cooperative Perception Dataset for Swarm UAVs Autonomous Flight

Add code
Aug 05, 2024
Viaarxiv icon

Multi-sentence Video Grounding for Long Video Generation

Add code
Jul 18, 2024
Figure 1 for Multi-sentence Video Grounding for Long Video Generation
Figure 2 for Multi-sentence Video Grounding for Long Video Generation
Figure 3 for Multi-sentence Video Grounding for Long Video Generation
Figure 4 for Multi-sentence Video Grounding for Long Video Generation
Viaarxiv icon

PRANCE: Joint Token-Optimization and Structural Channel-Pruning for Adaptive ViT Inference

Add code
Jul 06, 2024
Viaarxiv icon

Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers

Add code
Jun 25, 2024
Viaarxiv icon

Towards Lightweight Graph Neural Network Search with Curriculum Graph Sparsification

Add code
Jun 24, 2024
Figure 1 for Towards Lightweight Graph Neural Network Search with Curriculum Graph Sparsification
Figure 2 for Towards Lightweight Graph Neural Network Search with Curriculum Graph Sparsification
Figure 3 for Towards Lightweight Graph Neural Network Search with Curriculum Graph Sparsification
Figure 4 for Towards Lightweight Graph Neural Network Search with Curriculum Graph Sparsification
Viaarxiv icon

Causal-Aware Graph Neural Architecture Search under Distribution Shifts

Add code
May 26, 2024
Viaarxiv icon

DisenStudio: Customized Multi-subject Text-to-Video Generation with Disentangled Spatial Control

Add code
May 21, 2024
Figure 1 for DisenStudio: Customized Multi-subject Text-to-Video Generation with Disentangled Spatial Control
Figure 2 for DisenStudio: Customized Multi-subject Text-to-Video Generation with Disentangled Spatial Control
Figure 3 for DisenStudio: Customized Multi-subject Text-to-Video Generation with Disentangled Spatial Control
Figure 4 for DisenStudio: Customized Multi-subject Text-to-Video Generation with Disentangled Spatial Control
Viaarxiv icon

TMPQ-DM: Joint Timestep Reduction and Quantization Precision Selection for Efficient Diffusion Models

Add code
Apr 15, 2024
Viaarxiv icon