Zeyang Zhang

OCCO: LVM-guided Infrared and Visible Image Fusion Framework based on Object-aware and Contextual COntrastive Learning

Mar 24, 2025
Via arXiv

Learning a Unified Degradation-aware Representation Model for Multi-modal Image Fusion

Mar 10, 2025
Via arXiv

One Model for ALL: Low-Level Task Interaction Is a Key to Task-Agnostic Image Fusion

Feb 27, 2025
Via arXiv

VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understanding

Oct 11, 2024
Via arXiv

Multi-sentence Video Grounding for Long Video Generation

Jul 18, 2024
Via arXiv

Towards Lightweight Graph Neural Network Search with Curriculum Graph Sparsification

Jun 24, 2024
Via arXiv

CoMoFusion: Fast and High-quality Fusion of Infrared and Visible Image with Consistency Model

May 31, 2024
Via arXiv

Causal-Aware Graph Neural Architecture Search under Distribution Shifts

May 26, 2024
Via arXiv

DisenStudio: Customized Multi-subject Text-to-Video Generation with Disentangled Spatial Control

May 21, 2024
Via arXiv

LLM-Enhanced Causal Discovery in Temporal Domain from Interventional Data

Apr 23, 2024
Via arXiv