Picture for Zikun Zhou

Zikun Zhou

MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking

Add code
Nov 23, 2024
Figure 1 for MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking
Figure 2 for MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking
Figure 3 for MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking
Figure 4 for MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking
Viaarxiv icon

Driving Referring Video Object Segmentation with Vision-Language Pre-trained Models

Add code
May 17, 2024
Figure 1 for Driving Referring Video Object Segmentation with Vision-Language Pre-trained Models
Figure 2 for Driving Referring Video Object Segmentation with Vision-Language Pre-trained Models
Figure 3 for Driving Referring Video Object Segmentation with Vision-Language Pre-trained Models
Figure 4 for Driving Referring Video Object Segmentation with Vision-Language Pre-trained Models
Viaarxiv icon

Bilateral Event Mining and Complementary for Event Stream Super-Resolution

Add code
May 16, 2024
Figure 1 for Bilateral Event Mining and Complementary for Event Stream Super-Resolution
Figure 2 for Bilateral Event Mining and Complementary for Event Stream Super-Resolution
Figure 3 for Bilateral Event Mining and Complementary for Event Stream Super-Resolution
Figure 4 for Bilateral Event Mining and Complementary for Event Stream Super-Resolution
Viaarxiv icon

Motion-aware Latent Diffusion Models for Video Frame Interpolation

Add code
Apr 21, 2024
Figure 1 for Motion-aware Latent Diffusion Models for Video Frame Interpolation
Figure 2 for Motion-aware Latent Diffusion Models for Video Frame Interpolation
Figure 3 for Motion-aware Latent Diffusion Models for Video Frame Interpolation
Figure 4 for Motion-aware Latent Diffusion Models for Video Frame Interpolation
Viaarxiv icon

RTracker: Recoverable Tracking via PN Tree Structured Memory

Add code
Mar 28, 2024
Viaarxiv icon

Robust 3D Tracking with Quality-Aware Shape Completion

Add code
Dec 17, 2023
Viaarxiv icon

Channel and Spatial Relation-Propagation Network for RGB-Thermal Semantic Segmentation

Add code
Aug 24, 2023
Figure 1 for Channel and Spatial Relation-Propagation Network for RGB-Thermal Semantic Segmentation
Figure 2 for Channel and Spatial Relation-Propagation Network for RGB-Thermal Semantic Segmentation
Figure 3 for Channel and Spatial Relation-Propagation Network for RGB-Thermal Semantic Segmentation
Figure 4 for Channel and Spatial Relation-Propagation Network for RGB-Thermal Semantic Segmentation
Viaarxiv icon

Cross-Modality Proposal-guided Feature Mining for Unregistered RGB-Thermal Pedestrian Detection

Add code
Aug 23, 2023
Viaarxiv icon

Reliability-Hierarchical Memory Network for Scribble-Supervised Video Object Segmentation

Add code
Mar 25, 2023
Viaarxiv icon

Joint Visual Grounding and Tracking with Natural Language Specification

Add code
Mar 21, 2023
Viaarxiv icon