Picture for Zhenyu He

Zhenyu He

Enhancing Auto-regressive Chain-of-Thought through Loop-Aligned Reasoning

Add code
Feb 12, 2025
Viaarxiv icon

ZeroBP: Learning Position-Aware Correspondence for Zero-shot 6D Pose Estimation in Bin-Picking

Add code
Feb 03, 2025
Viaarxiv icon

PRSI: Privacy-Preserving Recommendation Model Based on Vector Splitting and Interactive Protocols

Add code
Nov 27, 2024
Viaarxiv icon

MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking

Add code
Nov 23, 2024
Figure 1 for MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking
Figure 2 for MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking
Figure 3 for MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking
Figure 4 for MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking
Viaarxiv icon

LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation

Add code
Sep 09, 2024
Figure 1 for LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation
Figure 2 for LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation
Figure 3 for LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation
Figure 4 for LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation
Viaarxiv icon

Discriminative Spatial-Semantic VOS Solution: 1st Place Solution for 6th LSVOS

Add code
Aug 29, 2024
Figure 1 for Discriminative Spatial-Semantic VOS Solution: 1st Place Solution for 6th LSVOS
Figure 2 for Discriminative Spatial-Semantic VOS Solution: 1st Place Solution for 6th LSVOS
Figure 3 for Discriminative Spatial-Semantic VOS Solution: 1st Place Solution for 6th LSVOS
Viaarxiv icon

Data Generation Scheme for Thermal Modality with Edge-Guided Adversarial Conditional Diffusion Model

Add code
Aug 07, 2024
Viaarxiv icon

Exploiting Pre-trained Models for Drug Target Affinity Prediction with Nearest Neighbors

Add code
Jul 21, 2024
Viaarxiv icon

GRAPE: Generalizable and Robust Multi-view Facial Capture

Add code
Jul 14, 2024
Viaarxiv icon

Learning Spatial-Semantic Features for Robust Video Object Segmentation

Add code
Jul 10, 2024
Viaarxiv icon