Picture for Ziyao Zeng

Ziyao Zeng

PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation

Add code
Nov 24, 2024
Viaarxiv icon

RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions

Add code
Oct 03, 2024
Figure 1 for RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions
Figure 2 for RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions
Figure 3 for RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions
Figure 4 for RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions
Viaarxiv icon

NeuroBind: Towards Unified Multimodal Representations for Neural Signals

Add code
Jul 19, 2024
Viaarxiv icon

WorDepth: Variational Language Prior for Monocular Depth Estimation

Add code
Apr 05, 2024
Viaarxiv icon

Binding Touch to Everything: Learning Unified Multimodal Tactile Representations

Add code
Jan 31, 2024
Viaarxiv icon

iQuery: Instruments as Queries for Audio-Visual Sound Separation

Add code
Dec 08, 2022
Viaarxiv icon

PointCLIP V2: Adapting CLIP for Powerful 3D Open-world Learning

Add code
Nov 21, 2022
Viaarxiv icon

Can Language Understand Depth?

Add code
Jul 09, 2022
Figure 1 for Can Language Understand Depth?
Figure 2 for Can Language Understand Depth?
Figure 3 for Can Language Understand Depth?
Figure 4 for Can Language Understand Depth?
Viaarxiv icon

VT-CLIP: Enhancing Vision-Language Models with Visual-guided Texts

Add code
Dec 04, 2021
Figure 1 for VT-CLIP: Enhancing Vision-Language Models with Visual-guided Texts
Figure 2 for VT-CLIP: Enhancing Vision-Language Models with Visual-guided Texts
Figure 3 for VT-CLIP: Enhancing Vision-Language Models with Visual-guided Texts
Figure 4 for VT-CLIP: Enhancing Vision-Language Models with Visual-guided Texts
Viaarxiv icon

DSPoint: Dual-scale Point Cloud Recognition with High-frequency Fusion

Add code
Nov 19, 2021
Figure 1 for DSPoint: Dual-scale Point Cloud Recognition with High-frequency Fusion
Figure 2 for DSPoint: Dual-scale Point Cloud Recognition with High-frequency Fusion
Figure 3 for DSPoint: Dual-scale Point Cloud Recognition with High-frequency Fusion
Figure 4 for DSPoint: Dual-scale Point Cloud Recognition with High-frequency Fusion
Viaarxiv icon