Picture for Yanhao Zhang

Yanhao Zhang

Improved Visual-Spatial Reasoning via R1-Zero-Like Training

Add code
Apr 01, 2025
Viaarxiv icon

H2VU-Benchmark: A Comprehensive Benchmark for Hierarchical Holistic Video Understanding

Add code
Mar 31, 2025
Viaarxiv icon

Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens

Add code
Mar 12, 2025
Viaarxiv icon

HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator

Add code
Nov 26, 2024
Figure 1 for HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator
Figure 2 for HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator
Figure 3 for HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator
Figure 4 for HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator
Viaarxiv icon

LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image

Add code
Aug 14, 2024
Figure 1 for LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image
Figure 2 for LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image
Figure 3 for LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image
Figure 4 for LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image
Viaarxiv icon

R$^2$-Gaussian: Rectifying Radiative Gaussian Splatting for Tomographic Reconstruction

Add code
May 31, 2024
Viaarxiv icon

PoseAnimate: Zero-shot high fidelity pose controllable character animation

Add code
Apr 30, 2024
Viaarxiv icon

LoopAnimate: Loopable Salient Object Animation

Add code
Apr 14, 2024
Viaarxiv icon

Increasing SLAM Pose Accuracy by Ground-to-Satellite Image Registration

Add code
Apr 14, 2024
Viaarxiv icon

Homography Guided Temporal Fusion for Road Line and Marking Segmentation

Add code
Apr 11, 2024
Viaarxiv icon