Ziyang Song

Depth Anything in $360^\circ$: Towards Scale Invariance in the Wild

Dec 28, 2025

ByteLoom: Weaving Geometry-Consistent Human-Object Interactions through Progressive Curriculum Learning

Dec 28, 2025

Anatomy-R1: Enhancing Anatomy Reasoning in Multimodal Large Language Models via Anatomical Similarity Curriculum and Group Diversity Augmentation

Dec 24, 2025

CETCAM: Camera-Controllable Video Generation via Consistent and Extensible Tokenization

Dec 22, 2025

Cross-modal Retrieval Models for Stripped Binary Analysis

Dec 11, 2025

Multimodal Causal-Driven Representation Learning for Generalizable Medical Image Segmentation

Aug 07, 2025

FreeGave: 3D Physics Learning from Dynamic Videos by Gaussian Velocity

Jun 09, 2025

DepthMaster: Taming Diffusion Models for Monocular Depth Estimation

Jan 05, 2025

DeepSeek-V3 Technical Report

Dec 27, 2024

P2DFlow: A Protein Ensemble Generative Model with SE(3) Flow Matching

Nov 26, 2024