Picture for Yu Deng

Yu Deng

IBM

VASA-3D: Lifelike Audio-Driven Gaussian Head Avatars from a Single Image

Add code
Dec 16, 2025
Viaarxiv icon

Native and Compact Structured Latents for 3D Generation

Add code
Dec 16, 2025
Viaarxiv icon

Depth-Consistent 3D Gaussian Splatting via Physical Defocus Modeling and Multi-View Geometric Supervision

Add code
Nov 13, 2025
Figure 1 for Depth-Consistent 3D Gaussian Splatting via Physical Defocus Modeling and Multi-View Geometric Supervision
Figure 2 for Depth-Consistent 3D Gaussian Splatting via Physical Defocus Modeling and Multi-View Geometric Supervision
Figure 3 for Depth-Consistent 3D Gaussian Splatting via Physical Defocus Modeling and Multi-View Geometric Supervision
Figure 4 for Depth-Consistent 3D Gaussian Splatting via Physical Defocus Modeling and Multi-View Geometric Supervision
Viaarxiv icon

STORM: Segment, Track, and Object Re-Localization from a Single 3D Model

Add code
Nov 12, 2025
Viaarxiv icon

Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos

Add code
Oct 24, 2025
Figure 1 for Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos
Figure 2 for Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos
Figure 3 for Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos
Figure 4 for Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos
Viaarxiv icon

MoGe-2: Accurate Monocular Geometry with Metric Scale and Sharp Details

Add code
Jul 03, 2025
Figure 1 for MoGe-2: Accurate Monocular Geometry with Metric Scale and Sharp Details
Figure 2 for MoGe-2: Accurate Monocular Geometry with Metric Scale and Sharp Details
Figure 3 for MoGe-2: Accurate Monocular Geometry with Metric Scale and Sharp Details
Figure 4 for MoGe-2: Accurate Monocular Geometry with Metric Scale and Sharp Details
Viaarxiv icon

Brain Imaging Foundation Models, Are We There Yet? A Systematic Review of Foundation Models for Brain Imaging and Biomedical Research

Add code
Jun 16, 2025
Viaarxiv icon

Cardiac Digital Twins at Scale from MRI: Open Tools and Representative Models from ~55000 UK Biobank Participants

Add code
May 27, 2025
Figure 1 for Cardiac Digital Twins at Scale from MRI: Open Tools and Representative Models from ~55000 UK Biobank Participants
Figure 2 for Cardiac Digital Twins at Scale from MRI: Open Tools and Representative Models from ~55000 UK Biobank Participants
Figure 3 for Cardiac Digital Twins at Scale from MRI: Open Tools and Representative Models from ~55000 UK Biobank Participants
Figure 4 for Cardiac Digital Twins at Scale from MRI: Open Tools and Representative Models from ~55000 UK Biobank Participants
Viaarxiv icon

ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks

Add code
Feb 07, 2025
Figure 1 for ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks
Figure 2 for ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks
Figure 3 for ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks
Figure 4 for ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks
Viaarxiv icon

MorphiNet: A Graph Subdivision Network for Adaptive Bi-ventricle Surface Reconstruction

Add code
Dec 14, 2024
Figure 1 for MorphiNet: A Graph Subdivision Network for Adaptive Bi-ventricle Surface Reconstruction
Figure 2 for MorphiNet: A Graph Subdivision Network for Adaptive Bi-ventricle Surface Reconstruction
Figure 3 for MorphiNet: A Graph Subdivision Network for Adaptive Bi-ventricle Surface Reconstruction
Figure 4 for MorphiNet: A Graph Subdivision Network for Adaptive Bi-ventricle Surface Reconstruction
Viaarxiv icon