Picture for Jiaolong Yang

Jiaolong Yang

UniGraspTransformer: Simplified Policy Distillation for Scalable Dexterous Robotic Grasping

Add code
Dec 03, 2024
Viaarxiv icon

Structured 3D Latents for Scalable and Versatile 3D Generation

Add code
Dec 02, 2024
Viaarxiv icon

CogACT: A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation

Add code
Nov 29, 2024
Viaarxiv icon

PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting

Add code
Oct 29, 2024
Viaarxiv icon

MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision

Add code
Oct 24, 2024
Viaarxiv icon

Seal: Advancing Speech Language Models to be Few-Shot Learners

Add code
Jul 20, 2024
Viaarxiv icon

RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models

Add code
Jul 11, 2024
Figure 1 for RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models
Figure 2 for RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models
Figure 3 for RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models
Figure 4 for RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models
Viaarxiv icon

VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time

Add code
Apr 16, 2024
Figure 1 for VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Figure 2 for VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Figure 3 for VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Figure 4 for VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Viaarxiv icon

GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling

Add code
Apr 05, 2024
Viaarxiv icon

Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors

Add code
Mar 18, 2024
Viaarxiv icon