Jingyi Yu

CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image

Feb 18, 2025

BEAM: Bridging Physically-based Rendering and Gaussian Modeling for Relightable Volumetric Video

Feb 12, 2025

TANGLED: Generating 3D Hair Strands from Images with Arbitrary Styles and Viewpoints

Feb 10, 2025

UniDB: A Unified Diffusion Bridge Framework via Stochastic Optimal Control

Feb 09, 2025

LLaVA-SLT: Visual Language Tuning for Sign Language Translation

Dec 21, 2024

CADSpotting: Robust Panoptic Symbol Spotting on Large-Scale CAD Drawings

Dec 10, 2024

Unsupervised Multi-Parameter Inverse Solving for Reducing Ring Artifacts in 3D X-Ray CBCT

Dec 08, 2024

AffordDP: Generalizable Diffusion Policy with Transferable Affordance

Dec 04, 2024

NLPrompt: Noise-Label Prompt Learning for Vision-Language Models

Dec 02, 2024

SeqAfford: Sequential 3D Affordance Reasoning via Multimodal Large Language Model

Dec 02, 2024