Yuhang Yang

DisPose: Disentangling Pose Guidance for Controllable Human Image Animation

Dec 13, 2024
Via arXiv

GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding

Nov 29, 2024
Via arXiv

ResCLIP: Residual Attention for Training-free Dense Vision-language Inference

Nov 24, 2024
Via arXiv

Versatile Cataract Fundus Image Restoration Model Utilizing Unpaired Cataract and High-quality Images

Nov 19, 2024
Via arXiv

TableGPT2: A Large Multimodal Model with Tabular Data Integration

Nov 04, 2024
Via arXiv

The Dawn of Video Generation: Preliminary Explorations with SORA-like Models

Oct 07, 2024
Via arXiv

Grounding 3D Scene Affordance From Egocentric Interactions

Sep 29, 2024
Via arXiv

EgoChoir: Capturing 3D Human-Object Interaction Regions from Egocentric Views

May 22, 2024
Via arXiv

LEMON: Learning 3D Human-Object Interaction Relation from 2D Images

Dec 14, 2023
Via arXiv

Adapting OpenAI's Whisper for Speech Recognition on Code-Switch Mandarin-English SEAME and ASRU2019 Datasets

Nov 29, 2023
Via arXiv