Picture for Ming-Hsuan Yang

Ming-Hsuan Yang

Multi-subject Open-set Personalization in Video Generation

Add code
Jan 10, 2025
Viaarxiv icon

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Add code
Jan 07, 2025
Figure 1 for Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Figure 2 for Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Figure 3 for Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Figure 4 for Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Viaarxiv icon

VinT-6D: A Large-Scale Object-in-hand Dataset from Vision, Touch and Proprioception

Add code
Jan 06, 2025
Figure 1 for VinT-6D: A Large-Scale Object-in-hand Dataset from Vision, Touch and Proprioception
Figure 2 for VinT-6D: A Large-Scale Object-in-hand Dataset from Vision, Touch and Proprioception
Figure 3 for VinT-6D: A Large-Scale Object-in-hand Dataset from Vision, Touch and Proprioception
Figure 4 for VinT-6D: A Large-Scale Object-in-hand Dataset from Vision, Touch and Proprioception
Viaarxiv icon

FaceLift: Single Image to 3D Head with View Generation and GS-LRM

Add code
Dec 23, 2024
Viaarxiv icon

Move-in-2D: 2D-Conditioned Human Motion Generation

Add code
Dec 17, 2024
Figure 1 for Move-in-2D: 2D-Conditioned Human Motion Generation
Figure 2 for Move-in-2D: 2D-Conditioned Human Motion Generation
Figure 3 for Move-in-2D: 2D-Conditioned Human Motion Generation
Figure 4 for Move-in-2D: 2D-Conditioned Human Motion Generation
Viaarxiv icon

DynamicScaler: Seamless and Scalable Video Generation for Panoramic Scenes

Add code
Dec 15, 2024
Viaarxiv icon

Adapter-Enhanced Semantic Prompting for Continual Learning

Add code
Dec 15, 2024
Viaarxiv icon

Ranking-aware adapter for text-driven image ordering with CLIP

Add code
Dec 09, 2024
Viaarxiv icon

Hierarchical Information Flow for Generalized Efficient Image Restoration

Add code
Nov 27, 2024
Figure 1 for Hierarchical Information Flow for Generalized Efficient Image Restoration
Figure 2 for Hierarchical Information Flow for Generalized Efficient Image Restoration
Figure 3 for Hierarchical Information Flow for Generalized Efficient Image Restoration
Figure 4 for Hierarchical Information Flow for Generalized Efficient Image Restoration
Viaarxiv icon

HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior

Add code
Nov 27, 2024
Viaarxiv icon