Picture for Yu-Chiang Frank Wang

Yu-Chiang Frank Wang

VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models

Add code
Mar 27, 2025
Viaarxiv icon

Segment Anything, Even Occluded

Add code
Mar 08, 2025
Viaarxiv icon

Plan2Align: Predictive Planning Based Test-Time Preference Alignment in Paragraph-Level Machine Translation

Add code
Feb 28, 2025
Viaarxiv icon

Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration

Add code
Feb 23, 2025
Viaarxiv icon

MotionMatcher: Motion Customization of Text-to-Video Diffusion Models via Motion Feature Matching

Add code
Feb 18, 2025
Viaarxiv icon

3D Gaussian Inpainting with Depth-Guided Cross-View Consistency

Add code
Feb 17, 2025
Viaarxiv icon

Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation

Add code
Feb 04, 2025
Viaarxiv icon

Towards Affordance-Aware Articulation Synthesis for Rigged Objects

Add code
Jan 21, 2025
Viaarxiv icon

Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks

Add code
Jan 14, 2025
Viaarxiv icon

Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits

Add code
Jan 07, 2025
Figure 1 for Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits
Figure 2 for Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits
Figure 3 for Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits
Figure 4 for Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits
Viaarxiv icon