Picture for Ming-Hsuan Yang

Ming-Hsuan Yang

Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model

Add code
Apr 08, 2025
Viaarxiv icon

Text Speaks Louder than Vision: ASCII Art Reveals Textual Biases in Vision-Language Models

Add code
Apr 02, 2025
Viaarxiv icon

4th PVUW MeViS 3rd Place Report: Sa2VA

Add code
Apr 01, 2025
Viaarxiv icon

Consistent Subject Generation via Contrastive Instantiated Concepts

Add code
Mar 31, 2025
Viaarxiv icon

MIRAGE: Multimodal Immersive Reasoning and Guided Exploration for Red-Team Jailbreak Attacks

Add code
Mar 24, 2025
Viaarxiv icon

Unified Dense Prediction of Video Diffusion

Add code
Mar 12, 2025
Viaarxiv icon

Controllable 3D Outdoor Scene Generation via Scene Graphs

Add code
Mar 10, 2025
Viaarxiv icon

LLM Post-Training: A Deep Dive into Reasoning Large Language Models

Add code
Feb 28, 2025
Viaarxiv icon

Synthesizing Consistent Novel Views via 3D Epipolar Attention without Re-Training

Add code
Feb 25, 2025
Viaarxiv icon

Optimizing Singular Spectrum for Large Language Model Compression

Add code
Feb 20, 2025
Viaarxiv icon