Picture for Ming Yang

Ming Yang

BMIP: Bi-directional Modality Interaction Prompt Learning for VLM

Add code
Jan 14, 2025
Viaarxiv icon

Referencing Where to Focus: Improving VisualGrounding with Referential Query

Add code
Dec 26, 2024
Viaarxiv icon

Cross-View Image Set Geo-Localization

Add code
Dec 25, 2024
Viaarxiv icon

GraphicsDreamer: Image to 3D Generation with Physical Consistency

Add code
Dec 18, 2024
Figure 1 for GraphicsDreamer: Image to 3D Generation with Physical Consistency
Figure 2 for GraphicsDreamer: Image to 3D Generation with Physical Consistency
Figure 3 for GraphicsDreamer: Image to 3D Generation with Physical Consistency
Figure 4 for GraphicsDreamer: Image to 3D Generation with Physical Consistency
Viaarxiv icon

Cross-View Geo-Localization with Street-View and VHR Satellite Imagery in Decentrality Settings

Add code
Dec 16, 2024
Viaarxiv icon

STDHL: Spatio-Temporal Dynamic Hypergraph Learning for Wind Power Forecasting

Add code
Dec 16, 2024
Viaarxiv icon

MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation

Add code
Dec 08, 2024
Viaarxiv icon

Mimir: Improving Video Diffusion Models for Precise Text Understanding

Add code
Dec 04, 2024
Figure 1 for Mimir: Improving Video Diffusion Models for Precise Text Understanding
Figure 2 for Mimir: Improving Video Diffusion Models for Precise Text Understanding
Figure 3 for Mimir: Improving Video Diffusion Models for Precise Text Understanding
Figure 4 for Mimir: Improving Video Diffusion Models for Precise Text Understanding
Viaarxiv icon

Cross-Modal Visual Relocalization in Prior LiDAR Maps Utilizing Intensity Textures

Add code
Dec 02, 2024
Figure 1 for Cross-Modal Visual Relocalization in Prior LiDAR Maps Utilizing Intensity Textures
Figure 2 for Cross-Modal Visual Relocalization in Prior LiDAR Maps Utilizing Intensity Textures
Figure 3 for Cross-Modal Visual Relocalization in Prior LiDAR Maps Utilizing Intensity Textures
Figure 4 for Cross-Modal Visual Relocalization in Prior LiDAR Maps Utilizing Intensity Textures
Viaarxiv icon

Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis

Add code
Nov 29, 2024
Viaarxiv icon