Picture for Ming Yang

Ming Yang

STDHL: Spatio-Temporal Dynamic Hypergraph Learning for Wind Power Forecasting

Add code
Dec 16, 2024
Viaarxiv icon

Cross-View Geo-Localization with Street-View and VHR Satellite Imagery in Decentrality Settings

Add code
Dec 16, 2024
Viaarxiv icon

MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation

Add code
Dec 08, 2024
Viaarxiv icon

Mimir: Improving Video Diffusion Models for Precise Text Understanding

Add code
Dec 04, 2024
Figure 1 for Mimir: Improving Video Diffusion Models for Precise Text Understanding
Figure 2 for Mimir: Improving Video Diffusion Models for Precise Text Understanding
Figure 3 for Mimir: Improving Video Diffusion Models for Precise Text Understanding
Figure 4 for Mimir: Improving Video Diffusion Models for Precise Text Understanding
Viaarxiv icon

Cross-Modal Visual Relocalization in Prior LiDAR Maps Utilizing Intensity Textures

Add code
Dec 02, 2024
Figure 1 for Cross-Modal Visual Relocalization in Prior LiDAR Maps Utilizing Intensity Textures
Figure 2 for Cross-Modal Visual Relocalization in Prior LiDAR Maps Utilizing Intensity Textures
Figure 3 for Cross-Modal Visual Relocalization in Prior LiDAR Maps Utilizing Intensity Textures
Figure 4 for Cross-Modal Visual Relocalization in Prior LiDAR Maps Utilizing Intensity Textures
Viaarxiv icon

Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis

Add code
Nov 29, 2024
Viaarxiv icon

LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis

Add code
Nov 29, 2024
Figure 1 for LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis
Figure 2 for LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis
Figure 3 for LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis
Figure 4 for LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis
Viaarxiv icon

Panther: Illuminate the Sight of Multimodal LLMs with Instruction-Guided Visual Prompts

Add code
Nov 22, 2024
Viaarxiv icon

DynFocus: Dynamic Cooperative Network Empowers LLMs with Video Understanding

Add code
Nov 19, 2024
Viaarxiv icon

Try-On-Adapter: A Simple and Flexible Try-On Paradigm

Add code
Nov 15, 2024
Figure 1 for Try-On-Adapter: A Simple and Flexible Try-On Paradigm
Figure 2 for Try-On-Adapter: A Simple and Flexible Try-On Paradigm
Figure 3 for Try-On-Adapter: A Simple and Flexible Try-On Paradigm
Figure 4 for Try-On-Adapter: A Simple and Flexible Try-On Paradigm
Viaarxiv icon