Jingdong Chen

Cross-View Image Set Geo-Localization

Dec 25, 2024
Via arXiv

GraphicsDreamer: Image to 3D Generation with Physical Consistency

Dec 18, 2024
Via arXiv

Cross-View Geo-Localization with Street-View and VHR Satellite Imagery in Decentrality Settings

Dec 16, 2024
Via arXiv

MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation

Dec 08, 2024
Via arXiv

Mimir: Improving Video Diffusion Models for Precise Text Understanding

Dec 04, 2024
Via arXiv

Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis

Nov 29, 2024
Via arXiv

LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis

Nov 29, 2024
Via arXiv

Panther: Illuminate the Sight of Multimodal LLMs with Instruction-Guided Visual Prompts

Nov 22, 2024
Via arXiv

Try-On-Adapter: A Simple and Flexible Try-On Paradigm

Nov 15, 2024
Via arXiv

HomoMatcher: Dense Feature Matching Results with Semi-Dense Efficiency by Homography Estimation

Nov 11, 2024
Via arXiv