Picture for Jingdong Chen

Jingdong Chen

Advances in Microphone Array Processing and Multichannel Speech Enhancement

Add code
Feb 13, 2025
Viaarxiv icon

PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection

Add code
Jan 23, 2025
Figure 1 for PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection
Figure 2 for PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection
Figure 3 for PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection
Figure 4 for PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection
Viaarxiv icon

Cross-View Image Set Geo-Localization

Add code
Dec 25, 2024
Viaarxiv icon

GraphicsDreamer: Image to 3D Generation with Physical Consistency

Add code
Dec 18, 2024
Figure 1 for GraphicsDreamer: Image to 3D Generation with Physical Consistency
Figure 2 for GraphicsDreamer: Image to 3D Generation with Physical Consistency
Figure 3 for GraphicsDreamer: Image to 3D Generation with Physical Consistency
Figure 4 for GraphicsDreamer: Image to 3D Generation with Physical Consistency
Viaarxiv icon

Cross-View Geo-Localization with Street-View and VHR Satellite Imagery in Decentrality Settings

Add code
Dec 16, 2024
Viaarxiv icon

MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation

Add code
Dec 08, 2024
Viaarxiv icon

Mimir: Improving Video Diffusion Models for Precise Text Understanding

Add code
Dec 04, 2024
Figure 1 for Mimir: Improving Video Diffusion Models for Precise Text Understanding
Figure 2 for Mimir: Improving Video Diffusion Models for Precise Text Understanding
Figure 3 for Mimir: Improving Video Diffusion Models for Precise Text Understanding
Figure 4 for Mimir: Improving Video Diffusion Models for Precise Text Understanding
Viaarxiv icon

Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis

Add code
Nov 29, 2024
Viaarxiv icon

LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis

Add code
Nov 29, 2024
Figure 1 for LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis
Figure 2 for LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis
Figure 3 for LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis
Figure 4 for LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis
Viaarxiv icon

Panther: Illuminate the Sight of Multimodal LLMs with Instruction-Guided Visual Prompts

Add code
Nov 22, 2024
Figure 1 for Panther: Illuminate the Sight of Multimodal LLMs with Instruction-Guided Visual Prompts
Figure 2 for Panther: Illuminate the Sight of Multimodal LLMs with Instruction-Guided Visual Prompts
Figure 3 for Panther: Illuminate the Sight of Multimodal LLMs with Instruction-Guided Visual Prompts
Figure 4 for Panther: Illuminate the Sight of Multimodal LLMs with Instruction-Guided Visual Prompts
Viaarxiv icon