Picture for Zhidong Deng

Zhidong Deng

PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection

Add code
Oct 10, 2024
Figure 1 for PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection
Figure 2 for PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection
Figure 3 for PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection
Figure 4 for PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection
Viaarxiv icon

StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads

Add code
Sep 14, 2024
Figure 1 for StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads
Figure 2 for StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads
Figure 3 for StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads
Figure 4 for StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads
Viaarxiv icon

LLaVA-SG: Leveraging Scene Graphs as Visual Semantic Expression in Vision-Language Models

Add code
Aug 30, 2024
Viaarxiv icon

Video-CCAM: Enhancing Video-Language Understanding with Causal Cross-Attention Masks for Short and Long Videos

Add code
Aug 26, 2024
Viaarxiv icon

Unifying 3D Vision-Language Understanding via Promptable Queries

Add code
May 19, 2024
Figure 1 for Unifying 3D Vision-Language Understanding via Promptable Queries
Figure 2 for Unifying 3D Vision-Language Understanding via Promptable Queries
Figure 3 for Unifying 3D Vision-Language Understanding via Promptable Queries
Figure 4 for Unifying 3D Vision-Language Understanding via Promptable Queries
Viaarxiv icon

Improving Detection in Aerial Images by Capturing Inter-Object Relationships

Add code
Apr 05, 2024
Figure 1 for Improving Detection in Aerial Images by Capturing Inter-Object Relationships
Figure 2 for Improving Detection in Aerial Images by Capturing Inter-Object Relationships
Figure 3 for Improving Detection in Aerial Images by Capturing Inter-Object Relationships
Figure 4 for Improving Detection in Aerial Images by Capturing Inter-Object Relationships
Viaarxiv icon

DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

Add code
Dec 15, 2023
Viaarxiv icon

Feedback RoI Features Improve Aerial Object Detection

Add code
Nov 28, 2023
Figure 1 for Feedback RoI Features Improve Aerial Object Detection
Figure 2 for Feedback RoI Features Improve Aerial Object Detection
Figure 3 for Feedback RoI Features Improve Aerial Object Detection
Figure 4 for Feedback RoI Features Improve Aerial Object Detection
Viaarxiv icon

3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment

Add code
Aug 08, 2023
Viaarxiv icon

Improving Scene Graph Generation with Superpixel-Based Interaction Learning

Add code
Aug 04, 2023
Figure 1 for Improving Scene Graph Generation with Superpixel-Based Interaction Learning
Figure 2 for Improving Scene Graph Generation with Superpixel-Based Interaction Learning
Figure 3 for Improving Scene Graph Generation with Superpixel-Based Interaction Learning
Figure 4 for Improving Scene Graph Generation with Superpixel-Based Interaction Learning
Viaarxiv icon