Picture for Zhidong Deng

Zhidong Deng

PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection

Add code
Oct 10, 2024
Figure 1 for PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection
Figure 2 for PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection
Figure 3 for PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection
Figure 4 for PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection
Viaarxiv icon

StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads

Add code
Sep 14, 2024
Figure 1 for StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads
Figure 2 for StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads
Figure 3 for StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads
Figure 4 for StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads
Viaarxiv icon

LLaVA-SG: Leveraging Scene Graphs as Visual Semantic Expression in Vision-Language Models

Add code
Aug 30, 2024
Viaarxiv icon

Video-CCAM: Enhancing Video-Language Understanding with Causal Cross-Attention Masks for Short and Long Videos

Add code
Aug 26, 2024
Viaarxiv icon

Unifying 3D Vision-Language Understanding via Promptable Queries

Add code
May 19, 2024
Figure 1 for Unifying 3D Vision-Language Understanding via Promptable Queries
Figure 2 for Unifying 3D Vision-Language Understanding via Promptable Queries
Figure 3 for Unifying 3D Vision-Language Understanding via Promptable Queries
Figure 4 for Unifying 3D Vision-Language Understanding via Promptable Queries
Viaarxiv icon

Improving Detection in Aerial Images by Capturing Inter-Object Relationships

Add code
Apr 05, 2024
Figure 1 for Improving Detection in Aerial Images by Capturing Inter-Object Relationships
Figure 2 for Improving Detection in Aerial Images by Capturing Inter-Object Relationships
Figure 3 for Improving Detection in Aerial Images by Capturing Inter-Object Relationships
Figure 4 for Improving Detection in Aerial Images by Capturing Inter-Object Relationships
Viaarxiv icon

DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

Add code
Dec 15, 2023
Viaarxiv icon

Feedback RoI Features Improve Aerial Object Detection

Add code
Nov 28, 2023
Viaarxiv icon

3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment

Add code
Aug 08, 2023
Viaarxiv icon

Improving Scene Graph Generation with Superpixel-Based Interaction Learning

Add code
Aug 04, 2023
Figure 1 for Improving Scene Graph Generation with Superpixel-Based Interaction Learning
Figure 2 for Improving Scene Graph Generation with Superpixel-Based Interaction Learning
Figure 3 for Improving Scene Graph Generation with Superpixel-Based Interaction Learning
Figure 4 for Improving Scene Graph Generation with Superpixel-Based Interaction Learning
Viaarxiv icon