Dingkang Liang

HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation

Jan 24, 2025

MINIMA: Modality Invariant Image Matching

Dec 27, 2024

Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud Learning

Oct 10, 2024

Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression

Sep 01, 2024

Mini-Monkey: Multi-Scale Adaptive Cropping for Multimodal Large Language Models

Aug 09, 2024

Mini-Monkey: Alleviate the Sawtooth Effect by Multi-Scale Adaptive Cropping

Aug 04, 2024

A Unified Framework for 3D Scene Understanding

Jul 03, 2024

SOOD++: Leveraging Unlabeled Data to Boost Oriented Object Detection

Jul 01, 2024

MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks

Jun 07, 2024

Anomaly Detection by Adapting a pre-trained Vision Language Model

Mar 14, 2024