Zhongxue Gan

VINGS-Mono: Visual-Inertial Gaussian Splatting Monocular SLAM in Large Scenes

Jan 14, 2025

UAV-DETR: Efficient End-to-End Object Detection for Unmanned Aerial Vehicle Imagery

Jan 03, 2025

Planning by Simulation: Motion Planning with Learning-based Parallel Scenario Prediction for Autonomous Driving

Nov 15, 2024

CTA-Net: A CNN-Transformer Aggregation Network for Improving Multi-Scale Feature Extraction

Oct 15, 2024

Learning Occlusion-aware Decision-making from Agent Interaction via Active Perception

Sep 26, 2024

HGS-Planner: Hierarchical Planning Framework for Active Scene Reconstruction Using 3D Gaussian Splatting

Sep 26, 2024

OccLLaMA: An Occupancy-Language-Action Generative World Model for Autonomous Driving

Sep 05, 2024

A Survey on Facial Expression Recognition of Static and Dynamic Emotions

Aug 28, 2024

ReplanVLM: Replanning Robotic Tasks with Visual Language Models

Jul 31, 2024

InsightSee: Advancing Multi-agent Vision-Language Models for Enhanced Visual Understanding

May 31, 2024