Picture for Shihao Wang

Shihao Wang

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Add code
Aug 28, 2024
Figure 1 for Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
Figure 2 for Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
Figure 3 for Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
Figure 4 for Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
Viaarxiv icon

Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation

Add code
Jun 11, 2024
Viaarxiv icon

OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning

Add code
May 02, 2024
Viaarxiv icon

Far3D: Expanding the Horizon for Surround-view 3D Object Detection

Add code
Aug 18, 2023
Figure 1 for Far3D: Expanding the Horizon for Surround-view 3D Object Detection
Figure 2 for Far3D: Expanding the Horizon for Surround-view 3D Object Detection
Figure 3 for Far3D: Expanding the Horizon for Surround-view 3D Object Detection
Figure 4 for Far3D: Expanding the Horizon for Surround-view 3D Object Detection
Viaarxiv icon

OmniDataComposer: A Unified Data Structure for Multimodal Data Fusion and Infinite Data Generation

Add code
Aug 17, 2023
Figure 1 for OmniDataComposer: A Unified Data Structure for Multimodal Data Fusion and Infinite Data Generation
Figure 2 for OmniDataComposer: A Unified Data Structure for Multimodal Data Fusion and Infinite Data Generation
Figure 3 for OmniDataComposer: A Unified Data Structure for Multimodal Data Fusion and Infinite Data Generation
Figure 4 for OmniDataComposer: A Unified Data Structure for Multimodal Data Fusion and Infinite Data Generation
Viaarxiv icon

Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection

Add code
Mar 21, 2023
Viaarxiv icon

Focal-PETR: Embracing Foreground for Efficient Multi-Camera 3D Object Detection

Add code
Dec 13, 2022
Viaarxiv icon

Automated Model Design and Benchmarking of 3D Deep Learning Models for COVID-19 Detection with Chest CT Scans

Add code
Feb 12, 2021
Figure 1 for Automated Model Design and Benchmarking of 3D Deep Learning Models for COVID-19 Detection with Chest CT Scans
Figure 2 for Automated Model Design and Benchmarking of 3D Deep Learning Models for COVID-19 Detection with Chest CT Scans
Figure 3 for Automated Model Design and Benchmarking of 3D Deep Learning Models for COVID-19 Detection with Chest CT Scans
Figure 4 for Automated Model Design and Benchmarking of 3D Deep Learning Models for COVID-19 Detection with Chest CT Scans
Viaarxiv icon

Efficient Multi-objective Evolutionary 3D Neural Architecture Search for COVID-19 Detection with Chest CT Scans

Add code
Jan 26, 2021
Figure 1 for Efficient Multi-objective Evolutionary 3D Neural Architecture Search for COVID-19 Detection with Chest CT Scans
Figure 2 for Efficient Multi-objective Evolutionary 3D Neural Architecture Search for COVID-19 Detection with Chest CT Scans
Figure 3 for Efficient Multi-objective Evolutionary 3D Neural Architecture Search for COVID-19 Detection with Chest CT Scans
Figure 4 for Efficient Multi-objective Evolutionary 3D Neural Architecture Search for COVID-19 Detection with Chest CT Scans
Viaarxiv icon

PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning

Add code
Jan 22, 2020
Figure 1 for PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning
Figure 2 for PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning
Figure 3 for PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning
Figure 4 for PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning
Viaarxiv icon