Picture for Shihao Wang

Shihao Wang

InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction

Add code
Mar 26, 2025
Viaarxiv icon

Hydra-NeXt: Robust Closed-Loop Driving with Open-Loop Training

Add code
Mar 15, 2025
Viaarxiv icon

L3TC: Leveraging RWKV for Learned Lossless Low-Complexity Text Compression

Add code
Dec 24, 2024
Viaarxiv icon

Transformer-based toxin-protein interaction analysis prioritizes airborne particulate matter components with potential adverse health effects

Add code
Dec 21, 2024
Viaarxiv icon

StreamChat: Chatting with Streaming Video

Add code
Dec 11, 2024
Viaarxiv icon

Beyond Feature Mapping GAP: Integrating Real HDRTV Priors for Superior SDRTV-to-HDRTV Conversion

Add code
Nov 16, 2024
Viaarxiv icon

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Add code
Aug 28, 2024
Figure 1 for Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
Figure 2 for Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
Figure 3 for Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
Figure 4 for Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
Viaarxiv icon

Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation

Add code
Jun 11, 2024
Figure 1 for Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation
Figure 2 for Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation
Figure 3 for Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation
Figure 4 for Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation
Viaarxiv icon

OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning

Add code
May 02, 2024
Figure 1 for OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning
Figure 2 for OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning
Figure 3 for OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning
Figure 4 for OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning
Viaarxiv icon

Far3D: Expanding the Horizon for Surround-view 3D Object Detection

Add code
Aug 18, 2023
Figure 1 for Far3D: Expanding the Horizon for Surround-view 3D Object Detection
Figure 2 for Far3D: Expanding the Horizon for Surround-view 3D Object Detection
Figure 3 for Far3D: Expanding the Horizon for Surround-view 3D Object Detection
Figure 4 for Far3D: Expanding the Horizon for Surround-view 3D Object Detection
Viaarxiv icon