Zehui Chen

LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance Representation

Nov 19, 2024
Via arXiv

MindSearch: Mimicking Human Minds Elicits Deep AI Searcher

Jul 29, 2024
Via arXiv

ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Jun 06, 2024
Via arXiv

Are We on the Right Way for Evaluating Large Vision-Language Models?

Apr 09, 2024
Via arXiv

InternLM2 Technical Report

Mar 26, 2024
Via arXiv

PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition

Mar 26, 2024
Via arXiv

Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection

Mar 25, 2024
Via arXiv

Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models

Mar 19, 2024
Via arXiv

A Vanilla Multi-Task Framework for Dense Visual Prediction Solution to 1st VCL Challenge -- Multi-Task Robustness Track

Feb 27, 2024
Via arXiv

Stream Query Denoising for Vectorized HD Map Construction

Jan 18, 2024
Via arXiv