Picture for Jianbing Shen

Jianbing Shen

ALOcc: Adaptive Lifting-based 3D Semantic Occupancy and Cost Volume-based Flow Prediction

Add code
Nov 12, 2024
Viaarxiv icon

Decoupling Fine Detail and Global Geometry for Compressed Depth Map Super-Resolution

Add code
Nov 05, 2024
Viaarxiv icon

Rethinking Visual Dependency in Long-Context Reasoning for Large Vision-Language Models

Add code
Oct 25, 2024
Figure 1 for Rethinking Visual Dependency in Long-Context Reasoning for Large Vision-Language Models
Figure 2 for Rethinking Visual Dependency in Long-Context Reasoning for Large Vision-Language Models
Figure 3 for Rethinking Visual Dependency in Long-Context Reasoning for Large Vision-Language Models
Figure 4 for Rethinking Visual Dependency in Long-Context Reasoning for Large Vision-Language Models
Viaarxiv icon

High-Precision Self-Supervised Monocular Depth Estimation with Rich-Resource Prior

Add code
Aug 01, 2024
Viaarxiv icon

RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception

Add code
Jul 15, 2024
Viaarxiv icon

AdaOcc: Adaptive Forward View Transformation and Flow Modeling for 3D Occupancy and Flow Prediction

Add code
Jul 01, 2024
Figure 1 for AdaOcc: Adaptive Forward View Transformation and Flow Modeling for 3D Occupancy and Flow Prediction
Figure 2 for AdaOcc: Adaptive Forward View Transformation and Flow Modeling for 3D Occupancy and Flow Prediction
Figure 3 for AdaOcc: Adaptive Forward View Transformation and Flow Modeling for 3D Occupancy and Flow Prediction
Figure 4 for AdaOcc: Adaptive Forward View Transformation and Flow Modeling for 3D Occupancy and Flow Prediction
Viaarxiv icon

Multi-threshold Deep Metric Learning for Facial Expression Recognition

Add code
Jun 24, 2024
Figure 1 for Multi-threshold Deep Metric Learning for Facial Expression Recognition
Figure 2 for Multi-threshold Deep Metric Learning for Facial Expression Recognition
Figure 3 for Multi-threshold Deep Metric Learning for Facial Expression Recognition
Figure 4 for Multi-threshold Deep Metric Learning for Facial Expression Recognition
Viaarxiv icon

Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving?

Add code
May 28, 2024
Viaarxiv icon

IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection

Add code
Mar 22, 2024
Viaarxiv icon

Visual In-Context Learning for Large Vision-Language Models

Add code
Feb 18, 2024
Viaarxiv icon