Picture for Wentong Li

Wentong Li

VisionTrim: Unified Vision Token Compression for Training-Free MLLM Acceleration

Add code
Jan 30, 2026
Viaarxiv icon

Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems

Add code
Dec 30, 2025
Viaarxiv icon

Tailored Teaching with Balanced Difficulty: Elevating Reasoning in Multimodal Chain-of-Thought via Prompt Curriculum

Add code
Aug 26, 2025
Viaarxiv icon

EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?

Add code
Jun 05, 2025
Figure 1 for EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?
Figure 2 for EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?
Figure 3 for EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?
Figure 4 for EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?
Viaarxiv icon

PointLoRA: Low-Rank Adaptation with Token Selection for Point Cloud Learning

Add code
Apr 22, 2025
Figure 1 for PointLoRA: Low-Rank Adaptation with Token Selection for Point Cloud Learning
Figure 2 for PointLoRA: Low-Rank Adaptation with Token Selection for Point Cloud Learning
Figure 3 for PointLoRA: Low-Rank Adaptation with Token Selection for Point Cloud Learning
Figure 4 for PointLoRA: Low-Rank Adaptation with Token Selection for Point Cloud Learning
Viaarxiv icon

OrderChain: A General Prompting Paradigm to Improve Ordinal Understanding Ability of MLLM

Add code
Apr 07, 2025
Viaarxiv icon

Uncertainty-Instructed Structure Injection for Generalizable HD Map Construction

Add code
Mar 29, 2025
Viaarxiv icon

VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM

Add code
Jan 08, 2025
Viaarxiv icon

Scalable Autoregressive Monocular Depth Estimation

Add code
Nov 18, 2024
Figure 1 for Scalable Autoregressive Monocular Depth Estimation
Figure 2 for Scalable Autoregressive Monocular Depth Estimation
Figure 3 for Scalable Autoregressive Monocular Depth Estimation
Figure 4 for Scalable Autoregressive Monocular Depth Estimation
Viaarxiv icon

ReliOcc: Towards Reliable Semantic Occupancy Prediction via Uncertainty Learning

Add code
Sep 26, 2024
Figure 1 for ReliOcc: Towards Reliable Semantic Occupancy Prediction via Uncertainty Learning
Figure 2 for ReliOcc: Towards Reliable Semantic Occupancy Prediction via Uncertainty Learning
Figure 3 for ReliOcc: Towards Reliable Semantic Occupancy Prediction via Uncertainty Learning
Figure 4 for ReliOcc: Towards Reliable Semantic Occupancy Prediction via Uncertainty Learning
Viaarxiv icon