Picture for Xinlei Chen

Xinlei Chen

Enabling High-Frequency Cross-Modality Visual Positioning Service for Accurate Drone Landing

Add code
Oct 01, 2025
Viaarxiv icon

COMPASS: Confined-space Manipulation Planning with Active Sensing Strategy

Add code
Sep 18, 2025
Viaarxiv icon

MetaCLIP 2: A Worldwide Scaling Recipe

Add code
Jul 29, 2025
Figure 1 for MetaCLIP 2: A Worldwide Scaling Recipe
Figure 2 for MetaCLIP 2: A Worldwide Scaling Recipe
Figure 3 for MetaCLIP 2: A Worldwide Scaling Recipe
Figure 4 for MetaCLIP 2: A Worldwide Scaling Recipe
Viaarxiv icon

Context-Aware Sentiment Forecasting via LLM-based Multi-Perspective Role-Playing Agents

Add code
May 30, 2025
Viaarxiv icon

KEVER^2: Knowledge-Enhanced Visual Emotion Reasoning and Retrieval

Add code
May 30, 2025
Viaarxiv icon

Balanced Token Pruning: Accelerating Vision Language Models Beyond Local Optimization

Add code
May 28, 2025
Viaarxiv icon

What Can RL Bring to VLA Generalization? An Empirical Study

Add code
May 26, 2025
Viaarxiv icon

DIMM: Decoupled Multi-hierarchy Kalman Filter for 3D Object Tracking

Add code
May 18, 2025
Viaarxiv icon

CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global Memory

Add code
May 08, 2025
Viaarxiv icon

EDmamba: A Simple yet Effective Event Denoising Method with State Space Model

Add code
May 08, 2025
Viaarxiv icon