Picture for Xin Jin

Xin Jin

Bridging Past and Future: End-to-End Autonomous Driving with Historical Prediction and Planning

Add code
Mar 18, 2025
Viaarxiv icon

UniMamba: Unified Spatial-Channel Representation Learning with Group-Efficient Mamba for LiDAR-based 3D Object Detection

Add code
Mar 15, 2025
Viaarxiv icon

Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning

Add code
Mar 11, 2025
Viaarxiv icon

Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practices

Add code
Mar 08, 2025
Viaarxiv icon

ULTHO: Ultra-Lightweight yet Efficient Hyperparameter Optimization in Deep Reinforcement Learning

Add code
Mar 08, 2025
Viaarxiv icon

Unified Arbitrary-Time Video Frame Interpolation and Prediction

Add code
Mar 04, 2025
Viaarxiv icon

Exploring Simple Siamese Network for High-Resolution Video Quality Assessment

Add code
Mar 04, 2025
Viaarxiv icon

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Add code
Mar 03, 2025
Viaarxiv icon

SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation

Add code
Feb 18, 2025
Viaarxiv icon

TINQ: Temporal Inconsistency Guided Blind Video Quality Assessment

Add code
Dec 25, 2024
Viaarxiv icon