Picture for Kai Chen

Kai Chen

Tony

Towards Intelligible Human-Robot Interaction: An Active Inference Approach to Occluded Pedestrian Scenarios

Add code
Feb 26, 2026
Viaarxiv icon

Real-time Monocular 2D and 3D Perception of Endoluminal Scenes for Controlling Flexible Robotic Endoscopic Instruments

Add code
Feb 16, 2026
Viaarxiv icon

ScalSelect: Scalable Training-Free Multimodal Data Selection for Efficient Visual Instruction Tuning

Add code
Feb 12, 2026
Viaarxiv icon

DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning

Add code
Feb 11, 2026
Viaarxiv icon

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Add code
Feb 10, 2026
Viaarxiv icon

Next Concept Prediction in Discrete Latent Space Leads to Stronger Language Models

Add code
Feb 09, 2026
Viaarxiv icon

A$^2$-LLM: An End-to-end Conversational Audio Avatar Large Language Model

Add code
Feb 04, 2026
Viaarxiv icon

ALORE: Autonomous Large-Object Rearrangement with a Legged Manipulator

Add code
Feb 04, 2026
Viaarxiv icon

Fast and Safe Trajectory Optimization for Mobile Manipulators With Neural Configuration Space Distance Field

Add code
Jan 27, 2026
Viaarxiv icon

Innovator-VL: A Multimodal Large Language Model for Scientific Discovery

Add code
Jan 27, 2026
Viaarxiv icon