Picture for Jie Li

Jie Li

University of Science and Technology of China, AnyWit Robotics Co., Ltd

Audio-Cogito: Towards Deep Audio Reasoning in Large Audio Language Models

Add code
Apr 14, 2026
Viaarxiv icon

HECTOR: Human-centric Hierarchical Coordination and Supervision of Robotic Fleets under Continual Temporal Tasks

Add code
Apr 13, 2026
Viaarxiv icon

Hitem3D 2.0: Multi-View Guided Native 3D Texture Generation

Add code
Apr 10, 2026
Viaarxiv icon

ViVa: A Video-Generative Value Model for Robot Reinforcement Learning

Add code
Apr 09, 2026
Viaarxiv icon

MECO: A Multimodal Dataset for Emotion and Cognitive Understanding in Older Adults

Add code
Apr 03, 2026
Viaarxiv icon

GigaWorld-Policy: An Efficient Action-Centered World--Action Model

Add code
Mar 18, 2026
Viaarxiv icon

KGS-GCN: Enhancing Sparse Skeleton Sensing via Kinematics-Driven Gaussian Splatting and Probabilistic Topology for Action Recognition

Add code
Mar 16, 2026
Viaarxiv icon

FIND: A Simple yet Effective Baseline for Diffusion-Generated Image Detection

Add code
Mar 15, 2026
Viaarxiv icon

GIAT: A Geologically-Informed Attention Transformer for Lithology Identification

Add code
Mar 10, 2026
Viaarxiv icon

VisNec: Measuring and Leveraging Visual Necessity for Multimodal Instruction Tuning

Add code
Mar 01, 2026
Viaarxiv icon