Picture for Xi Li

Xi Li

Mark

SPIRIT: Adapting Vision Foundation Models for Unified Single- and Multi-Frame Infrared Small Target Detection

Add code
Feb 02, 2026
Viaarxiv icon

REL-SF4PASS: Panoramic Semantic Segmentation with REL Depth Representation and Spherical Fusion

Add code
Jan 23, 2026
Viaarxiv icon

MapViT: A Two-Stage ViT-Based Framework for Real-Time Radio Quality Map Prediction in Dynamic Environments

Add code
Jan 22, 2026
Viaarxiv icon

Air-Chamber Based Soft Six-Axis Force/Torque Sensor for Human-Robot Interaction

Add code
Nov 17, 2025
Figure 1 for Air-Chamber Based Soft Six-Axis Force/Torque Sensor for Human-Robot Interaction
Figure 2 for Air-Chamber Based Soft Six-Axis Force/Torque Sensor for Human-Robot Interaction
Figure 3 for Air-Chamber Based Soft Six-Axis Force/Torque Sensor for Human-Robot Interaction
Figure 4 for Air-Chamber Based Soft Six-Axis Force/Torque Sensor for Human-Robot Interaction
Viaarxiv icon

Innovative Design of Multi-functional Supernumerary Robotic Limbs with Ellipsoid Workspace Optimization

Add code
Nov 15, 2025
Figure 1 for Innovative Design of Multi-functional Supernumerary Robotic Limbs with Ellipsoid Workspace Optimization
Figure 2 for Innovative Design of Multi-functional Supernumerary Robotic Limbs with Ellipsoid Workspace Optimization
Figure 3 for Innovative Design of Multi-functional Supernumerary Robotic Limbs with Ellipsoid Workspace Optimization
Figure 4 for Innovative Design of Multi-functional Supernumerary Robotic Limbs with Ellipsoid Workspace Optimization
Viaarxiv icon

DiscoX: Benchmarking Discourse-Level Translation task in Expert Domains

Add code
Nov 14, 2025
Figure 1 for DiscoX: Benchmarking Discourse-Level Translation task in Expert Domains
Figure 2 for DiscoX: Benchmarking Discourse-Level Translation task in Expert Domains
Figure 3 for DiscoX: Benchmarking Discourse-Level Translation task in Expert Domains
Figure 4 for DiscoX: Benchmarking Discourse-Level Translation task in Expert Domains
Viaarxiv icon

End to End AI System for Surgical Gesture Sequence Recognition and Clinical Outcome Prediction

Add code
Nov 14, 2025
Viaarxiv icon

MultiCrafter: High-Fidelity Multi-Subject Generation via Spatially Disentangled Attention and Identity-Aware Reinforcement Learning

Add code
Sep 26, 2025
Viaarxiv icon

IGFuse: Interactive 3D Gaussian Scene Reconstruction via Multi-Scans Fusion

Add code
Aug 18, 2025
Figure 1 for IGFuse: Interactive 3D Gaussian Scene Reconstruction via Multi-Scans Fusion
Figure 2 for IGFuse: Interactive 3D Gaussian Scene Reconstruction via Multi-Scans Fusion
Figure 3 for IGFuse: Interactive 3D Gaussian Scene Reconstruction via Multi-Scans Fusion
Figure 4 for IGFuse: Interactive 3D Gaussian Scene Reconstruction via Multi-Scans Fusion
Viaarxiv icon

Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Object Detection

Add code
Jul 23, 2025
Figure 1 for Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Object Detection
Figure 2 for Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Object Detection
Figure 3 for Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Object Detection
Figure 4 for Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Object Detection
Viaarxiv icon