Picture for Yifan Xu

Yifan Xu

CoNav Chair: Design of a ROS-based Smart Wheelchair for Shared Control Navigation in the Built Environment

Add code
Jan 16, 2025
Figure 1 for CoNav Chair: Design of a ROS-based Smart Wheelchair for Shared Control Navigation in the Built Environment
Figure 2 for CoNav Chair: Design of a ROS-based Smart Wheelchair for Shared Control Navigation in the Built Environment
Figure 3 for CoNav Chair: Design of a ROS-based Smart Wheelchair for Shared Control Navigation in the Built Environment
Figure 4 for CoNav Chair: Design of a ROS-based Smart Wheelchair for Shared Control Navigation in the Built Environment
Viaarxiv icon

Seeing with Partial Certainty: Conformal Prediction for Robotic Scene Recognition in Built Environments

Add code
Jan 09, 2025
Viaarxiv icon

DynamicLip: Shape-Independent Continuous Authentication via Lip Articulator Dynamics

Add code
Jan 02, 2025
Viaarxiv icon

Fine-grained Video-Text Retrieval: A New Benchmark and Method

Add code
Dec 31, 2024
Viaarxiv icon

Cross-Task Inconsistency Based Active Learning (CTIAL) for Emotion Recognition

Add code
Dec 02, 2024
Viaarxiv icon

AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents

Add code
Oct 31, 2024
Figure 1 for AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents
Figure 2 for AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents
Figure 3 for AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents
Figure 4 for AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents
Viaarxiv icon

AutoGLM: Autonomous Foundation Agents for GUIs

Add code
Oct 28, 2024
Viaarxiv icon

Point2Graph: An End-to-end Point Cloud-based 3D Open-Vocabulary Scene Graph for Robot Navigation

Add code
Sep 16, 2024
Figure 1 for Point2Graph: An End-to-end Point Cloud-based 3D Open-Vocabulary Scene Graph for Robot Navigation
Figure 2 for Point2Graph: An End-to-end Point Cloud-based 3D Open-Vocabulary Scene Graph for Robot Navigation
Figure 3 for Point2Graph: An End-to-end Point Cloud-based 3D Open-Vocabulary Scene Graph for Robot Navigation
Figure 4 for Point2Graph: An End-to-end Point Cloud-based 3D Open-Vocabulary Scene Graph for Robot Navigation
Viaarxiv icon

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents

Add code
Aug 12, 2024
Figure 1 for VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
Figure 2 for VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
Figure 3 for VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
Figure 4 for VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
Viaarxiv icon

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

Add code
Jun 18, 2024
Figure 1 for ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools
Figure 2 for ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools
Figure 3 for ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools
Figure 4 for ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools
Viaarxiv icon