Picture for Xu Cao

Xu Cao

Medical Video Generation for Disease Progression Simulation

Add code
Nov 18, 2024
Viaarxiv icon

On-Board Vision-Language Models for Personalized Autonomous Vehicle Motion Control: System Design and Real-World Validation

Add code
Nov 17, 2024
Figure 1 for On-Board Vision-Language Models for Personalized Autonomous Vehicle Motion Control: System Design and Real-World Validation
Figure 2 for On-Board Vision-Language Models for Personalized Autonomous Vehicle Motion Control: System Design and Real-World Validation
Figure 3 for On-Board Vision-Language Models for Personalized Autonomous Vehicle Motion Control: System Design and Real-World Validation
Figure 4 for On-Board Vision-Language Models for Personalized Autonomous Vehicle Motion Control: System Design and Real-World Validation
Viaarxiv icon

AnyECG: Foundational Models for Electrocardiogram Analysis

Add code
Nov 17, 2024
Viaarxiv icon

MpoxVLM: A Vision-Language Model for Diagnosing Skin Lesions from Mpox Virus Infection

Add code
Nov 16, 2024
Viaarxiv icon

TrialBench: Multi-Modal Artificial Intelligence-Ready Clinical Trial Datasets

Add code
Jun 30, 2024
Figure 1 for TrialBench: Multi-Modal Artificial Intelligence-Ready Clinical Trial Datasets
Figure 2 for TrialBench: Multi-Modal Artificial Intelligence-Ready Clinical Trial Datasets
Figure 3 for TrialBench: Multi-Modal Artificial Intelligence-Ready Clinical Trial Datasets
Figure 4 for TrialBench: Multi-Modal Artificial Intelligence-Ready Clinical Trial Datasets
Viaarxiv icon

MM-SpuBench: Towards Better Understanding of Spurious Biases in Multimodal LLMs

Add code
Jun 24, 2024
Viaarxiv icon

What is the Visual Cognition Gap between Humans and Multimodal LLMs?

Add code
Jun 14, 2024
Viaarxiv icon

The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition

Add code
May 14, 2024
Figure 1 for The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition
Figure 2 for The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition
Figure 3 for The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition
Figure 4 for The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition
Viaarxiv icon

Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting

Add code
Apr 10, 2024
Figure 1 for Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting
Figure 2 for Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting
Figure 3 for Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting
Figure 4 for Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting
Viaarxiv icon

Spurious Correlations in Machine Learning: A Survey

Add code
Feb 20, 2024
Viaarxiv icon