Xu Cao

TrialBench: Multi-Modal Artificial Intelligence-Ready Clinical Trial Datasets

Jun 30, 2024
Via arXiv

MM-SpuBench: Towards Better Understanding of Spurious Biases in Multimodal LLMs

Jun 24, 2024
Via arXiv

What is the Visual Cognition Gap between Humans and Multimodal LLMs?

Jun 14, 2024
Via arXiv

The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition

May 14, 2024
Via arXiv

Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting

Apr 10, 2024
Via arXiv

Spurious Correlations in Machine Learning: A Survey

Feb 20, 2024
Via arXiv

Exploring the Impact of In-Browser Deep Learning Inference on Quality of User Experience and Performance

Feb 08, 2024
Via arXiv

If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents

Jan 08, 2024
Via arXiv

SuperNormal: Neural Surface Reconstruction via Multi-View Normal Integration

Dec 08, 2023
Via arXiv

LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language Model Programs

Dec 07, 2023
Via arXiv