Picture for Yuhao Li

Yuhao Li

DriveLMM-o1: A Step-by-Step Reasoning Dataset and Large Multimodal Model for Driving Scenario Understanding

Add code
Mar 13, 2025
Viaarxiv icon

C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation

Add code
Feb 27, 2025
Viaarxiv icon

MiniMax-01: Scaling Foundation Models with Lightning Attention

Add code
Jan 14, 2025
Viaarxiv icon

LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs

Add code
Jan 10, 2025
Figure 1 for LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs
Figure 2 for LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs
Figure 3 for LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs
Figure 4 for LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs
Viaarxiv icon

SaliencyI2PLoc: saliency-guided image-point cloud localization using contrastive learning

Add code
Dec 20, 2024
Figure 1 for SaliencyI2PLoc: saliency-guided image-point cloud localization using contrastive learning
Figure 2 for SaliencyI2PLoc: saliency-guided image-point cloud localization using contrastive learning
Figure 3 for SaliencyI2PLoc: saliency-guided image-point cloud localization using contrastive learning
Figure 4 for SaliencyI2PLoc: saliency-guided image-point cloud localization using contrastive learning
Viaarxiv icon

Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer

Add code
Dec 12, 2024
Figure 1 for Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer
Figure 2 for Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer
Figure 3 for Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer
Figure 4 for Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer
Viaarxiv icon

Spin glass model of in-context learning

Add code
Aug 05, 2024
Viaarxiv icon

Multi-Granularity Language-Guided Multi-Object Tracking

Add code
Jun 07, 2024
Figure 1 for Multi-Granularity Language-Guided Multi-Object Tracking
Figure 2 for Multi-Granularity Language-Guided Multi-Object Tracking
Figure 3 for Multi-Granularity Language-Guided Multi-Object Tracking
Figure 4 for Multi-Granularity Language-Guided Multi-Object Tracking
Viaarxiv icon

CoFiI2P: Coarse-to-Fine Correspondences for Image-to-Point Cloud Registration

Add code
Sep 26, 2023
Figure 1 for CoFiI2P: Coarse-to-Fine Correspondences for Image-to-Point Cloud Registration
Figure 2 for CoFiI2P: Coarse-to-Fine Correspondences for Image-to-Point Cloud Registration
Figure 3 for CoFiI2P: Coarse-to-Fine Correspondences for Image-to-Point Cloud Registration
Figure 4 for CoFiI2P: Coarse-to-Fine Correspondences for Image-to-Point Cloud Registration
Viaarxiv icon

Learning to Manipulate a Commitment Optimizer

Add code
Feb 26, 2023
Viaarxiv icon