Picture for Hao Lu

Hao Lu

SuperMap Software Co., Ltd

Occ-LLM: Enhancing Autonomous Driving with Occupancy-Based Large Language Models

Add code
Feb 10, 2025
Viaarxiv icon

Efficient Portrait Matte Creation With Layer Diffusion and Connectivity Priors

Add code
Jan 27, 2025
Viaarxiv icon

SeMi: When Imbalanced Semi-Supervised Learning Meets Mining Hard Examples

Add code
Jan 10, 2025
Viaarxiv icon

OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning

Add code
Dec 31, 2024
Figure 1 for OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning
Figure 2 for OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning
Figure 3 for OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning
Figure 4 for OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning
Viaarxiv icon

DrivingRecon: Large 4D Gaussian Reconstruction Model For Autonomous Driving

Add code
Dec 12, 2024
Figure 1 for DrivingRecon: Large 4D Gaussian Reconstruction Model For Autonomous Driving
Figure 2 for DrivingRecon: Large 4D Gaussian Reconstruction Model For Autonomous Driving
Figure 3 for DrivingRecon: Large 4D Gaussian Reconstruction Model For Autonomous Driving
Figure 4 for DrivingRecon: Large 4D Gaussian Reconstruction Model For Autonomous Driving
Viaarxiv icon

Motion Dreamer: Realizing Physically Coherent Video Generation through Scene-Aware Motion Reasoning

Add code
Nov 30, 2024
Figure 1 for Motion Dreamer: Realizing Physically Coherent Video Generation through Scene-Aware Motion Reasoning
Figure 2 for Motion Dreamer: Realizing Physically Coherent Video Generation through Scene-Aware Motion Reasoning
Figure 3 for Motion Dreamer: Realizing Physically Coherent Video Generation through Scene-Aware Motion Reasoning
Figure 4 for Motion Dreamer: Realizing Physically Coherent Video Generation through Scene-Aware Motion Reasoning
Viaarxiv icon

Bi-TTA: Bidirectional Test-Time Adapter for Remote Physiological Measurement

Add code
Sep 25, 2024
Viaarxiv icon

Training Matting Models without Alpha Labels

Add code
Aug 20, 2024
Figure 1 for Training Matting Models without Alpha Labels
Figure 2 for Training Matting Models without Alpha Labels
Figure 3 for Training Matting Models without Alpha Labels
Figure 4 for Training Matting Models without Alpha Labels
Viaarxiv icon

SCAPE: A Simple and Strong Category-Agnostic Pose Estimator

Add code
Jul 18, 2024
Viaarxiv icon

FADE: A Task-Agnostic Upsampling Operator for Encoder-Decoder Architectures

Add code
Jul 18, 2024
Viaarxiv icon