
Hao Lu

SuperMap Software Co., Ltd

CTIS-QA: Clinical Template-Informed Slide-level Question Answering for Pathology

Jan 05, 2026
Via arXiv

Crowded Video Individual Counting Informed by Social Grouping and Spatial-Temporal Displacement Priors

Jan 03, 2026
Via arXiv

FitControler: Toward Fit-Aware Virtual Try-On

Dec 30, 2025
Via arXiv

UniUGP: Unifying Understanding, Generation, and Planing For End-to-end Autonomous Driving

Dec 10, 2025
Via arXiv

Observability Analysis and Composite Disturbance Filtering for a Bar Tethered to Dual UAVs Subject to Multi-source Disturbances

Dec 10, 2025
Via arXiv

FoMo4Wheat: Toward reliable crop vision foundation models with globally curated data

Sep 08, 2025
Via arXiv

ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Long Video Understanding

Aug 29, 2025
Via arXiv

First Place Solution to the MLCAS 2025 GWFSS Challenge: The Devil is in the Detail and Minority

Aug 24, 2025
Via arXiv

Video Individual Counting With Implicit One-to-Many Matching

Jun 16, 2025
Via arXiv

LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization

Jun 11, 2025
Via arXiv