Picture for Xuming He

Xuming He

GUI-Rise: Structured Reasoning and History Summarization for GUI Navigation

Add code
Oct 31, 2025
Viaarxiv icon

Incremental Human-Object Interaction Detection with Invariant Relation Representation Learning

Add code
Oct 30, 2025
Viaarxiv icon

Pack and Force Your Memory: Long-form and Consistent Video Generation

Add code
Oct 02, 2025
Viaarxiv icon

RadarQA: Multi-modal Quality Analysis of Weather Radar Forecasts

Add code
Aug 17, 2025
Figure 1 for RadarQA: Multi-modal Quality Analysis of Weather Radar Forecasts
Figure 2 for RadarQA: Multi-modal Quality Analysis of Weather Radar Forecasts
Figure 3 for RadarQA: Multi-modal Quality Analysis of Weather Radar Forecasts
Figure 4 for RadarQA: Multi-modal Quality Analysis of Weather Radar Forecasts
Viaarxiv icon

Non-Asymptotic Analysis of Online Local Private Learning with SGD

Add code
Jul 09, 2025
Figure 1 for Non-Asymptotic Analysis of Online Local Private Learning with SGD
Figure 2 for Non-Asymptotic Analysis of Online Local Private Learning with SGD
Figure 3 for Non-Asymptotic Analysis of Online Local Private Learning with SGD
Figure 4 for Non-Asymptotic Analysis of Online Local Private Learning with SGD
Viaarxiv icon

Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning

Add code
Jun 12, 2025
Viaarxiv icon

OmniEarth-Bench: Towards Holistic Evaluation of Earth's Six Spheres and Cross-Spheres Interactions with Multimodal Observational Earth Data

Add code
May 29, 2025
Viaarxiv icon

Freeze and Cluster: A Simple Baseline for Rehearsal-Free Continual Category Discovery

Add code
Mar 12, 2025
Viaarxiv icon

Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation

Add code
Dec 26, 2024
Figure 1 for Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation
Figure 2 for Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation
Figure 3 for Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation
Figure 4 for Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation
Viaarxiv icon

FastGrasp: Efficient Grasp Synthesis with Diffusion

Add code
Nov 22, 2024
Figure 1 for FastGrasp: Efficient Grasp Synthesis with Diffusion
Figure 2 for FastGrasp: Efficient Grasp Synthesis with Diffusion
Figure 3 for FastGrasp: Efficient Grasp Synthesis with Diffusion
Figure 4 for FastGrasp: Efficient Grasp Synthesis with Diffusion
Viaarxiv icon