Xiao Ma

ShotFinder: Imagination-Driven Open-Domain Video Shot Retrieval via Web Search

Jan 30, 2026

RePose: A Real-Time 3D Human Pose Estimation and Biomechanical Analysis Framework for Rehabilitation

Jan 02, 2026

GR-Dexter Technical Report

Dec 30, 2025

Diffusion priors enhanced velocity model building from time-lag images using a neural operator

Dec 29, 2025

WMPO: World Model-based Policy Optimization for Vision-Language-Action Models

Nov 12, 2025

An effective physics-informed neural operator framework for predicting wavefields

Jul 22, 2025

Flow-Based Policy for Online Reinforcement Learning

Jun 15, 2025

Simple Radiology VLLM Test-time Scaling with Thought Graph Traversal

Jun 13, 2025

Chain-of-Action: Trajectory Autoregressive Modeling for Robotic Manipulation

Jun 11, 2025

BridgeVLA: Input-Output Alignment for Efficient 3D Manipulation Learning with Vision-Language Models

Jun 09, 2025