Minghui Wang

Hy-Embodied-0.5-VLA: From Vision-Language-Action Models to a Real-World Robot Learning Stack

Jun 12, 2026
Via arXiv

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

Apr 08, 2026
Via arXiv

Multi-Scale Deformable Transformers for Student Learning Behavior Detection in Smart Classroom

Oct 10, 2024
Via arXiv

SideRT: A Real-time Pure Transformer Architecture for Single Image Depth Estimation

Apr 29, 2022
Via arXiv