Mingyang Sun

Sophia: A Persistent Agent Framework of Artificial Life

Dec 20, 2025

VICTOR: Dataset Copyright Auditing in Video Recognition Systems

Dec 16, 2025

Executable Analytic Concepts as the Missing Link Between VLM Insight and Precise Manipulation

Oct 09, 2025

On-Device Training of PV Power Forecasting Models in a Smart Meter for Grid Edge Intelligence

Jul 09, 2025

PIG: Physically-based Multi-Material Interaction with 3D Gaussians

Jun 09, 2025

SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning

May 18, 2025

2DGS-Avatar: Animatable High-fidelity Clothed Avatar via 2D Gaussian Splatting

Mar 04, 2025

SrSv: Integrating Sequential Rollouts with Sequential Value Estimation for Multi-agent Reinforcement Learning

Mar 03, 2025

Score-Based Diffusion Policy Compatible with Reinforcement Learning via Optimal Transport

Feb 18, 2025

QUART-Online: Latency-Free Large Multimodal Language Model for Quadruped Robot Learning

Dec 23, 2024