Picture for Guangming Shi

Guangming Shi

Are VLMs Lost Between Sky and Space? LinkS$^2$Bench for UAV-Satellite Dynamic Cross-View Spatial Intelligence

Add code
Apr 02, 2026
Viaarxiv icon

A3R: Agentic Affordance Reasoning via Cross-Dimensional Evidence in 3D Gaussian Scenes

Add code
Apr 02, 2026
Viaarxiv icon

Scaling Dense Event-Stream Pretraining from Visual Foundation Models

Add code
Mar 04, 2026
Viaarxiv icon

On the Rate-Distortion-Complexity Tradeoff for Semantic Communication

Add code
Feb 16, 2026
Viaarxiv icon

SANet: A Semantic-aware Agentic AI Networking Framework for Cross-layer Optimization in 6G

Add code
Dec 27, 2025
Viaarxiv icon

SITP: A High-Reliability Semantic Information Transport Protocol Without Retransmission for Semantic Communication

Add code
Dec 10, 2025
Viaarxiv icon

NOC4SC: A Bandwidth-Efficient Multi-User Semantic Communication Framework for Interference-Resilient Transmission

Add code
Dec 10, 2025
Figure 1 for NOC4SC: A Bandwidth-Efficient Multi-User Semantic Communication Framework for Interference-Resilient Transmission
Figure 2 for NOC4SC: A Bandwidth-Efficient Multi-User Semantic Communication Framework for Interference-Resilient Transmission
Figure 3 for NOC4SC: A Bandwidth-Efficient Multi-User Semantic Communication Framework for Interference-Resilient Transmission
Figure 4 for NOC4SC: A Bandwidth-Efficient Multi-User Semantic Communication Framework for Interference-Resilient Transmission
Viaarxiv icon

Parse Graph-Based Visual-Language Interaction for Human Pose Estimation

Add code
Sep 09, 2025
Figure 1 for Parse Graph-Based Visual-Language Interaction for Human Pose Estimation
Figure 2 for Parse Graph-Based Visual-Language Interaction for Human Pose Estimation
Figure 3 for Parse Graph-Based Visual-Language Interaction for Human Pose Estimation
Figure 4 for Parse Graph-Based Visual-Language Interaction for Human Pose Estimation
Viaarxiv icon

Optimizing Multi-Modal Trackers via Sensitivity-aware Regularized Tuning

Add code
Aug 24, 2025
Figure 1 for Optimizing Multi-Modal Trackers via Sensitivity-aware Regularized Tuning
Figure 2 for Optimizing Multi-Modal Trackers via Sensitivity-aware Regularized Tuning
Figure 3 for Optimizing Multi-Modal Trackers via Sensitivity-aware Regularized Tuning
Figure 4 for Optimizing Multi-Modal Trackers via Sensitivity-aware Regularized Tuning
Viaarxiv icon

SeqAffordSplat: Scene-level Sequential Affordance Reasoning on 3D Gaussian Splatting

Add code
Jul 31, 2025
Viaarxiv icon