Picture for Song Guo

Song Guo

TTVS: Boosting Self-Exploring Reinforcement Learning via Test-time Variational Synthesis

Add code
Apr 09, 2026
Viaarxiv icon

Generative 3D Gaussian Splatting for Arbitrary-ResolutionAtmospheric Downscaling and Forecasting

Add code
Apr 09, 2026
Viaarxiv icon

VisionCreator-R1: A Reflection-Enhanced Native Visual-Generation Agentic Model

Add code
Mar 09, 2026
Viaarxiv icon

VisionCreator: A Native Visual-Generation Agentic Model with Understanding, Thinking, Planning and Creation

Add code
Mar 03, 2026
Viaarxiv icon

DualSentinel: A Lightweight Framework for Detecting Targeted Attacks in Black-box LLM via Dual Entropy Lull Pattern

Add code
Mar 02, 2026
Viaarxiv icon

UNICBench: UNIfied Counting Benchmark for MLLM

Add code
Feb 28, 2026
Viaarxiv icon

Mesh-Pro: Asynchronous Advantage-guided Ranking Preference Optimization for Artist-style Quadrilateral Mesh Generation

Add code
Feb 28, 2026
Viaarxiv icon

HALO: A Unified Vision-Language-Action Model for Embodied Multimodal Chain-of-Thought Reasoning

Add code
Feb 24, 2026
Viaarxiv icon

EMFormer: Efficient Multi-Scale Transformer for Accumulative Context Weather Forecasting

Add code
Feb 01, 2026
Viaarxiv icon

WMPO: World Model-based Policy Optimization for Vision-Language-Action Models

Add code
Nov 12, 2025
Figure 1 for WMPO: World Model-based Policy Optimization for Vision-Language-Action Models
Figure 2 for WMPO: World Model-based Policy Optimization for Vision-Language-Action Models
Figure 3 for WMPO: World Model-based Policy Optimization for Vision-Language-Action Models
Figure 4 for WMPO: World Model-based Policy Optimization for Vision-Language-Action Models
Viaarxiv icon