Zhen Yang

School of Communication and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing 2100023, China

HiMem-WAM: Hierarchical Memory-Gated World Action Models for Robotic Manipulation

Jun 09, 2026
Via arXiv

MemDreamer: Decoupling Perception and Reasoning for Long Video Understanding via Hierarchical Graph Memory and Agentic Retrieval Mechanism

Jun 05, 2026
Via arXiv

Streaming Communication in Multi-Agent Reasoning

Jun 03, 2026
Via arXiv

Gemini Embedding 2: A Native Multimodal Embedding Model from Gemini

May 26, 2026
Via arXiv

EnCAgg: Enhanced Clustering Aggregation for Robust Federated Learning against Dynamic Model Poisoning

May 21, 2026
Via arXiv

How Do LLMs and VLMs Understand Viewpoint Rotation Without Vision? An Interpretability Study

Apr 16, 2026
Via arXiv

Vision2Web: A Hierarchical Benchmark for Visual Website Development with Agent Verification

Mar 27, 2026
Via arXiv

AtomVLA: Scalable Post-Training for Robotic Manipulation via Predictive Latent World Models

Mar 09, 2026
Via arXiv

The Struggle Between Continuation and Refusal: A Mechanistic Analysis of the Continuation-Triggered Jailbreak in LLMs

Mar 09, 2026
Via arXiv

SiamGM: Siamese Geometry-Aware and Motion-Guided Network for Real-Time Satellite Video Object Tracking

Mar 08, 2026
Via arXiv