Xin Jin

GCTAM: Global and Contextual Truncated Affinity Combined Maximization Model For Unsupervised Graph Anomaly Detection

Mar 02, 2026
Via arXiv

VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model

Feb 14, 2026
Via arXiv

WorldArena: A Unified Benchmark for Evaluating Perception and Functional Utility of Embodied World Models

Feb 09, 2026
Via arXiv

ARIS-RSMA Enhanced ISAC System: Joint Rate Splitting and Beamforming Design

Feb 06, 2026
Via arXiv

Compression Tells Intelligence: Visual Coding, Visual Token Technology, and the Unification

Jan 28, 2026
Via arXiv

ReWorld: Multi-Dimensional Reward Modeling for Embodied World Models

Jan 18, 2026
Via arXiv

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Jan 15, 2026
Via arXiv

TP-Blend: Textual-Prompt Attention Pairing for Precise Object-Style Blending in Diffusion Models

Jan 12, 2026
Via arXiv

Speak While Watching: Unleashing TRUE Real-Time Video Understanding Capability of Multimodal Large Language Models

Jan 11, 2026
Via arXiv

PvP: Data-Efficient Humanoid Robot Learning with Proprioceptive-Privileged Contrastive Representations

Dec 15, 2025
Via arXiv