Jianan Wang

LogicEnvGen: Task-Logic Driven Generation of Diverse Simulated Environments for Embodied AI

Jan 20, 2026
Via arXiv

CLAP: Contrastive Latent Action Pretraining for Learning Vision-Language-Action Models from Human Videos

Jan 07, 2026
Via arXiv

ERIENet: An Efficient RAW Image Enhancement Network under Low-Light Environment

Dec 17, 2025
Via arXiv

Mind to Hand: Purposeful Robotic Control via Embodied Reasoning

Dec 10, 2025
Via arXiv

SimTac: A Physics-Based Simulator for Vision-Based Tactile Sensing with Biomorphic Structures

Nov 14, 2025
Via arXiv

OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes

Oct 30, 2025
Via arXiv

SpaceVista: All-Scale Visual Spatial Reasoning from mm to km

Oct 10, 2025
Via arXiv

Context and Diversity Matter: The Emergence of In-Context Learning in World Models

Sep 26, 2025
Via arXiv

DSPv2: Improved Dense Policy for Effective and Generalizable Whole-body Mobile Manipulation

Sep 19, 2025
Via arXiv

Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer

Aug 12, 2025
Via arXiv