Picture for Yitao Liang

Yitao Liang

JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse

Add code
Mar 20, 2025
Viaarxiv icon

A Neural Symbolic Model for Space Physics

Add code
Mar 11, 2025
Viaarxiv icon

ROCKET-2: Steering Visuomotor Policy via Cross-View Goal Alignment

Add code
Mar 04, 2025
Viaarxiv icon

DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping

Add code
Feb 28, 2025
Viaarxiv icon

Tractable Transformers for Flexible Conditional Generation

Add code
Feb 11, 2025
Viaarxiv icon

TFG-Flow: Training-free Guidance in Multimodal Generative Flow

Add code
Jan 24, 2025
Figure 1 for TFG-Flow: Training-free Guidance in Multimodal Generative Flow
Figure 2 for TFG-Flow: Training-free Guidance in Multimodal Generative Flow
Figure 3 for TFG-Flow: Training-free Guidance in Multimodal Generative Flow
Figure 4 for TFG-Flow: Training-free Guidance in Multimodal Generative Flow
Viaarxiv icon

MineStudio: A Streamlined Package for Minecraft AI Agent Development

Add code
Dec 25, 2024
Viaarxiv icon

MinsStudio: A Streamlined Package for Minecraft AI Agent Development

Add code
Dec 24, 2024
Viaarxiv icon

Proposing and solving olympiad geometry with guided tree search

Add code
Dec 14, 2024
Viaarxiv icon

Optimizing Latent Goal by Learning from Trajectory Preference

Add code
Dec 03, 2024
Viaarxiv icon