Picture for Tsun-Hsuan Wang

Tsun-Hsuan Wang

Embodied Red Teaming for Auditing Robotic Foundation Models

Add code
Nov 27, 2024
Viaarxiv icon

UBSoft: A Simulation Platform for Robotic Skill Learning in Unbounded Soft Environments

Add code
Nov 19, 2024
Viaarxiv icon

Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D Inpainting

Add code
Nov 14, 2024
Viaarxiv icon

Flex: End-to-End Text-Instructed Visual Navigation with Foundation Models

Add code
Oct 16, 2024
Figure 1 for Flex: End-to-End Text-Instructed Visual Navigation with Foundation Models
Figure 2 for Flex: End-to-End Text-Instructed Visual Navigation with Foundation Models
Figure 3 for Flex: End-to-End Text-Instructed Visual Navigation with Foundation Models
Figure 4 for Flex: End-to-End Text-Instructed Visual Navigation with Foundation Models
Viaarxiv icon

Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference

Add code
Sep 16, 2024
Figure 1 for Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference
Figure 2 for Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference
Figure 3 for Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference
Figure 4 for Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference
Viaarxiv icon

ABNet: Attention BarrierNet for Safe and Scalable Robot Learning

Add code
Jun 18, 2024
Viaarxiv icon

Text-to-Drive: Diverse Driving Behavior Synthesis via Large Language Models

Add code
Jun 06, 2024
Viaarxiv icon

LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery

Add code
May 16, 2024
Viaarxiv icon

Probing Multimodal LLMs as World Models for Driving

Add code
May 09, 2024
Figure 1 for Probing Multimodal LLMs as World Models for Driving
Figure 2 for Probing Multimodal LLMs as World Models for Driving
Figure 3 for Probing Multimodal LLMs as World Models for Driving
Figure 4 for Probing Multimodal LLMs as World Models for Driving
Viaarxiv icon

Toward Efficient Visual Gyroscopes: Spherical Moments, Harmonics Filtering, and Masking Techniques for Spherical Camera Applications

Add code
Apr 02, 2024
Viaarxiv icon