Title: TGS: Trajectory Generation and Selection using Vision Language Models in Mapless Outdoor Environments

Aug 07, 2024

Daeun Song, Jing Liang, Xuesu Xiao, Dinesh Manocha

Figures 1-4 for TGS: Trajectory Generation and Selection using Vision Language Models in Mapless Outdoor Environments

Abstract: We present a multi-modal trajectory generation and selection algorithm for real-world mapless outdoor navigation in challenging scenarios with unstructured off-road features like buildings, grass, and curbs. Our goal is to compute suitable trajectories that (1) satisfy the environment-specific traversability constraints and (2) follow human-like paths when navigating crosswalks, sidewalks, etc. Our formulation uses a Conditional Variational Autoencoder (CVAE) generative model enhanced with traversability constraints to generate multiple candidate trajectories for global navigation. We use Vision Language Models (VLMs) with a visual prompting approach, relying on their zero-shot semantic understanding and logical reasoning, to choose the best trajectory given the contextual information about the task. We evaluate our method in various outdoor scenes with wheeled robots and compare its performance with other global navigation algorithms. In practice, we observe at least a 3.35% improvement in traversability and a 20.61% improvement in human-like navigation in the generated trajectories in challenging outdoor navigation scenarios.
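The generate-then-select structure described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: `decode` is a toy stand-in for a trained CVAE decoder, `blocked` is a hypothetical traversability check, and the `score` callable is a placeholder for the VLM's choice among candidates. All names and parameters here are assumptions made for illustration only.

```python
import math
import random
from typing import Callable, List, Optional, Tuple

Point = Tuple[float, float]
Trajectory = List[Point]


def decode(z: float, goal: Point, steps: int = 10) -> Trajectory:
    """Toy stand-in for a CVAE decoder (assumption, not the paper's model).

    The scalar latent z bends a straight line from the origin to the goal.
    """
    path = []
    for i in range(1, steps + 1):
        t = i / steps
        # Lateral offset peaks mid-path and vanishes at the goal.
        offset = z * math.sin(math.pi * t)
        # Move along the goal direction, plus an offset along its perpendicular.
        x = goal[0] * t - offset * goal[1]
        y = goal[1] * t + offset * goal[0]
        path.append((x, y))
    return path


def generate_candidates(goal: Point, n: int, rng: random.Random) -> List[Trajectory]:
    """Sample n latents and decode each into one candidate trajectory."""
    return [decode(rng.gauss(0.0, 0.3), goal) for _ in range(n)]


def path_length(path: Trajectory) -> float:
    """Total Euclidean length of the path, starting from the origin."""
    points = [(0.0, 0.0)] + path
    return sum(math.dist(a, b) for a, b in zip(points, points[1:]))


def select(
    candidates: List[Trajectory],
    blocked: Callable[[Point], bool],
    score: Callable[[Trajectory], float],
) -> Optional[Trajectory]:
    """Drop candidates that touch a blocked point, then return the best-scored one.

    `score` is a hypothetical placeholder for the VLM-based selection step.
    """
    feasible = [c for c in candidates if not any(blocked(p) for p in c)]
    if not feasible:
        return None
    return max(feasible, key=score)


if __name__ == "__main__":
    rng = random.Random(0)
    candidates = generate_candidates(goal=(10.0, 0.0), n=5, rng=rng)
    # Hypothetical rule: everything with y > 0.5 is treated as non-traversable.
    best = select(candidates, lambda p: p[1] > 0.5, lambda c: -path_length(c))
    print("selected" if best is not None else "no feasible candidate")
```

In this sketch the scoring function simply prefers the shortest feasible path; in the approach the abstract describes, that role is played by a VLM reasoning over a visual prompt, which a scalar score only loosely approximates.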
