Picture for Haitong Wang

Haitong Wang

MLLM-Search: A Zero-Shot Approach to Finding People using Multimodal Large Language Models

Add code
Nov 27, 2024
Viaarxiv icon

Find Everything: A General Vision Language Model Approach to Multi-Object Search

Add code
Oct 01, 2024
Viaarxiv icon

NavFormer: A Transformer Architecture for Robot Target-Driven Navigation in Unknown and Dynamic Environments

Add code
Feb 09, 2024
Viaarxiv icon