Tianrui Guan

Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment

Nov 27, 2024

Robot Navigation Using Physically Grounded Vision-Language Models in Outdoor Environments

Sep 30, 2024

SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining

Sep 26, 2024

AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models

Jun 16, 2024

LOC-ZSON: Language-driven Object-Centric Zero-Shot Object Retrieval and Navigation

May 08, 2024

AGL-NET: Aerial-Ground Cross-Modal Global Localization with Varying Scales

Apr 04, 2024

AMCO: Adaptive Multimodal Coupling of Vision and Proprioception for Quadruped Robot Navigation in Outdoor Environments

Mar 20, 2024

Large Language Models and Causal Inference in Collaboration: A Comprehensive Survey

Mar 14, 2024

On the Safety Concerns of Deploying LLMs/VLMs in Robotics: Highlighting the Risks and Vulnerabilities

Feb 24, 2024

HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V, LLaVA-1.5, and Other Multi-modality Models

Oct 23, 2023