Xiyang Wu

Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey

Jan 04, 2025
Via arXiv

SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining

Sep 26, 2024
Via arXiv

AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models

Jun 16, 2024
Via arXiv

AGL-NET: Aerial-Ground Cross-Modal Global Localization with Varying Scales

Apr 04, 2024
Via arXiv

On the Safety Concerns of Deploying LLMs/VLMs in Robotics: Highlighting the Risks and Vulnerabilities

Feb 24, 2024
Via arXiv

LANCAR: Leveraging Language for Context-Aware Robot Locomotion in Unstructured Environments

Sep 30, 2023
Via arXiv

iPLAN: Intent-Aware Planning in Heterogeneous Traffic via Distributed Multi-Agent Reinforcement Learning

Jun 09, 2023
Via arXiv

FireCommander: An Interactive, Probabilistic Multi-agent Environment for Joint Perception-Action Tasks

Oct 31, 2020
Via arXiv