Picture for Erjia Xiao

Erjia Xiao

Team Xiaomi EV-AD VLA: Learning to Navigate Socially Through Proactive Risk Perception - Technical Report for IROS 2025 RoboSense Challenge Social Navigation Track

Add code
Oct 09, 2025
Viaarxiv icon

Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Generation Models

Add code
Mar 14, 2025
Figure 1 for Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Generation Models
Figure 2 for Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Generation Models
Figure 3 for Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Generation Models
Figure 4 for Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Generation Models
Viaarxiv icon

Tune In, Act Up: Exploring the Impact of Audio Modality-Specific Edits on Large Audio Language Models in Jailbreak

Add code
Jan 23, 2025
Viaarxiv icon

Uncovering Vision Modality Threats in Image-to-Image Tasks

Add code
Dec 07, 2024
Figure 1 for Uncovering Vision Modality Threats in Image-to-Image Tasks
Figure 2 for Uncovering Vision Modality Threats in Image-to-Image Tasks
Figure 3 for Uncovering Vision Modality Threats in Image-to-Image Tasks
Figure 4 for Uncovering Vision Modality Threats in Image-to-Image Tasks
Viaarxiv icon

Instruction-Tuned LLMs Succeed in Document-Level MT Without Fine-Tuning -- But BLEU Turns a Blind Eye

Add code
Oct 29, 2024
Figure 1 for Instruction-Tuned LLMs Succeed in Document-Level MT Without Fine-Tuning -- But BLEU Turns a Blind Eye
Figure 2 for Instruction-Tuned LLMs Succeed in Document-Level MT Without Fine-Tuning -- But BLEU Turns a Blind Eye
Figure 3 for Instruction-Tuned LLMs Succeed in Document-Level MT Without Fine-Tuning -- But BLEU Turns a Blind Eye
Figure 4 for Instruction-Tuned LLMs Succeed in Document-Level MT Without Fine-Tuning -- But BLEU Turns a Blind Eye
Viaarxiv icon

Multi-Floor Zero-Shot Object Navigation Policy

Add code
Sep 17, 2024
Figure 1 for Multi-Floor Zero-Shot Object Navigation Policy
Figure 2 for Multi-Floor Zero-Shot Object Navigation Policy
Figure 3 for Multi-Floor Zero-Shot Object Navigation Policy
Figure 4 for Multi-Floor Zero-Shot Object Navigation Policy
Viaarxiv icon

RRAM-Based Bio-Inspired Circuits for Mobile Epileptic Correlation Extraction and Seizure Prediction

Add code
Jul 29, 2024
Viaarxiv icon

Typography Leads Semantic Diversifying: Amplifying Adversarial Transferability across Multimodal Large Language Models

Add code
May 30, 2024
Figure 1 for Typography Leads Semantic Diversifying: Amplifying Adversarial Transferability across Multimodal Large Language Models
Figure 2 for Typography Leads Semantic Diversifying: Amplifying Adversarial Transferability across Multimodal Large Language Models
Figure 3 for Typography Leads Semantic Diversifying: Amplifying Adversarial Transferability across Multimodal Large Language Models
Figure 4 for Typography Leads Semantic Diversifying: Amplifying Adversarial Transferability across Multimodal Large Language Models
Viaarxiv icon

TriHelper: Zero-Shot Object Navigation with Dynamic Assistance

Add code
Mar 22, 2024
Figure 1 for TriHelper: Zero-Shot Object Navigation with Dynamic Assistance
Figure 2 for TriHelper: Zero-Shot Object Navigation with Dynamic Assistance
Figure 3 for TriHelper: Zero-Shot Object Navigation with Dynamic Assistance
Figure 4 for TriHelper: Zero-Shot Object Navigation with Dynamic Assistance
Viaarxiv icon

Typographic Attacks in Large Multimodal Models Can be Alleviated by More Informative Prompts

Add code
Feb 29, 2024
Viaarxiv icon