Picture for Pengzhou Cheng

Pengzhou Cheng

Say One Thing, Do Another? Diagnosing Reasoning-Execution Gaps in VLM-Powered Mobile-Use Agents

Add code
Oct 02, 2025
Viaarxiv icon

Agent-ScanKit: Unraveling Memory and Reasoning of Multimodal Agents via Sensitivity Perturbations

Add code
Oct 02, 2025
Viaarxiv icon

VeriOS: Query-Driven Proactive Human-Agent-GUI Interaction for Trustworthy OS Agents

Add code
Sep 09, 2025
Viaarxiv icon

Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Scheduling System

Add code
Jun 10, 2025
Viaarxiv icon

On the Adaptive Psychological Persuasion of Large Language Models

Add code
Jun 07, 2025
Viaarxiv icon

Hidden Ghost Hand: Unveiling Backdoor Vulnerabilities in MLLM-Powered Mobile GUI Agents

Add code
May 20, 2025
Viaarxiv icon

GEM: Gaussian Embedding Modeling for Out-of-Distribution Detection in GUI Agents

Add code
May 19, 2025
Viaarxiv icon

Investigating the Adaptive Robustness with Knowledge Conflicts in LLM-based Multi-Agent Systems

Add code
Feb 21, 2025
Viaarxiv icon

Gracefully Filtering Backdoor Samples for Generative Large Language Models without Retraining

Add code
Dec 03, 2024
Figure 1 for Gracefully Filtering Backdoor Samples for Generative Large Language Models without Retraining
Figure 2 for Gracefully Filtering Backdoor Samples for Generative Large Language Models without Retraining
Figure 3 for Gracefully Filtering Backdoor Samples for Generative Large Language Models without Retraining
Figure 4 for Gracefully Filtering Backdoor Samples for Generative Large Language Models without Retraining
Viaarxiv icon

Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities

Add code
Jul 10, 2024
Figure 1 for Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities
Figure 2 for Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities
Figure 3 for Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities
Figure 4 for Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities
Viaarxiv icon