Picture for Xinyi Huang

Xinyi Huang

Silo-Bench: A Scalable Environment for Evaluating Distributed Coordination in Multi-Agent LLM Systems

Add code
Mar 01, 2026
Viaarxiv icon

Research on World Models Is Not Merely Injecting World Knowledge into Specific Tasks

Add code
Feb 02, 2026
Viaarxiv icon

Backdoor Attacks on Prompt-Driven Video Segmentation Foundation Models

Add code
Dec 26, 2025
Figure 1 for Backdoor Attacks on Prompt-Driven Video Segmentation Foundation Models
Figure 2 for Backdoor Attacks on Prompt-Driven Video Segmentation Foundation Models
Figure 3 for Backdoor Attacks on Prompt-Driven Video Segmentation Foundation Models
Figure 4 for Backdoor Attacks on Prompt-Driven Video Segmentation Foundation Models
Viaarxiv icon

VABench: A Comprehensive Benchmark for Audio-Video Generation

Add code
Dec 10, 2025
Viaarxiv icon

ZPD-SCA: Unveiling the Blind Spots of LLMs in Assessing Students' Cognitive Abilities

Add code
Aug 20, 2025
Figure 1 for ZPD-SCA: Unveiling the Blind Spots of LLMs in Assessing Students' Cognitive Abilities
Figure 2 for ZPD-SCA: Unveiling the Blind Spots of LLMs in Assessing Students' Cognitive Abilities
Figure 3 for ZPD-SCA: Unveiling the Blind Spots of LLMs in Assessing Students' Cognitive Abilities
Figure 4 for ZPD-SCA: Unveiling the Blind Spots of LLMs in Assessing Students' Cognitive Abilities
Viaarxiv icon

Evaluation Hallucination in Multi-Round Incomplete Information Lateral-Driven Reasoning Tasks

Add code
May 28, 2025
Figure 1 for Evaluation Hallucination in Multi-Round Incomplete Information Lateral-Driven Reasoning Tasks
Figure 2 for Evaluation Hallucination in Multi-Round Incomplete Information Lateral-Driven Reasoning Tasks
Figure 3 for Evaluation Hallucination in Multi-Round Incomplete Information Lateral-Driven Reasoning Tasks
Viaarxiv icon

The Eye of Sherlock Holmes: Uncovering User Private Attribute Profiling via Vision-Language Model Agentic Framework

Add code
May 25, 2025
Figure 1 for The Eye of Sherlock Holmes: Uncovering User Private Attribute Profiling via Vision-Language Model Agentic Framework
Figure 2 for The Eye of Sherlock Holmes: Uncovering User Private Attribute Profiling via Vision-Language Model Agentic Framework
Figure 3 for The Eye of Sherlock Holmes: Uncovering User Private Attribute Profiling via Vision-Language Model Agentic Framework
Figure 4 for The Eye of Sherlock Holmes: Uncovering User Private Attribute Profiling via Vision-Language Model Agentic Framework
Viaarxiv icon

JALMBench: Benchmarking Jailbreak Vulnerabilities in Audio Language Models

Add code
May 23, 2025
Figure 1 for JALMBench: Benchmarking Jailbreak Vulnerabilities in Audio Language Models
Figure 2 for JALMBench: Benchmarking Jailbreak Vulnerabilities in Audio Language Models
Figure 3 for JALMBench: Benchmarking Jailbreak Vulnerabilities in Audio Language Models
Figure 4 for JALMBench: Benchmarking Jailbreak Vulnerabilities in Audio Language Models
Viaarxiv icon

A Wireless Collaborated Inference Acceleration Framework for Plant Disease Recognition

Add code
May 05, 2025
Figure 1 for A Wireless Collaborated Inference Acceleration Framework for Plant Disease Recognition
Figure 2 for A Wireless Collaborated Inference Acceleration Framework for Plant Disease Recognition
Figure 3 for A Wireless Collaborated Inference Acceleration Framework for Plant Disease Recognition
Figure 4 for A Wireless Collaborated Inference Acceleration Framework for Plant Disease Recognition
Viaarxiv icon

Humanizing LLMs: A Survey of Psychological Measurements with Tools, Datasets, and Human-Agent Applications

Add code
Apr 30, 2025
Figure 1 for Humanizing LLMs: A Survey of Psychological Measurements with Tools, Datasets, and Human-Agent Applications
Figure 2 for Humanizing LLMs: A Survey of Psychological Measurements with Tools, Datasets, and Human-Agent Applications
Figure 3 for Humanizing LLMs: A Survey of Psychological Measurements with Tools, Datasets, and Human-Agent Applications
Figure 4 for Humanizing LLMs: A Survey of Psychological Measurements with Tools, Datasets, and Human-Agent Applications
Viaarxiv icon