Picture for Jing Shao

Jing Shao

Speech-Audio Compositional Attacks on Multimodal LLMs and Their Mitigation with SALMONN-Guard

Add code
Nov 14, 2025
Viaarxiv icon

When AI Agents Collude Online: Financial Fraud Risks by Collaborative LLM Agents on Social Platforms

Add code
Nov 09, 2025
Viaarxiv icon

ExGRPO: Learning to Reason from Experience

Add code
Oct 02, 2025
Figure 1 for ExGRPO: Learning to Reason from Experience
Figure 2 for ExGRPO: Learning to Reason from Experience
Figure 3 for ExGRPO: Learning to Reason from Experience
Figure 4 for ExGRPO: Learning to Reason from Experience
Viaarxiv icon

STaR-Attack: A Spatio-Temporal and Narrative Reasoning Attack Framework for Unified Multimodal Understanding and Generation Models

Add code
Sep 30, 2025
Figure 1 for STaR-Attack: A Spatio-Temporal and Narrative Reasoning Attack Framework for Unified Multimodal Understanding and Generation Models
Figure 2 for STaR-Attack: A Spatio-Temporal and Narrative Reasoning Attack Framework for Unified Multimodal Understanding and Generation Models
Figure 3 for STaR-Attack: A Spatio-Temporal and Narrative Reasoning Attack Framework for Unified Multimodal Understanding and Generation Models
Figure 4 for STaR-Attack: A Spatio-Temporal and Narrative Reasoning Attack Framework for Unified Multimodal Understanding and Generation Models
Viaarxiv icon

The LLM Already Knows: Estimating LLM-Perceived Question Difficulty via Hidden Representations

Add code
Sep 16, 2025
Viaarxiv icon

Self-adaptive Dataset Construction for Real-World Multimodal Safety Scenarios

Add code
Sep 04, 2025
Figure 1 for Self-adaptive Dataset Construction for Real-World Multimodal Safety Scenarios
Figure 2 for Self-adaptive Dataset Construction for Real-World Multimodal Safety Scenarios
Figure 3 for Self-adaptive Dataset Construction for Real-World Multimodal Safety Scenarios
Figure 4 for Self-adaptive Dataset Construction for Real-World Multimodal Safety Scenarios
Viaarxiv icon

SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law

Add code
Jul 24, 2025
Figure 1 for SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law
Figure 2 for SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law
Figure 3 for SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law
Figure 4 for SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law
Viaarxiv icon

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report

Add code
Jul 22, 2025
Figure 1 for Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report
Figure 2 for Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report
Figure 3 for Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report
Figure 4 for Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report
Viaarxiv icon

Visual Contextual Attack: Jailbreaking MLLMs with Image-Driven Context Injection

Add code
Jul 03, 2025
Viaarxiv icon

A Comprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis

Add code
May 29, 2025
Figure 1 for A Comprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis
Figure 2 for A Comprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis
Figure 3 for A Comprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis
Figure 4 for A Comprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis
Viaarxiv icon