adversarial


Learning to Inject: Automated Prompt Injection via Reinforcement Learning

Add code
Feb 05, 2026
Viaarxiv icon

Detecting Misbehaviors of Large Vision-Language Models by Evidential Uncertainty Quantification

Add code
Feb 05, 2026
Viaarxiv icon

Verification of the Implicit World Model in a Generative Model via Adversarial Sequences

Add code
Feb 05, 2026
Viaarxiv icon

Synthesizing Realistic Test Data without Breaking Privacy

Add code
Feb 05, 2026
Viaarxiv icon

Limitations of SGD for Multi-Index Models Beyond Statistical Queries

Add code
Feb 05, 2026
Viaarxiv icon

EdgeMask-DG*: Learning Domain-Invariant Graph Structures via Adversarial Edge Masking

Add code
Feb 05, 2026
Viaarxiv icon

Wave-Trainer-Fit: Neural Vocoder with Trainable Prior and Fixed-Point Iteration towards High-Quality Speech Generation from SSL features

Add code
Feb 05, 2026
Viaarxiv icon

Formal Synthesis of Certifiably Robust Neural Lyapunov-Barrier Certificates

Add code
Feb 05, 2026
Viaarxiv icon

Private Prediction via Shrinkage

Add code
Feb 05, 2026
Viaarxiv icon

ShapePuri: Shape Guided and Appearance Generalized Adversarial Purification

Add code
Feb 05, 2026
Viaarxiv icon