Picture for Yang Jiao

Yang Jiao

PR-Attack: Coordinated Prompt-RAG Attacks on Retrieval-Augmented Generation in Large Language Models via Bilevel Optimization

Add code
Apr 10, 2025
Viaarxiv icon

UniToken: Harmonizing Multimodal Understanding and Generation through Unified Visual Encoding

Add code
Apr 06, 2025
Viaarxiv icon

Revealing Microscopic Objects in Fluorescence Live Imaging by Video-to-video Translation Based on A Spatial-temporal Generative Adversarial Network

Add code
Feb 22, 2025
Viaarxiv icon

ATLAS: Autoformalizing Theorems through Lifting, Augmentation, and Synthesis of Data

Add code
Feb 08, 2025
Viaarxiv icon

Unlocking TriLevel Learning with Level-Wise Zeroth Order Constraints: Distributed Algorithms and Provable Non-Asymptotic Convergence

Add code
Dec 10, 2024
Viaarxiv icon

Tri-Level Navigator: LLM-Empowered Tri-Level Learning for Time Series OOD Generalization

Add code
Oct 09, 2024
Viaarxiv icon

EAGLE: Towards Efficient Arbitrary Referring Visual Prompts Comprehension for Multimodal Large Language Models

Add code
Sep 26, 2024
Figure 1 for EAGLE: Towards Efficient Arbitrary Referring Visual Prompts Comprehension for Multimodal Large Language Models
Figure 2 for EAGLE: Towards Efficient Arbitrary Referring Visual Prompts Comprehension for Multimodal Large Language Models
Figure 3 for EAGLE: Towards Efficient Arbitrary Referring Visual Prompts Comprehension for Multimodal Large Language Models
Figure 4 for EAGLE: Towards Efficient Arbitrary Referring Visual Prompts Comprehension for Multimodal Large Language Models
Viaarxiv icon

EventHallusion: Diagnosing Event Hallucinations in Video LLMs

Add code
Sep 25, 2024
Figure 1 for EventHallusion: Diagnosing Event Hallucinations in Video LLMs
Figure 2 for EventHallusion: Diagnosing Event Hallucinations in Video LLMs
Figure 3 for EventHallusion: Diagnosing Event Hallucinations in Video LLMs
Figure 4 for EventHallusion: Diagnosing Event Hallucinations in Video LLMs
Viaarxiv icon

Methodology and Real-World Applications of Dynamic Uncertain Causality Graph for Clinical Diagnosis with Explainability and Invariance

Add code
Jun 09, 2024
Viaarxiv icon

Eyes Can Deceive: Benchmarking Counterfactual Reasoning Abilities of Multi-modal Large Language Models

Add code
Apr 19, 2024
Viaarxiv icon